Search Results - Inner Mongolia University Library

International IEEE Conference on Signal-Image Technologies and Internet-Based System

Authors: Zumra Malik, Ali Mirza, Akram Bennour, Imran Siddiqi, Chawki Djeddi. Affiliations: Center of Computer Vision & Pattern Recognition, Bahria University, Islamabad, Pakistan; Department of Mathematics and Computer Science, Larbi Tebessi University, Tebessa, Algeria

This paper presents a system for script recognition of the text appearing in video frames. The textual content in videos is generally extracted and recognized for the development of text-based indexing and retrieval systems. If the text in videos appears only in a single script, the output of the text detector is fed directly to a video Optical Character Recognition (OCR) system for recognition. However, where text may appear in multiple scripts, a script recognition module is required to identify the script of the text so that it can be processed by the respective OCR. We propose a video script recognition system that treats text in each script as a unique texture. A number of texture measures are extracted from text blocks, and an artificial neural network is trained to distinguish between different scripts. The system, evaluated on video text blocks in five different scripts (Arabic, English, Urdu, Hindi and Chinese), reported promising recognition rates. In addition to the performance of individual textural features, different combinations of texture measures were investigated, which yielded interesting results.

Keywords: Text recognition; Image recognition; Histograms; Optical character recognition software; Fractals; Character recognition; Indexing

Advances in Biometric Person Authentication 1

Series: Lecture Notes in Computer Science

Year: 1000

Authors: Stan Z. Li, Zhenan Sun, Tieniu Tan, Sharath Pankanti, Gérard Chollet, David Zhang

Gender Classification from Offline Handwriting Images Using Textural Features

International Workshop on Frontiers in Handwriting Recognition

Authors: Ali Mirza, Momina Moetesum, Imran Siddiqi, Chawki Djeddi. Affiliations: Center of Computer Vision and Pattern Recognition, Bahria University, Islamabad, Pakistan; LAMIS Laboratory, Larbi Tebessi University, Tebessa, Algeria

ISBN: (print) 9781509009824

Prediction of gender and other demographic attributes of individuals from handwriting samples offers an interesting basic as well as applied research problem. The correlation between gender and the visual appearance of handwriting has been validated by a number of studies, and the present study is based on the same idea. We exploit textural measurements as the discriminating attribute between male and female writings. The textural information in a writing sample is captured by applying a bank of Gabor filters to the image of the handwriting. The mean and standard deviation values of the filter responses are collected in a matrix, and the Fourier transform of the matrix is used as a feature. Classification is carried out using a feed-forward neural network. The proposed technique, evaluated on a subset of the QUWI database, realized promising results under different experimental settings.

Keywords: Writing; Databases; Feature extraction; Visualization; Training; Correlation; Standards

A New Method for Handwritten Scene Text Detection in Video

International Workshop on Frontiers in Handwriting Recognition

Authors: Palaiahnakote Shivakumara, Anjan Dutta, Umapada Pal, Chew Lim Tan. Affiliations: School of Computing, National University of Singapore, Singapore; Computer Vision Center, Universitat Autònoma de Barcelona, Barcelona, Spain; Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, India

There are many video images in which handwritten text may appear. Handwritten scene text detection in video is therefore essential and useful for many applications, such as efficient indexing and retrieval. There are also many video frames in which text lines may be multi-oriented. To the best of our knowledge, there is no work on detection of multi-oriented handwritten text in video. In this paper, we present a new method based on maximum color difference and boundary growing for detection of multi-oriented handwritten scene text in video. The method computes the maximum color difference for the average of the R, G and B channels of the original frame to enhance the text information. The output of the maximum color difference is fed to a K-means algorithm with K = 2 to separate text and non-text clusters. Text candidates are obtained by intersecting the text cluster with the Sobel output of the original frame. To tackle the fundamental problem of different orientations and skews of handwritten text, a boundary growing method based on a nearest neighbor concept is employed. We evaluate the proposed method by testing on our own handwritten text database and on publicly available video data (Hua's data). Experimental results obtained from the proposed method are promising.

Keywords: Pixel; Image edge detection; Image color analysis; Graphics; Clustering algorithms; Image resolution; Measurement
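
The enhancement and clustering steps described in the abstract above can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not define "maximum color difference" precisely, so the sliding-window max-minus-min used here is a guess, and the names `max_difference`, `two_means`, and `text_mask` are hypothetical. The Sobel intersection and boundary growing steps are omitted.

```python
from typing import List, Tuple

Frame = List[List[Tuple[int, int, int]]]  # rows of (R, G, B) pixels


def max_difference(gray: List[List[float]], radius: int = 1) -> List[List[float]]:
    """Max minus min intensity in a square window (assumed definition)."""
    h, w = len(gray), len(gray[0])
    out = [[0.0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            window = [gray[rr][cc]
                      for rr in range(max(0, r - radius), min(h, r + radius + 1))
                      for cc in range(max(0, c - radius), min(w, c + radius + 1))]
            out[r][c] = max(window) - min(window)
    return out


def two_means(values: List[float], iterations: int = 20) -> float:
    """1-D K-means with K = 2; returns the threshold between the two centers."""
    lo, hi = min(values), max(values)
    for _ in range(iterations):
        threshold = (lo + hi) / 2
        low = [v for v in values if v <= threshold]
        high = [v for v in values if v > threshold]
        if not high:  # all values fell into one cluster
            break
        lo, hi = sum(low) / len(low), sum(high) / len(high)
    return (lo + hi) / 2


def text_mask(frame: Frame) -> List[List[bool]]:
    """True where a pixel falls in the high-contrast (candidate text) cluster."""
    gray = [[sum(px) / 3 for px in row] for row in frame]  # average of R, G, B
    diff = max_difference(gray)
    threshold = two_means([v for row in diff for v in row])
    return [[v > threshold for v in row] for row in diff]
```

On a frame whose left side is flat and whose right side alternates between dark and bright pixels, `text_mask` marks only the high-contrast side as candidate text.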

Non-deterministic behavior of ranking-based metrics when evaluating embeddings

arXiv, 2018

Authors: Nicolaou, Anguelos; Dey, Sounak; Christlein, Vincent; Maier, Andreas; Karatzas, Dimosthenis. Affiliations: Computer Vision Center, Edificio O, Campus UAB, Bellaterra 08193, Spain; Pattern Recognition Lab, Friedrich-Alexander-Universitat Erlangen-Nurnberg

Embedding data into vector spaces is a very popular strategy of pattern recognition methods. When distances between embeddings are quantized, performance metrics become ambiguous. In this paper, we present an analysis of the ambiguity quantized distances introduce and provide bounds on the effect. We demonstrate that it can have a measurable effect in empirical data in state-of-the-art systems. We also approach the phenomenon from a computer security perspective and demonstrate how someone being evaluated by a third party can exploit this ambiguity and greatly outperform a random predictor without even access to the input data. We also suggest a simple solution making the performance metrics, which rely on ranking, totally deterministic and impervious to such exploits. Copyright © 2018, The Authors. All rights reserved.

Keywords: Embeddings
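
The ambiguity described in the abstract above can be illustrated with a small Python sketch: when several retrieved items share the same quantized distance, their relative order is arbitrary, so average precision can land anywhere between a worst-case and a best-case value. The function names are hypothetical, and always reporting the worst-case ordering is just one possible deterministic convention, assumed here for illustration; the abstract does not say which fix the authors propose.

```python
from typing import List, Tuple


def average_precision(relevance: List[bool]) -> float:
    """Average precision of a ranked list of relevance flags."""
    hits = 0
    total = 0.0
    for rank, is_relevant in enumerate(relevance, start=1):
        if is_relevant:
            hits += 1
            total += hits / rank
    return total / hits if hits else 0.0


def ap_bounds(distances: List[int], relevant: List[bool]) -> Tuple[float, float]:
    """Worst and best average precision over all orderings of tied distances."""
    items = list(zip(distances, relevant))
    # Within a group of equal distances, put irrelevant items first (worst case)
    # or relevant items first (best case). False sorts before True.
    worst = [rel for _, rel in sorted(items, key=lambda t: (t[0], t[1]))]
    best = [rel for _, rel in sorted(items, key=lambda t: (t[0], not t[1]))]
    return average_precision(worst), average_precision(best)


# Two pairs of tied distances, one relevant item in each pair.
low, high = ap_bounds([1, 1, 2, 2], [True, False, True, False])
print(low, high)
```

With these four items the worst-case ordering gives an average precision of 0.5 and the best-case ordering gives 5/6, even though the distances are identical in both cases.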

Multi-Oriented English Text Line Extraction Using Background and Foreground Information

IAPR International Workshop on Document Analysis Systems, DAS

Authors: Partha Pratim Roy, Umapada Pal, Josep Lladós, Fumitaka Kimura. Affiliations: Computer Vision Center, Universitat Autònoma Barcelona, Bellaterra, Spain; Indian Statistical Institute, Computer Vision and Pattern Recognition Unit, Kolkata, India; Graduate School of Engineering, Mie University, Mie, Japan

In graphical documents (maps, engineering drawings), artistic documents and similar printed materials, text lines are often not parallel to each other but multi-oriented and curved. For OCR of such documents, individual text lines must first be extracted. Extraction of individual text lines from multi-oriented and/or curved text documents is a difficult problem. In this paper, we propose a novel method to extract individual text lines from such document pages, based on the foreground and background information of the characters of the text. To capture background information, the water reservoir concept is used. In the proposed scheme, individual components are first detected and grouped into 3-character clusters using their inter-component distance, size and positional information. Using a graph-based approach, the initial 3-character clusters are merged into larger cluster groups. Using inter-character background information, the orientations of the extreme characters of a larger cluster are determined, and based on these orientations, two candidate regions are formed from the cluster. Finally, with the help of these candidate regions, individual lines are extracted. The experiments yielded encouraging results.

Keywords: Data mining; Water resources; Reservoirs; Optical character recognition software; Image segmentation; Character recognition; Text analysis; Information analysis; Pattern analysis; Pattern recognition

Special Issue on Face Presentation Attack Detection

IEEE Transactions on Biometrics, Behavior, and Identity Science, 2021, Vol. 3, No. 3, pp. 282-284

Authors: Wan, Jun; Escalera, Sergio; Escalante, Hugo Jair; Guo, Guodong; Li, Stan Z. Affiliations: National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China; Computer Vision Center, Universitat de Barcelona, Barcelona 08007, Spain; Instituto Nacional de Astrofísica, Óptica y Electrónica, Puebla 72840, Mexico; Institute of Deep Learning, Baidu Research, Beijing 100193, China; Center for AI Research and Innovation, Westlake University, Hangzhou 310024, China

Face presentation attack detection, also termed Face Anti-Spoofing (FAS) [items 1) and 2) in the Appendix], is a hot and challenging research topic that has received much attention from the computer vision and pattern recognition communities. Owing to the development of deep learning and big data, advances in this and related fields have accelerated considerably in recent years. However, there are still several challenging tasks that deserve attention from the community, for instance techniques robust to unknown spoofing attacks, cross-domain generalization, and multi-modal fusion in images and video sequences. We edited this special issue with the goal of compiling the latest progress in the field and identifying promising research opportunities on FAS. © 2019 IEEE.

Keywords: Special issues and sections; Forgery; Information integrity; Face recognition; Streaming media; Feature extraction

Convex hull based approach for multi-oriented character recognition from graphical documents

International Conference on Pattern Recognition

Authors: Partha Pratim Roy, Umapada Pal, Josep Llados, Fumitaka Kimura. Affiliations: Computer Vision Center, Universitat Autònoma De Barcelona, Bellaterra, Spain; Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India; Graduate School of Engineering, Mie University, Mie, Japan

ISBN: (print) 9781424421749

In this paper, we present a scheme for recognition of English characters in multi-scale and multi-oriented environments. Graphical documents such as maps contain text lines that appear in different orientations. Sometimes, the characters in a single word follow a curvilinear path to annotate curved graphical lines. For recognition of such multi-scale and multi-oriented characters, a Support Vector Machine (SVM) based scheme is presented. The feature used is invariant to character orientation. Circular ring and convex hull features are used along with angular information of the contour pixels of the character to make the feature rotation invariant. We tested the proposed scheme on two different datasets. Combining the circular ring and convex hull features, we obtained 96.73% and 99.56% accuracy on these two datasets.

Keywords: Character recognition; Pattern recognition; Support vector machines; Frequency; Text recognition; Histograms; Clocks; Rivers; Optical character recognition software; Testing

Combination of product graph and random walk kernel for symbol spotting in graphical documents

International Conference on Pattern Recognition

Authors: Anjan Dutta, Jaume Gibert, Josep Lladós, Horst Bunke, Umapada Pal. Affiliations: Computer Vision Center, Universitat Autònoma de Barcelona, Barcelona, Spain; Institute of Computer Science and Applied Mathematics, Universitat Bern, Bern, Switzerland; Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India

ISBN: (print) 9781467322164

This paper explores the use of a product graph for spotting symbols in graphical documents. The product graph is used to find the candidate subgraphs or components in the input graph that contain paths similar to those of the query graph. The acute angle between two edges and their length ratio are used as the node labels. In a second step, each candidate subgraph in the input graph is assigned a distance measure computed by a random walk kernel: the minimum of the distances from the component to all the components of the model graph. This distance measure is then used to eliminate dissimilar components. The remaining neighboring components are grouped, and the grouped zone is considered a retrieval zone for a symbol similar to the queried one. The entire method works online, i.e., it does not need any preprocessing step. The present paper reports the initial results of the method, which are very encouraging.

Keywords: Kernel; Computational modeling; Pattern recognition; Databases; Labeling; Equations; Performance evaluation
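
As a rough illustration of the product graph idea in the abstract above, the Python sketch below builds the direct (tensor) product of two small undirected labeled graphs, keeping only node pairs whose labels match, and counts walks in the result. Walks in the product graph correspond to walks that exist in both input graphs, which is the quantity a random walk kernel accumulates. The function names are hypothetical, plain string labels stand in for the angle and length-ratio labels mentioned in the abstract, and the paper's actual construction and distance measure may differ.

```python
from itertools import product
from typing import Dict, Hashable, List, Tuple

Edge = Tuple[Hashable, Hashable]


def product_graph(edges_a: List[Edge], labels_a: Dict[Hashable, str],
                  edges_b: List[Edge], labels_b: Dict[Hashable, str]
                  ) -> Dict[Tuple[Hashable, Hashable], list]:
    """Tensor product of two undirected graphs, restricted to equal labels."""
    nodes = [(u, v) for u, v in product(labels_a, labels_b)
             if labels_a[u] == labels_b[v]]
    node_set = set(nodes)
    adjacency = {n: [] for n in nodes}
    for u1, u2 in edges_a:
        for v1, v2 in edges_b:
            # An undirected edge pair can be aligned in two ways.
            for a, b in (((u1, v1), (u2, v2)), ((u1, v2), (u2, v1))):
                if a in node_set and b in node_set:
                    adjacency[a].append(b)
                    adjacency[b].append(a)
    return adjacency


def count_walks(adjacency: Dict, length: int) -> int:
    """Number of walks with `length` edges, over all start nodes."""
    counts = {n: 1 for n in adjacency}
    for _ in range(length):
        counts = {n: sum(counts[m] for m in adjacency[n]) for n in adjacency}
    return sum(counts.values())


# Input graph: path 0-1-2 with labels p, q, p. Query graph: edge a-b with labels p, q.
adj = product_graph([(0, 1), (1, 2)], {0: "p", 1: "q", 2: "p"},
                    [("a", "b")], {"a": "p", "b": "q"})
print(count_walks(adj, 1), count_walks(adj, 2))
```

A weighted sum of such walk counts over increasing lengths is the usual form of a random walk kernel; the weighting scheme is left out here.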

Edgy salient local binary patterns in inter-plane relationship for image retrieval in Diabetic Retinopathy

Procedia Computer Science, 2017, Vol. 115, pp. 440-447

Authors: Gajanan M. Galshetwar, Laxman M. Waghmare, Anil B. Gonde, Subrahmanyam Murala. Affiliations: Center of Excellence in Signal and Image Processing (COESIP), Department of ECE, SGGSIET, Nanded, Maharashtra 431606, India; Computer Vision and Pattern Recognition Laboratory, Department of Electrical Engineering, IIT Ropar, Rupnagar 140001, India

In this paper, a novel approach for content-based image retrieval (CBIR) in diabetic retinopathy (DR) is proposed. The approach combines salient point selection with an inter-plane relationship technique. Salient points are selected from the edge image, and Local Binary Patterns (LBPs) are then calculated using the inter-plane relationship, with each salient point as the center pixel. Our approach improved the results because color features were used in combination with LBP features. Experiments were carried out on the MESSIDOR database of 1200 retinal images; the proposed approach achieves an average precision of 57.82%, compared with 53.70% for the earlier approach.

Keywords: Content-Based Image Retrieval (CBIR); Diabetic Retinopathy (DR); Edgy Salient points; Local Binary Patterns (LBPs)
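
For readers unfamiliar with Local Binary Patterns, the Python sketch below computes the basic 8-neighbour LBP code at chosen center pixels of a single grayscale plane and collects the codes into a 256-bin histogram. It only illustrates the standard LBP operator: the salient point selection and the inter-plane relationship described in the abstract above are not reproduced, and the names `lbp_code` and `lbp_histogram` and the bit ordering are assumptions.

```python
from typing import List, Tuple

# Offsets of the 8 neighbours, clockwise starting at the top-left pixel.
OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]


def lbp_code(image: List[List[int]], row: int, col: int) -> int:
    """8-bit LBP code: bit i is set when neighbour i is >= the center pixel."""
    center = image[row][col]
    code = 0
    for bit, (dr, dc) in enumerate(OFFSETS):
        if image[row + dr][col + dc] >= center:
            code |= 1 << bit
    return code


def lbp_histogram(image: List[List[int]],
                  points: List[Tuple[int, int]]) -> List[int]:
    """Histogram of LBP codes at the given interior points (e.g. salient points)."""
    histogram = [0] * 256
    for row, col in points:
        histogram[lbp_code(image, row, col)] += 1
    return histogram


image = [[5, 9, 1],
         [4, 6, 7],
         [2, 8, 3]]
print(lbp_code(image, 1, 1))  # neighbours 9, 7 and 8 exceed the center 6
```

For the 3x3 example, the neighbours at bit positions 1, 3 and 5 are at least as large as the center, so the code is 2 + 8 + 32 = 42. The points passed to `lbp_histogram` must not lie on the image border, since every neighbour has to exist.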
