Search Results - Inner Mongolia University Library

International Conference on Pattern Recognition

Authors: Biswajit Halder, Utpal Garain. Affiliations: Department of Information Technology, Mallabhum Institute of Technology, Bisnupur, West Bengal, India; Computer Vision & Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India

Answering a query such as when a particular document was printed is quite helpful in practice, especially for forensic purposes. This study attempts to develop a general framework that uses image processing and pattern recognition principles for ink age determination in printed documents. The approach first computationally extracts a set of suitable color features and then analyzes them to associate them with ink age. Finally, a neural net is designed and trained to determine the ages of unknown samples. The dataset used for the present experiment consists of the cover pages of LIFE magazines published between the 1930s and the 1970s (five decades). Test results show that this is a viable framework for involving machines in assisting human experts in determining the age of printed documents.

Keywords: Ink; Image color analysis; Feature extraction; Pixel; Forensics; Artificial neural networks; Accuracy

Bangla and English City Name Recognition for Indian Postal Automation

International Conference on Pattern Recognition

Authors: Umapada Pal, Ramit Kumar Roy, Fumitaka Kimura. Affiliations: Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India; Saint Xavier's College, Kolkata, India; Graduate School of Engineering, Mie University, Japan

ISBN: (print) 9781424475421

Because of the multilingual environment, the destination address block of a postal document of an Indian state may be written in two or more scripts. From a statistical analysis of Indian postal documents, we noted that about 22.04% of them are written in two scripts. Because these scripts are intermixed in postal address writing, it is very difficult to identify the script in which a city name is written. To avoid such identification difficulties, in this paper we propose a lexicon-driven bilingual (English and Bangla) city name recognition scheme for Indian postal automation. We obtained 93.19% accuracy when tested on 11875 city name samples.

Keywords: Cities and towns; Handwriting recognition; Feature extraction; Automation; Image segmentation; Cavity resonators; Dynamic programming

Devanagari and Bangla text extraction from natural scene images

ICDAR2009 - 10th International Conference on Document Analysis and Recognition

Authors: Bhattacharya, U.; Parui, S.K.; Mondal, S. Affiliations: Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata - 108, India

ISBN: (print) 9780769537252

With the increasing popularity of digital cameras attached to various handheld devices, many new computational challenges have gained significance. One such problem is the extraction of text from natural scene images captured by such devices. The extracted text can be sent to OCR or to a text-to-speech engine for recognition. In this article, we propose a novel and effective scheme based on analysis of connected components for extraction of Devanagari and Bangla text from camera-captured scene images. A common unique feature of these two scripts is the presence of a headline, and the proposed scheme uses mathematical morphology operations for its extraction. Additionally, we consider a few criteria for robust filtering of text components from such scene images. Moreover, we studied the problem of binarization of such scene images and observed that there are situations in which repeated binarization by a well-known global thresholding approach is effective. We tested our algorithm on a repository of 100 scene images containing Devanagari and/or Bangla text. © 2009 IEEE.

Keywords: Extraction

A complete system for detection and recognition of text in graphical documents using background information

4th International Conference on Computer Vision Theory and Applications, VISAPP 2009

Authors: Roy, Partha Pratim; Lladós, Josep; Pal, Umapada. Affiliations: Spain; Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata - 108, India

ISBN: (print) 9789898111692

Automatic retrieval of text and symbols in graphical documents (maps, engineering drawings) involves many challenges because the text lines are usually not parallel to each other. They are multi-oriented and curved in nature, since they annotate graphical curve lines and hence follow a curvilinear path too. Text and symbols also frequently touch or overlap graphical components (rivers, streets, border lines), which aggravates the problem. For OCR of such documents we need to extract individual text lines and their corresponding words and characters. In this paper, we propose a methodology to extract individual text lines and an approach for recognition of the extracted text characters from such complex graphical documents. The methodology is based on the foreground and background information of the text components. To take care of background information, the water reservoir concept and the convex hull have been used. For recognition of multi-font, multi-scale and multi-oriented characters, a Support Vector Machine (SVM) based classifier is applied. Circular ring and convex hull features have been used along with angular information of the contour pixels of the characters to make the features rotation and scale invariant.

Keywords: Optical character recognition

Fisher kernels for handwritten word-spotting

ICDAR2009 - 10th International Conference on Document Analysis and Recognition

Authors: Perronnin, Florent; Rodriguez-Serrano, Jose A. Affiliations: Textual and Visual Pattern Analysis, Xerox Research Centre Europe, France; Computer Vision Centre, Universitat Autonoma de Barcelona, Spain

ISBN: (print) 9780769537252

The Fisher kernel is a generic framework which combines the benefits of generative and discriminative approaches to pattern classification. In this contribution, we propose to apply this framework to handwritten word-spotting. Given a word image and a keyword generative model, the idea is to generate a vector which describes how the parameters of the keyword model should be modified to best fit the word image. This vector can then be used as the input of a discriminative classifier. We compare the performance of the proposed approach with that of a generative baseline on a challenging real-world dataset of customer letters. When the kernel used by the classifier is linear, the performance improvement is marginal but the proposed system is approximately 15 times faster than the baseline. If we use a non-linear kernel devised for this task, we obtain a 15% relative reduction of the error but the detector is approximately 15 times slower. © 2009 IEEE.

Keywords: Pattern recognition
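
Note on the abstract above: the "vector which describes how the parameters of the keyword model should be modified" is the gradient of the log-likelihood with respect to the model parameters (the Fisher score). The following is only a minimal sketch of that idea. It assumes a one-dimensional Gaussian as a stand-in generative model, which is not the keyword model used in the paper, and the function name `fisher_score` is hypothetical.

```python
def fisher_score(samples, mu, sigma):
    """Gradient of the average log-likelihood of `samples` under a
    1-D Gaussian N(mu, sigma^2), taken with respect to (mu, sigma).

    Assumption: a single Gaussian stands in for the generative model;
    the paper's keyword model is more elaborate.
    """
    n = len(samples)
    # d/d(mu) of log N(x | mu, sigma) = (x - mu) / sigma^2
    d_mu = sum((x - mu) / sigma ** 2 for x in samples) / n
    # d/d(sigma) of log N(x | mu, sigma) = ((x - mu)^2 - sigma^2) / sigma^3
    d_sigma = sum(((x - mu) ** 2 - sigma ** 2) / sigma ** 3 for x in samples) / n
    return (d_mu, d_sigma)


# Samples sitting above the model mean pull both parameters upward.
score = fisher_score([2.0, 4.0], mu=2.0, sigma=1.0)
```

The resulting fixed-length vector has one entry per model parameter, regardless of how many samples were observed, which is what lets it serve as input to a discriminative classifier.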

Fine classification of unconstrained handwritten Persian/Arabic numerals by removing confusion amongst similar classes

ICDAR2009 - 10th International Conference on Document Analysis and Recognition

Authors: Alaei, Alireza; Nagabhushan, P.; Pal, Umapada. Affiliations: Department of Studies in Computer Science, University of Mysore, Mysore 570 006, India; Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata-108, India

ISBN: (print) 9780769537252

In this paper, we propose two types of feature sets based on modified chain-code direction frequencies in the contour pixels of the input image and modified transition features (horizontal and vertical). A multi-level support vector machine (SVM) is proposed as the classifier to recognize isolated Persian digits. In the first level, we combine similarly shaped numerals into a single group and, as a result, obtain 7 classes instead of 10. We compute 196-dimensional chain-code direction frequencies as features to discriminate the 7 classes. In the second level, classes containing more than one numeral because of high resemblance in their shapes are considered. We use modified transition features (horizontal and vertical) for discriminating between two overlapping classes (0 and 1). To separate another overlapping group containing the three numerals 2, 3 and 4, we first eliminate the common part of these digits (the tail) and then compute chain-code features. We employ an SVM classifier for the classification and evaluate our scheme on 80,000 handwritten samples of Persian numerals [10]. Using 60,000 samples for training, we tested our scheme on the other 20,000 samples and obtained 99.02% accuracy. © 2009 IEEE.

Keywords: Support vector machines
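
Note on the abstract above: chain-code direction frequencies summarize a contour by how often each step direction occurs. The sketch below shows only the basic, unmodified form, using the standard 8-direction Freeman chain code over a closed contour. The direction numbering, the function names, and the single global histogram are assumptions for illustration; the paper's modified 196-dimensional feature is not reproduced here.

```python
# Assumed numbering of the 8 Freeman directions, keyed by (dx, dy) step.
DIRECTIONS = {
    (1, 0): 0, (1, 1): 1, (0, 1): 2, (-1, 1): 3,
    (-1, 0): 4, (-1, -1): 5, (0, -1): 6, (1, -1): 7,
}


def chain_code(contour):
    """Return the Freeman chain code of a closed contour.

    `contour` is an ordered list of (x, y) pixels in which consecutive
    points (and the last and first) are 8-connected neighbours.
    A step between non-adjacent points raises KeyError.
    """
    codes = []
    for i, (x0, y0) in enumerate(contour):
        x1, y1 = contour[(i + 1) % len(contour)]  # wrap around to close the contour
        codes.append(DIRECTIONS[(x1 - x0, y1 - y0)])
    return codes


def direction_frequencies(codes):
    """Normalized 8-bin histogram of chain-code directions."""
    counts = [0] * 8
    for c in codes:
        counts[c] += 1
    total = len(codes)
    if total == 0:
        return [0.0] * 8
    return [c / total for c in counts]


# A unit square traversed pixel by pixel uses four of the eight directions equally.
square = [(0, 0), (1, 0), (1, 1), (0, 1)]
features = direction_frequencies(chain_code(square))
```

In practice such histograms are usually computed per image block and concatenated, which is one way a feature of several hundred dimensions can arise.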

Machine authentication of security documents

ICDAR2009 - 10th International Conference on Document Analysis and Recognition

Authors: Garain, Utpal; Halder, Biswajit. Affiliations: Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, 203 B.T. Road, Kolkata 700108, India; WB, India

ISBN: (print) 9780769537252

This paper presents a pioneering effort towards machine authentication of security documents such as bank cheques, legal deeds and certificates, which fall under the same class as far as security is concerned. The proposed method first computationally extracts the security features from the document images, and then the notion of 'genuine' vs. 'duplicate' is defined in the feature space. Bank cheques are taken as a reference for conducting the present experiment. Support Vector Machines (SVMs) and Neural Networks (NN) are used to verify the authenticity of these cheques. Results on a test dataset of 200 samples show that the proposed approach achieves about 98% accuracy in discriminating duplicate cheques from genuine ones. This strongly attests to the viability of involving machines in authenticating security documents. © 2009 IEEE.

Keywords: Support vector machines

MICAI 2010 Organization and Conference Committee

Proceedings of Special Session - 9th Mexican International Conference on Artificial Intelligence: Advances in Artificial Intelligence and Applications, MICAI 2010, 2010, p. viii

Authors: Reyes-García, Carlos Alberto; Sidorov, Grigori; Hernández-Aguirre, Arturo; Arroyo, Gustavo; Murrieta, Rafael; Gonzalez, Jesus A.; Gonzalez, Miguel; Herrera, Oscar; Peña, Alejandro; Espinoza, Félix A. Castro; Cansino, Joel Suárez; Galicia-Haro, Sofia N.; Koeppen, Mario; Reyes-García, Carlos A.; Monroy, Raul; Gelbukh, Alexander; Mezura-Montes, Efrén; Leguizamón, Guillermo; Ramírez-Manzanares, Alonso; Castillo, Oscar; Fuentes, Olac; Sánchez, Gildardo. Affiliations: Natural Language Processing, Mexico; Machine Learning and Pattern Recognition, Mexico; Hybrid Intelligent Systems and Neural Networks, Mexico; Logic Reasoning Ontologies Knowledge Mgmt. Knowledge-Based Syst. Multi-agent Syst., Mexico; Data Mining, Mexico; Intelligent Tutoring Systems, Mexico; Evolutionary Algorithms and Other Naturally Inspired Algorithms, Mexico; Computer Vision and Image Processing, Mexico; Fuzzy Logic Uncertainty and Probabilistic Reasoning, Mexico; Bioinformatics and Medical Applications, Mexico; Robotics Planning and Scheduling, Mexico

Word-wise Thai and Roman script identification

ACM Transactions on Asian Language Information Processing, 2009, vol. 8, no. 3, pp. 1–21

Authors: Chanda, Sukalpa; Pal, Umapada; Terrades, Oriol Ramos. Affiliations: Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, 203 B. T. Road, Kolkatta-700108, India; Instituto Tecnologico de Informatica, Univ. Politécnica de Valencia, 46022 Valencia, Spain

In some Thai documents, a single text line of a printed document page may contain words in both Thai and Roman scripts. For Optical Character Recognition (OCR) of such a document page, it is better first to identify the Thai and Roman script portions and then to apply the individual OCR system of the respective script to each identified portion. In this article, an SVM-based method is proposed for word-wise identification of printed Roman and Thai scripts in a single line of a document page. The document is first segmented into lines, and the lines are then segmented into character groups (words). In the proposed scheme, we identify the script of a character group by combining different character features obtained from structural shape, profile behavior, component overlapping information, topological properties, the water reservoir concept, etc. In an experiment on 10,000 data items (words), we obtained 99.62% script identification accuracy with the proposed scheme. © 2009 ACM.

Keywords: Optical character recognition

Handwritten word-image retrieval with synthesized typed queries

ICDAR2009 - 10th International Conference on Document Analysis and Recognition

Authors: Rodríguez-Serrano, José A.; Perronnin, Florent. Affiliations: Computer Vision Centre, Universitat Autonoma de Barcelona, Spain; Textual and Visual Pattern Analysis, Xerox Research Centre Europe, France; Computer Science Department, Loughborough University, United Kingdom

ISBN: (print) 9780769537252

We propose a new method for handwritten word-spotting which does not require prior training or gathering examples for querying. More precisely, a model is trained "on the fly" with images rendered from the searched words in one or multiple computer fonts. To reduce the mismatch between the typed-text prototypes and the candidate handwritten images, we make use of: (i) local gradient histogram (LGH) features, which were shown to model word shapes robustly, and (ii) semi-continuous hidden Markov models (SC-HMM), in which the typed-text models are constrained to a "vocabulary" of handwritten shapes, thus learning a link between both types of data. Experiments show that the proposed method is effective in retrieving handwritten words, and the comparison to alternative methods reveals that the contribution of both the LGH features and the SC-HMM is crucial. To the best of the authors' knowledge, this is the first work to address this issue in a non-trivial manner. © 2009 IEEE.

Keywords: Image retrieval
