检索结果-内蒙古大学图书馆

International Conference on Document Analysis and recognition

作者： Harold Mouchere Christian Viard-Gaudin Dae Hwan Kim Jin Hyung Kim Utpal Garain IRCCyN/IVC — UMR CNRS 6597 Ecole Polytechnique de l'Université de Nantes France Division of Computer Science Korea Advanced Institute of Science and Technology Republic of Korea Computer Vision and Pattern Recognition (CVPR) Unit Indian Statistical Institute Kolkata India

ISBN: (纸本)9781457713507

A competition on recognition of online handwritten mathematical expressions is organized. recognition of mathematical expressions has been an attractive problem for the pattern recognition community because of the presence of enormous uncertainties and ambiguities as encountered during parsing of the two-dimensional structure of expressions. The goal of this competition is to bring out a state of the art for the related research. Three labs come together to organize the event and six other research groups participated the competition. The competition defines a standard format for presenting information, provides a training set of 921 expressions and supplies the underlying grammar for understanding the content of the training data. Participants were invited to submit their recognizers which were tested with a new set of 348 expressions. Systems are evaluated based on four different aspects of the recognition problem. However, the final rating of the systems is done based on their correct expression recognition accuracies. The best expression level recognition accuracy (on the test data) shown by the competing systems is 19.83% whereas a baseline system developed by one of the organizing groups reports an accuracy 22.41% on the same data set.

关键词： Grammar Handwriting recognition Training Accuracy Ink Organizing Communities

来源：评论

学校读者我要写书评

暂无评论

Bag-of-visual-words for signature-based multi-script document retrieval

arXiv

引用

arXiv 2018年

作者： Mandal, Ranju Roy, Partha Pratim Pal, Umapada Blumenstein, Michael School of Information and Communication Technology Griffith University QLD Australia Dept. of Computer Science & Engineering Indian Institute of Technology Roorkee India Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkata India School of Software University of Technology Sydney Australia

An end-to-end architecture for multi-script document retrieval using handwritten signatures is proposed in this paper. The user supplies a query signature sample and the system exclusively returns a set of documents that contain the query signature. In the first stage, a component-wise classification technique separates the potential signature components from all other components. A bag-of-visual-words powered by SIFT descriptors in a patch-based framework is proposed to compute the features and a Support Vector Machine (SVM)-based classifier was used to separate signatures from the documents. In the second stage, features from the foreground (i.e. signature strokes) and the background spatial information (i.e. background loops, reservoirs etc.) were combined to characterize the signature object to match with the query signature. Finally, three distance measures were used to match a query signature with the signature present in target documents for retrieval. The 'Tobacco' [1] document database and an Indian script database containing 560 documents of Devanagari (Hindi) and Bangla scripts were used for the performance evaluation. The proposed system was also tested on noisy documents and promising results were obtained. A comparative study shows that the proposed method outperforms the state-of-the-art approaches. Copyright © 2018, The Authors. All rights reserved.

关键词： Support vector machines

来源：评论

学校读者我要写书评

暂无评论

Product graph-based higher order contextual similarities for inexact subgraph matching

arXiv

引用

arXiv 2017年

作者： Dutta, Anjan Lladós, Josep Bunke, Horst Pal, Umapada Computer Vision Center Universitat Autònoma de Barcelona Edifici O Campus UAB Bellaterra Barcelona08193 Spain Institute of Computer Science University of Bern Neubrückstrasse 10 BernCH-3012 Switzerland Computer Vision and Pattern Recognition Unit Indian Statistical Institute 203 B.T.Road Kolkata-108 India

Many algorithms formulate graph matching as an optimization of an objective function of pairwise quantification of nodes and edges of two graphs to be matched. Pairwise measurements usually consider local attributes but disregard contextual information involved in graph structures. We address this issue by proposing contextual similarities between pairs of nodes. This is done by considering the tensor product graph (TPG) of two graphs to be matched, where each node is an ordered pair of nodes of the operand graphs. Contextual similarities between a pair of nodes are computed by accumulating weighted walks (normalized pairwise similarities) terminating at the corresponding paired node in TPG. Once the contextual similarities are obtained, we formulate subgraph matching as a node and edge selection problem in TPG. We use contextual similarities to construct an objective function and optimize it with a linear programming approach. Since random walk formulation through TPG takes into account higher order information, it is not a surprise that we obtain more reliable similarities and better discrimination among the nodes and edges. Experimental results shown on synthetic as well as real benchmarks illustrate that higher order contextual similarities add discriminating power and allow one to find approximate solutions to the subgraph matching problem. Copyright © 2017, The Authors. All rights reserved.

关键词： Graphic methods

来源：评论

学校读者我要写书评

暂无评论

A New Lightweight Attention-based Model for Emotion recognition on Distorted Social Media Face Images

TechRxiv

引用

TechRxiv 2023年

作者： Roy, Ayush Shivakumara, Palaiahnakote Pal, Umapada Gornale, Shivanand S. Liu, Cheng-Lin Jadavpur University India Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkata India Faculty of Computer Science and Information Technology University of Malaya Kuala Lumpur Malaysia Department of Computer Science Rani Channamma University Belagavi India Institute of Automation Chinese Academy of Sciences China

The recognition of human emotions remains a challenging task for social media images. This is due to distortions created by different social media conflict with the minute changes in facial expression. This study presents a new model called the Global Spectral-Spatial Attention Network (GSSAN), which leverages both local and global information simultaneously. The proposed model comprises a shallow Convolutional Neural Network (CNN) with an MBResNext block, which integrates the features extracted from MobileNet, ResNet, and DenseNet for extracting local features. In addition, to strengthen the discriminating power of the features, GSSAN incorporates Fourier features, which provide essential cues for minute changes in the face images. To test the proposed model for emotion recognition using social media images, we conduct experiments on two widely-used datasets: FER-2013 and AffectNet. The same benchmark datasets are uploaded and downloaded to create a distorted social media image dataset to test the proposed model. Experiments on distorted social media images dataset show that the model surpasses the accuracy of SOTA models by 0.69% for FER-2013 and 0.51% for AffectNet social mediad datasets. The same inference can be drawn from the experiments on standard datasets. © 2023, CC BY.

关键词： Face recognition

来源：评论

学校读者我要写书评

暂无评论

DELP-DAR system for license plate detection and recognition

arXiv

引用

arXiv 2019年

作者： Selmi, Zied Halima, Mohamed Ben Pal, Umapada Alimi, M. Adel REGIM-Lab: Research Groups in Intelligent Machines University of Sfax ENIS BP 1173 Sfax3038 Tunisia Computer vision and Pattern Recognition Unit Indian Statistical Institute 203 B. T. Road olkata700108 India

Automatic License Plate detection and recognition (ALPR) is a quite popular and active research topic in the field of computer vision, image processing and intelligent transport systems. ALPR is used to make detection and recognition processes more robust and efficient in highly complicated environments and backgrounds. Several research investigations are still necessary due to some constraints such as: completeness of numbering systems of countries, different colors, various languages, multiple sizes and varied fonts. For this, we present in this paper an automatic framework for License Plate (LP) detection and recognition from complex scenes. Our framework is based on mask region convolutional neural networks used for LP detection, segmentation and recognition. Although some studies have focused on LP detection, LP recognition, LP segmentation or just two of them, our study uses the maskr-cnn in the three stages. The evaluation of our framework is enhanced by four datasets for different countries and consequently with various languages. In fact, it tested on four datasets including images captured from multiple scenes under numerous conditions such as varied orientation, poor quality images, blurred images and complex environmental backgrounds. Extensive experiments show the robustness and efficiency of our suggested framework in all datasets. Copyright © 2019, The Authors. All rights reserved.

关键词： License plates (automobile)

来源：评论

学校读者我要写书评

暂无评论

Chebyshev-Harmonic-Fourier-Moments and Deep CNNs for Detecting Forged Handwriting

Chebyshev-Harmonic-Fourier-Moments and Deep CNNs for Detecti...

引用

International Conference on pattern recognition

作者： Lokesh Nandanwar Palaiahnakote Shivakumara Sayani Kundu Umapada Pal Tong Lu Daniel Lopresti Faculty of Computer Science and Information Technology University of Malaya Kuala Lumpur Malaysia Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkata India National Key Lab for Novel Software Technology Nanjing University Nanjing China Computer Science & Engineering Lehigh University Bethlehem PA USA

Recently developed sophisticated image processing techniques and tools have made easier the creation of high-quality forgeries of handwritten documents including financial and property records. To detect such forgeries of handwritten documents, this paper presents a new method by exploring the combination of Chebyshev-Harmonic-Fourier-Moments (CHFM) and deep Convolutional Neural Networks (D-CNNs). Unlike existing methods work based on abrupt changes due to distortion created by forgery operation, the proposed method works based on inconsistencies and irregular changes created by forgery operations. Inspired by the special properties of CHFM, such as its reconstruction ability by removing redundant information, the proposed method explores CHFM to obtain reconstructed images for the color components of the Original, Forged Noisy and Blurred classes. Motivated by the strong discriminative power of deep CNNs, for the reconstructed images of respective color components, the proposed method used deep CNNs for forged handwriting detection. Experimental results on our dataset and benchmark datasets (namely, ACPR 2019, ICPR 2018 FCD and IMEI datasets) show that the proposed method outperforms existing methods in terms of classification rate.

关键词： Image color analysis Chebyshev approximation Tools Benchmark testing Distortion Forgery pattern recognition

来源：评论

学校读者我要写书评

暂无评论

A new method based on bag of filters for character recognition in scene images by learning

A new method based on bag of filters for character recogniti...

引用

International Conference on Document Analysis and recognition

作者： Qisu Li Tong Lu Palaiahnakote Shivakumara Umapada Pal Chew Lim Tan National Key Lab for Novel Software Technology Nanjing University Nanjing China Faculty of Computer Science and Information Technology University of Malaya Kuala Lumpur Malaysia Computer Vision and Pattern Recognition Unit Indian Statistical Institute India School of Computing National University of Singapore Singapore

ISBN: (纸本)9781479918065

Achieving a good recognition rate for scene characters is a big challenge due to non-uniform illumination effects, perspective distortions, multiple colors or contrasts, different fonts and their various sizes, background or orientation variations, etc. Unlike the existing recognition methods that use binary information or the features extracted from different domains, the proposed method explores gray information in the form of a filter bank to extract the discriminative power for all the 62 scene character classes. We propose a sliding window (patch) operation over a character image for learning the global features, which represent the structures of character images of all the classes by reconstructing a filter bank from the original data. We introduce shareable constrains to activate class-specific filters from the filter bank. Further, we propose constraints by studying the nearest neighbor patches and exemplar selection to maximize the gap between inter-classes and minimize the gap between intra-classes. The method is evaluated and compared with several existing recognition methods in terms of character recognition rate. Experimental results show that the proposed method outperforms the existing methods.

关键词： Character recognition Image segmentation Filter banks Accuracy

来源：评论

学校读者我要写书评

暂无评论

Multiple Training - One Test Methodology for Handwritten Word-Script Identification

Multiple Training - One Test Methodology for Handwritten Wor...

引用

International Workshop on Frontiers in Handwriting recognition

作者： Miguel A. Ferrer Aythami Morales Nayara Rodríguez Umapada Pal Instituto Universitario para el Desarrollo Tecnológico y la Innovación en Comunicaciones Universidad de Las Palmas de Gran Canaria Spain Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkata India

Script identification is an important area in handwriting document image analysis field. The script identification at word level on documents written in multiple scripts is an open challenge for the scientific community and a real concern in countries with multiple official languages, e. G. The country like India. Such documents usually contain two scripts: the most of the document are written in the regional script while some words, acronyms or numbers are written in Roman script. In this case a word or even a character level script identification is required to locate the second script characters in the document. Here the major problem is the few script descriptors available for the script estimation which convey high error rates. The literatures try to address this problem by looking for more efficient descriptors. In this paper we propose a Multiple Training - One Test technique to alleviate this problem. Several classifiers are trained, each one with words of similar amount of information. A scale invariable word information index is defined for this sake. To identify the script of a query word, its word information index is worked out, and its script is identified with the most appropriate classifier. Accuracy improvements has been obtained with this promising technique, especially for the shorten words.

关键词： Training Histograms Feature extraction Indexes Accuracy Testing

来源：评论

学校读者我要写书评

暂无评论

New Sharpness Features for Image Type Classification Based on Textual Information

New Sharpness Features for Image Type Classification Based o...

引用

IAPR International Workshop on Document Analysis Systems, DAS

作者： K. S. Raghunandan Palaiahnakote Shivakumara G. Hemantha Kumar Umapada Pal Tong Lu Department of Studies in Computer Science University of Mysore Karnataka India Faculty of Computer Science and Information Technology University of Malaya Kuala Lumpur Malaysia Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkata India Narional Key Lab for Novel Software Technology Nanjing University Nanjing China

Achieving good recognition results from a single method for text lines in video/natural scene images captured by high resolution cameras or low resolution mobile cameras, and images in web pages, is often hard. In this paper, we propose new sharpness based features of textual portion of each input text line image using HSI color space for the classification of an input image into one of the four classes (video, scene, mobile or born digital). This helps in choosing an appropriate method based on the class type of the input text for its improved recognition rate. For a given input text line image, the proposed method obtains H, S and I images. Then Canny edge images are obtained for H, S and I spaces, which results in text candidates. We perform sliding window operation over the text candidate image of each text line of each color space to estimate new sharpness by calculating stroke width and gradient information. The sharpness values of the text lines of the three color spaces are then fed to k-means clustering with maximum, minimum and average guesses, which results in three respective clusters. The mean of each cluster for respective color spaces outputs a feature vector having nine feature values for image classification with the help of an SVM classifier. Experimental results on standard datasets, namely, ICDAR 2013, ICDAR 2015 video, ICDAR 2015 natural scene data, ICDAR 2013 born digital data and the images captured by a mobile camera (our own data) show that the proposed classification method helps in improving recognition results.

关键词： Mobile communication Text recognition Image edge detection Image resolution Image color analysis Digital images Optical character recognition software

来源：评论

学校读者我要写书评

暂无评论

Weighted-Gradient Features for Handwritten Line Segmentation

Weighted-Gradient Features for Handwritten Line Segmentation

引用

International Conference on pattern recognition

作者： Vijeta Khare Palaiahnakote Shivakumara B.J. Navya G.C. Swetha D. S. Guru Umapada Pal Tong Lu Faculty of Computer Science and Information Technology University of Malaya Kuala Lumpur Malaysia Department of Studies in Computer Science University of Mysore Karnataka India Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkata India National Key Lab for Novel Software Technology Nanjing University Nanjing China

Text line segmentation from handwritten documents is challenging when a document image contains severe touching. In this paper, we propose a new idea based on Weighted-Gradient Features (WGF) for segmenting text lines. The proposed method finds the number of zero crossing points for every row of Canny edge image of the input one, which is considered as the weights of respective rows. The weights are then multiplied with gradient values of respective rows of the image to widen the gap between pixels in the middle portion of text and the other portions. Next, k-means clustering is performed on WGF to classify middle and other pixels of text. The method performs morphological operation to obtain word components as patches for the result of clustering. The patches in both the clusters are matched to find common patch areas, which helps in reducing touching effect. Then the proposed method checks linearity and non-linearity iteratively based on patch direction to segment text lines. The method is tested on our own and standard datasets, namely, Alaei, ICDAR 2013 robust competition on handwriting context and ICDAR 2015-HTR, to evaluate the performance. Further, the method is compared with the state of art methods to show its effectiveness and usefulness.

关键词： Image segmentation Image edge detection Linearity Writing Morphological operations Handwriting recognition Image restoration

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：