检索结果-内蒙古大学图书馆

International Conference on Document Analysis and recognition

作者： Utpal Garain T. Paquet L. Heutte Computer Vision & Pattern Recognition Unit Indian Statistical Institute Kolkata India Laboratoire PSI-FRE CNRS UFR des Sciences University of Rouen Mont Saint Aignan France

This paper proposes an adaptive method for separation of foreground and background in low quality color document images. A connected component labelling is initially implemented to capture the spatially connected similar color pixels. Next, dominant background components are determined to divide the entire image into number of grids each representing local uniformity in illumination, background, etc. Finally foreground parts are located using local information around them. Several color images of old historical documents including manuscripts of high importance are used in the experiment. Apart from a qualitative evaluation, results are quantitatively compared with one popular foreground/background separation technique.

关键词： Clustering algorithms Lighting Labeling Smoothing methods Image color analysis computer vision pattern recognition Image segmentation Internet Layout

来源：评论

学校读者我要写书评

暂无评论

Indian Multi-Script Full Pin-code String recognition for Postal Automation

Indian Multi-Script Full Pin-code String Recognition for Pos...

引用

International Conference on Document Analysis and recognition

作者： Umapada Pal Rami Kumar Roy Kaushik Roy Fumitaka Kimura Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkata India Department of Computer Science West Bengal State University Barasat India Graduate School of Engineering TSU Mie University Japan

ISBN: (纸本)9781424445004

Under three-language formula, the destination address block of postal document of an Indian state is generally written in three languages: English, Hindi and the State official language. Because of inter-mixing of these scripts in postal address writings, it is very difficult to identify the script by which a pin-code is written. Also, because of the writing style of different individuals some of the digits in a pin-code string may touch with its neighboring digits. Accurate segmentation of such touching components into individual digits is a difficult task. To avoid such difficulties, in this paper we proposed a tri-lingual (English, Hindi and Bangla) 6-digit full pin-code string recognition. We obtained 99.01% reliability from our proposed system when error and rejection rates are 0.83% and 15.27%, respectively.

关键词： Automation Natural languages pattern recognition Writing Statistics Optical character recognition software Text analysis pattern analysis computer vision Databases

来源：评论

学校读者我要写书评

暂无评论

Lexicon Reduction Technique for Bangla Handwritten Word recognition

Lexicon Reduction Technique for Bangla Handwritten Word Reco...

引用

IAPR International Workshop on Document Analysis Systems, DAS

作者： Tapan Kumar Bhowmik Utpal Roy Swapan K. Parui Faculty of Mathematics and Natural Sciences University of Groningam Netherlands Department of Computer and System Sciences Visva Bharati University Santiniketan India Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkata India

In this paper we introduce a stroke based lexicon reduction technique in order to reduce the search space for recognition of handwritten words. The principle of this technique involves mainly two aspects of a word image to constitute a feature vector: one is word-length and the other is shape of the word. The length of the word image is represented by the number of specific vertical strokes present in the word image and, on the other hand, the shape of a word image is realized with the combination of both horizontal and vertical strokes. The experiment has been carried out with a database of 35,700 off-line handwritten Bangla word images. Though our proposed lexicon reduction technique is developed for recognition of Bangla handwritten words, its generalization property can easily be exploited for recognition of handwriting in other scripts also.

关键词： Handwriting recognition Hidden Markov models Shape Vectors Feature extraction Conferences

来源：评论

学校读者我要写书评

暂无评论

Structure Function Based Transform Features for Behavior-Oriented Social Media Image Classification 5th

Structure Function Based Transform Features for Behavior-Ori...

引用

5th Asian Conference on pattern recognition, ACPR 2019

作者： Krishnani, Divya Shivakumara, Palaiahnakote Lu, Tong Pal, Umapada Ramachandra, Raghavendra International Institute of Information Technology Naya Raipur Naya RaipurChhattisgarh India Faculty of Computer Science and Information Technology University of Malaya Kuala Lumpur Malaysia National Key Lab for Novel Software Technology Nanjing University Nanjing China Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkata India Faculty of Information Technology and Electrical Engineering Norwegian University of Science and Technology Trondheim Norway

ISBN: (纸本)9783030414030

Social media has become an essential part of people to reflect their day to day activities including emotions, feelings, threatening and so on. This paper presents a new method for the automatic classification of behavior-oriented images like Bullying, Threatening, Neuroticism-Depression, Neuroticism-Sarcastic, Psychopath and Extraversion of a person from social media images. The proposed method first finds facial key points for extracting features based on a face detection algorithm. Then the proposed method labels face regions as foreground and other than face region as background to define context between foreground and background information. To extract context, the proposed method explores Structural Function based Transform (SFBT) features, which study variations on pixel values. To increase discriminating power of the context features, the proposed method performs clustering to integrate the strength of the features. The extracted features are then fed to Support Vector Machines (SVM) for classification. Experimental results on a dataset of six classes show that the proposed method outperforms the existing methods in terms of confusion matrix and classification rate. © Springer Nature Switzerland AG 2020.

关键词： Classification (of information)

来源：评论

学校读者我要写书评

暂无评论

Reconnoitering the class distinguishing abilities of the features, to know them better

arXiv

引用

arXiv 2022年

作者： Sadhukhan, Payel Palit, Sarbani Sengupta, Kausik Institute for Advancing Intelligence Tcg Crest West Bengal Kolkata700091 India Computer Vision and Pattern Recognition Unit Indian Statistical Institute West Bengal Kolkata700108 India

The relevance of machine learning (ML) in our daily lives is closely intertwined with its explainability. Explainability can allow end-users to have a transparent and humane reckoning of a ML scheme's capability and utility. It will also foster the user's confidence in the automated decisions of a system. Explaining the variables or features to explain a model's decision is a need of the present times. We could not really find any work, which explains the features on the basis of their class-distinguishing abilities (specially when the real world data are mostly of multi-class nature). In any given dataset, a feature is not equally good at making distinctions between the different possible categorizations (or classes) of the data points. In this work, we explain the features on the basis of their class or category-distinguishing capabilities. We particularly estimate the class-distinguishing capabilities (scores) of the variables for pair-wise class combinations. We validate the explainability given by our scheme empirically on several realworld, multi-class datasets. We further utilize the class-distinguishing scores in a latent feature context and propose a novel decision making protocol. Another novelty of this work lies with a refuse to render decision option when the latent variable (of the test point) has a high class-distinguishing potential for the likely classes. © 2022, CC BY.

关键词： Decision making

来源：评论

学校读者我要写书评

暂无评论

ICFHR 2020 Competition on Short answer ASsessment and Thai student SIGnature and Name COMponents recognition and Verification (SASIGCOM 2020)

ICFHR 2020 Competition on Short answer ASsessment and Thai s...

引用

International Workshop on Frontiers in Handwriting recognition

作者： Abhijit Das Hemmaphan Suwanwiwat Umapada Pal Michael Blumenstein Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkata India Information Technology Academy James Cook University Cairns Australia School of Software University of Technology Sydney Australia

ISBN: (数字)9781728199665

ISBN: (纸本)9781728199672

This paper describes the results of the competition on Short answer ASsessment and Thai student SIGnature and Name COMponents recognition and Verification (SASIGCOM 2020) in conjunction with the 17th International Conference on Frontiers in Handwriting recognition (ICFHR 2020). The competition was aimed to automate the evaluation process short answer-based examination and record the development and gain attention to such system. The proposed competition contains three elements which are short answer assessment (recognition and marking the answers to short-answer questions derived from examination papers), student name components (first and last names) and signature verification and recognition. Signatures and name components data were collected from 100 volunteers. For the Thai signature dataset, there are 30 genuine signatures, 12 skilled and 12 simple forgeries for each writer. With Thai name components dataset, there are 30 genuine and 12 skilfully forged name components for each writer. There are 104 exam papers in the short answer assessment dataset, 52 of which were written with cursive handwriting; the rest of 52 papers were written with printed handwriting. The exam papers contain ten questions, and the answers to the questions were designed to be a few words per question. Three teams from distinguished labs submitted their systems. For short answer assessment, word spotting task was also performed. This paper analysed the results produced by their algorithms using a performance measure and defines a way forward for this subject of research. Both the datasets, along with some of the accompanying ground truth/baseline mask will be made freely available for research purposes via the TC10/TC11.

关键词： Task analysis Handwriting recognition Training Particle measurements Atmospheric measurements Information technology Writing

来源：评论

学校读者我要写书评

暂无评论

Effective Document Image Enhancement Using tokens-to-token Transformer Network

SSRN

引用

SSRN 2023年

作者： Biswas, Risab Roy, Swalpa Kumar Pal, Umapada Maharashtra Mumbai400066 India Department of Computer Science and Engineering Jalpaiguri Government Engineering College West Bengal735102 India Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkata700108 India

Document image enhancement is a fundamental and important stage for attaining the best performance in any document analysis assignment because there are many degradation situations that could harm document images, making it more difficult to recognize and analyze them. In this paper, we propose to employ a Tokens-to-Token Transformer net- work for document image enhancement, a novel encoder-decoder architecture based on a tokens-to-token vision transformer. The proposed architecture uses a tokens-to-token architecture in the encoder section. Each image is divided into a set of tokens with a de- fined length using the ViT model, which is then applied several times to model the global relationship between the tokens. However, the conventional tokenization of input data does not adequately reflect the crucial local structure between adjacent pixels of the input image, which results in low efficiency. Instead of using a simple ViT and hard splitting of images for the document image enhancement task, we employed a progressive tokeniza- tion technique to capture this local information from an image for achieving more ef- fective results. Experiments on various DIBCO and H-DIBCO benchmarks demonstrate that the proposed model outperforms the existing CNN and ViT-based state-of-the-art methods. In this research, the primary area of examination is the application of the pro- posed architecture to the task of document binarization. The source code will be made available at https://***/RisabBiswas/T2T-BinFormer. © 2023, The Authors. All rights reserved.

关键词： Image enhancement

来源：评论

学校读者我要写书评

暂无评论

Design of Unsupervised Feature Extraction System for On-line Bangla Handwriting recognition

Design of Unsupervised Feature Extraction System for On-line...

引用

IAPR International Workshop on Document Analysis Systems, DAS

作者： Volkmar Frinken Nilanjana Bhattacharya Umapada Pal Faculty of Information Science and Electrical Engineering Kyushu University Fukuoka-shi Japan Bose Institute Kolkatta India Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkatta India

Different systems for handwriting recognition use different features to represent the input text. Even after decades of research, no favorable decision on a best-practice exists and many features are carefully hand-crafted. To facilitate the design phase for on-line handwriting systems, in this paper, we propose an unsupervised feature generation approach based on dissimilarity space embedding (DSE) of local neighborhoods around the points along the trajectory. DSE has high capability of discriminative representation and hence beneficial for classification. We compare the approach with a state-of-the-art feature extraction method and demonstrate its superiority.

关键词： Feature extraction Prototypes Handwriting recognition Writing Hidden Markov models Character recognition Text recognition

来源：评论

学校读者我要写书评

暂无评论

Recognizing Bengali Word Images - A Zero-Shot Learning Perspective

Recognizing Bengali Word Images - A Zero-Shot Learning Persp...

引用

International Conference on pattern recognition

作者： Sukalpa Chanda Daniël Haitink Prashant Kumar Prasad Jochem Baas Umapada Pal Lambert Schomaker Østfold University College Norway Faculty of Science and Engineering. Bernoulli Institute for Mathematics Computer Science and Artificial Intelligence University of Groningen The Netherlands Computer Vision and Pattern Recognition Unit Indian Statistical Institute India

Zero-Shot Learning(ZSL) techniques could classify a completely unseen class, which it has never seen before during training. Thus, making it more apt for any real-life classification problem, where it is not possible to train a system with annotated data for all possible class types. This work investigates recognition of word images written in Bengali Script in a ZSL framework. The proposed approach performs Zero-Shot word recognition by coupling deep learned features procured from various CNN architectures along with 13 basic shapes/stroke primitives commonly observed in Bengali script characters. As per the notion of ZSL framework those 13 basic shapes are termed as “Signature/Semantic Attributes”. The obtained results are promising while evaluation was carried out in a Five-Fold cross-validation setup dealing with samples from 250 word classes.

关键词： Training Couplings Image recognition Shape Character recognition

来源：评论

学校读者我要写书评

暂无评论

A Kalman filtering induced heuristic optimization based partitional data clustering

arXiv

引用

arXiv 2019年

作者： Pakrashi, Arjun Chaudhuri, Bidyut B. Insight Centre for Data Analytics University College Dublin Ireland Computer Vision & Pattern Recognition Unit Indian Statistical Institute 203 B.T. Road Kolkata700108 India

Clustering algorithms have regained momentum with recent popularity of data mining and knowledge discovery approaches. To obtain good clustering in reasonable amount of time, various meta-heuristic approaches and their hybridization, sometimes with K-Means technique, have been employed. A Kalman Filtering based heuristic approach called Heuristic Kalman Algorithm (HKA) has been proposed a few years ago, which may be used for optimizing an objective function in data/feature space. In this paper at first HKA is employed in partitional data clustering. Then an improved approach named HKA-K is proposed, which combines the benefits of global exploration of HKA and the fast convergence of K-Means method. Implemented and tested on several datasets from UCI machine learning repository, the results obtained by HKA-K were compared with other hybrid meta-heuristic clustering approaches. It is shown that HKA-K is atleast as good as and often better than the other compared algorithms. Copyright © 2019, The Authors. All rights reserved.

关键词： Kalman filters

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：