检索结果-内蒙古大学图书馆

International Conference on pattern recognition

作者： Halder, Biswajit Garain, Utpal Dept. of Information Technology Mallabhum Institute of Technology Bisnupur WB India Computer Vision and Pattern Recognition Unit Indian Statistical Institute 203 B.T. Road Kolkata 700108 India

ISBN: (纸本)9780769541099

Answering to a query like when a particular document was printed is quite helpful in practice especially forensic purposes. This study attempts to develop a general framework that makes use of image processing and pattern recognition principles for ink age determination in printed documents. The approach, at first, computationally extracts a set of suitable color features and then analyzes them to properly associate them with ink age. Finally, a neural net is designed and trained to determine ages of unknown samples. The dataset used for the present experiment consists of the cover pages of LIFE magazines published in between 1930's and 70's (five decades). Test results show that a viable framework for involving machines in assisting human experts for determining age of printed documents. © 2010 IEEE.

关键词： Image processing

来源：评论

学校读者我要写书评

暂无评论

Face recognition - A one-shot learning perspective 15

Face recognition - A one-shot learning perspective

引用

15th International Conference on Signal Image Technology and Internet Based Systems, SISITS 2019

作者： Chanda, Sukalpa Gv, Asish Chakrapani Brun, Anders Hast, Anders Pal, Umapada Doermann, David Department of Information Technology Østfold University College Norway Computer Vision and Pattern Recognition Unit Indian Statistical Institute India Centre for Image Analysis Uppsala University Sweden Computer Science and Engineering University at Buffalo United States

ISBN: (纸本)9781728156866

Ability to learn from a single instance is something unique to the human species and One-shot learning algorithms try to mimic this special capability. On the other hand, despite the fantastic performance of Deep Learning-based methods on various image classification problems, performance often depends having on a huge number of annotated training samples per class. This fact is certainly a hindrance in deploying deep neural network-based systems in many real-life applications like face recognition. Furthermore, an addition of a new class to the system will require the need to re-train the whole system from scratch. Nevertheless, the prowess of deep learned features could also not be ignored. This research aims to combine the best of deep learned features with a traditional One-Shot learning framework. Results obtained on 2 publicly available datasets are very encouraging achieving over 90% accuracy on 5-way One-Shot tasks, and 84% on 50-way One-Shot problems. © 2019 IEEE.

关键词： Face recognition

来源：评论

学校读者我要写书评

暂无评论

A New Transformer-Based Approach for Text Detection in Shaky and Non-shaky Day-Night Video 1

引用

7th Asian Conference on pattern recognition, ACPR 2023

作者： Halder, Arnab Shivakumara, Palaiahnakote Pal, Umapada Lu, Tong Blumenstein, Michael Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkata India Faculty of Computer Science and Information Technology University of Malaya Kula Lumpur Malaysia Nanjing University Nanjing China University of Technology Sydney Sydney Australia

ISBN: (数字)9783031476372

ISBN: (纸本)9783031476365

Text detection in shaky and non-shaky videos is challenging because of variations caused by day and night videos. In addition, moving objects, vehicles, and humans in the video make the text detection problems more challenging in contrast to text detection in normal natural scene images. Motivated by the capacity of the transformer, we propose a new transformer-based approach for detecting text in both shaky and non-shaky day-night videos. To reduce the effect of object movement, poor quality, and other challenges mentioned above, the proposed work explores temporal frames for obtaining activation frames based on similarity and dissimilarity measures. For estimating similarity and dissimilarity, our method extracts luminance, contrast, and structural features. The activation frames are fed to the transformer which comprises an encoder, decoder, and feed-forward network for text detection in shaky and non-shaky day-night video. Since it is the first work, we create our own dataset for experimentation. To show the effectiveness of the proposed method, experiments are conducted on a standard dataset called the ICDAR-2015 video dataset. The results on our dataset and standard dataset show that the proposed model is superior to state-of-the-art methods in terms of recall, precision, and F-measure. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2023.

关键词： Chemical activation

来源：评论

学校读者我要写书评

暂无评论

Word-wise handwritten Persian and Roman script identification

Word-wise handwritten Persian and Roman script identificatio...

引用

International Conference on Frontiers in Handwriting recognition

作者： Roy, Kaushik Alaei, Alireza Pal, Umapada Department of Computer Science West Bengal State University Kolkata-126 India Department of Studies in Computer Science University of Mysore Mysore 570 006 India Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkata-108 India

ISBN: (纸本)9780769542218

Most of the countries use bi-script documents. This is because every country uses its own national language and English as second/foreign language. Therefore, bi-lingual document with one language being the English and other being the national language is very common. Postal documents are a very good example of such bi-lingual/script document. This paper deals with word-wise handwritten script identification from bi-script documents written in Persian and Roman. In the proposed scheme, simple but fast computable set of 12 features based on fractal dimension, position of small component, topology etc. are used and a set of classifiers are employed for script identification experiments. We tested our scheme on a dataset of 5000 handwritten Persian and English words and 99.20% of correct script identification is obtained. © 2010 IEEE.

关键词： Fractal dimension

来源：评论

学校读者我要写书评

暂无评论

A Global-to-Local Approach to Binarization of Degraded Document Images

A Global-to-Local Approach to Binarization of Degraded Docum...

引用

International Conference on pattern recognition

作者： Barun Biswas Ujjwal Bhattacharya Bidyut B. Chaudhuri Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkata 108 India

ISBN: (纸本)9781479952106

This article deals with binarization of degraded document images. In the proposed approach, Canny edge image of the input degraded document image is obtained after blurring it with a Gaussian filter. Next, the gray values of the two pixels of the input image at the left and right of each edge pixel are noted to form a histogram of these gray values which possesses two distinct peaks and the lowest valley between them provides the global threshold value. Each pixel with gray value greater than the above threshold is turned as background pixel. A small square window is considered around each non-background pixel and certain simple statistics are computed on the gray values of the pixels of this small window based on which the said pixel is turned either background or foreground. Such a local thresholding method at the latter stage can efficiently handle various degradations in the document. The binarized image so obtained is finally subjected to certain common post-processing operations. The proposed method has been compared with a few existing binarization techniques.

关键词： Image edge detection Histograms Degradation Smoothing methods PSNR Image color analysis

来源：评论

学校读者我要写书评

暂无评论

A cascaded genetic algorithm for efficient optimization and pattern matching 2nd

引用

2nd International Conference on Advances in pattern recognition, ICAPR 2001

作者： Garai, Gautam Computer Division Saha Institute of Nuclear Physics 1/AF Bidhannagar Calcutta700064 India Computer Vision and Pattern Recognition Unit Indian Statistical Institute 203 B. T. Road Calcutta700035 India

ISBN: (纸本)3540417672

A modified Genetic Algorithm (GA) based search strategy is presented here that is computationally more efficient than the conventional GA. Here the idea is to start a GA with the chromosomes of small length. Such chromosomes represent possible solutions with coarse resolution. A finite space around the position of solution in the first stage is subject to the GA at the second stage. Since this space is much smaller than the original search space, chromosomes of same length now represent finer resolution. In this way, the search progresses from coarse to fine solution in a cascaded manner. Since chromosomes of small size are used at each stage, the overall approach becomes computationally more efficient than a single stage algorithm with the same degree of final resolution. Also, since at the lower stage we work on low resolution, the algorithm can avoid local spurious extrema. The effectiveness of the proposed GA has been demonstrated for the optimization of some synthetic functions and on pattern recognition problems namely dot pattern matching and object matching with edge map. © Springer-Verlag Berlin Heidelberg 2001.

关键词： Genetic algorithms

来源：评论

学校读者我要写书评

暂无评论

Symmetry features for license plate classification

引用

CAAI Transactions on Intelligence Technology 2018年第3期3卷 176-183页

作者： Karpuravalli Srinivas Raghunandan Palaiahnakote Shivakumara Lolika Padmanabhan Govindaraju Hemantha Kumar Tong Lu Umapada Pal Department of Studies in Computer Science University of Mysore Karnataka India Faculty of Computer Science and Information Technology University of Malaya Kuala Lumpur Malaysia PES Institute of Technology Bangalore Karnataka India National Key Lab for Novel Software Technology Nanjing University Nanjing People's Republic of China Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkata India

Achieving high recognition rate for license plate images is challenging due to multi-type images. We present new symmetry features based on stroke width for classifying each input license image as private, taxi, cursive text, when they expand the symbols by writing and non-text such that an appropriate optical character recognition （OCR） can be chosen for enhancing recognition performance. The proposed method explores gradient vector flow （GVF） for defining symmetry features, namely, GVF opposite direction, stroke width distance, and stroke pixel direction. Stroke pixels in Canny and Sobel which satisfy the above symmetry features are called local candidate stroke pixels. Common stroke pixels of the local candidate stroke pixels are considered as the global candidate stroke pixels. Spatial distribution of stroke pixels in local and global symmetry are explored by generating a weighted proximity matrix to extract statistical features, namely, mean, standard deviation, median and standard deviation with respect the median. The feature matrix is finally fed to an support vector machine （SVM） classifier for classification. Experimental results on large datasets for classification show that the proposed method outperforms the existing methods. The usefulness and effectiveness of the proposed classification is demonstrated by conducting recognition experiments before and after classification.

关键词：车牌图像图像识别识别技术计算机技术

来源：评论

学校读者我要写书评

暂无评论

ARNET: ACTIVE-REFERENCE NETWORK FOR FEW-SHOT IMAGE SEMANTIC SEGMENTATION

ARNET: ACTIVE-REFERENCE NETWORK FOR FEW-SHOT IMAGE SEMANTIC ...

引用

2021 IEEE International Conference on Multimedia and Expo, ICME 2021

作者： Shi, Guangchen Wu, Yirui Palaiahnakote, Shivakumara Pal, Umapada Lu, Tong College of Computer and Information Hohai University China Department of Computer System and Information Technology University of Malaya Malaysia Computer Vision and Pattern Recognition Unit Indian Statistical Institute India National Key Lab for Novel Software Technology Nanjing University China

ISBN: (纸本)9781665438643

To make predictions on unseen classes, few-shot segmentation becomes a research focus recently. However, most methods build on pixel-level annotation requiring quantity of manual work. Moreover, inherent information on same-category objects to guide segmentation could have large diversity in feature representation due to differences in size, appearance, layout, and so on. To tackle these problems, we present an active-reference network (ARNet) for few-shot segmentation. The proposed active-reference mechanism not only supports accurately co-occurrent objects in either support or query images, but also relaxes high constraint on pixel-level labeling, allowing for weakly boundary labeling. To extract more intrinsic feature representation, a category-modulation module (CMM) is further applied to fuse features extracted from multiple support images, thus forgetting useless and enhancing contributive information. Experiments on PASCAL-5i dataset show the proposed method achieves a m-IOU score of 56.5% for 1-shot and 59.8% for 5-shot segmentation, being 0.5% and 1.3% higher than current state-of-the-art method. © 2021 IEEE

关键词： Semantic Segmentation

来源：评论

学校读者我要写书评

暂无评论

CNN based common approach to handwritten character recognition of multiple scripts

CNN based common approach to handwritten character recogniti...

引用

International Conference on Document Analysis and recognition

作者： Durjoy Sen Maitra Ujjwal Bhattacharya Swapan K. Parui Computer Vision & Pattern Recognition Unit Indian Statistical Institute Kolkata-108 India

ISBN: (纸本)9781479918065

There are many scripts in the world, several of which are used by hundreds of millions of people. Handwritten character recognition studies of several of these scripts are found in the literature. Different hand-crafted feature sets have been used in these recognition studies. However, convolutional neural network (CNN) has recently been used as an efficient unsupervised feature vector extractor. Although such a network can be used as a unified framework for both feature extraction and classification, it is more efficient as a feature extractor than as a classifier. In the present study, we performed certain amount of training of a 5-layer CNN for a moderately large class character recognition problem. We used this CNN trained for a larger class recognition problem towards feature extraction of samples of several smaller class recognition problems. In each case, a distinct Support Vector Machine (SVM) was used as the corresponding classifier. In particular, the CNN of the present study is trained using samples of a standard 50-class Bangla basic character database and features have been extracted for 5 different 10-class numeral recognition problems of English, Devanagari, Bangla, Telugu and Oriya each of which is an official Indian script. recognition accuracies are comparable with the state-of-the-art.

关键词： Handwriting recognition Databases Accuracy Training

来源：评论

学校读者我要写书评

暂无评论

New texture-spatial features for keyword spotting in video images 3

New texture-spatial features for keyword spotting in video i...

引用

3rd IAPR Asian Conference on pattern recognition, ACPR 2015

作者： Shivakumara, Palaiahnakote Liang, Guozhu Roy, Sangheeta Pal, Umapada Lu, Tong Faculty of Computer Science and Information Technology University of Malaya Kuala Lumpur Malaysia National Key Lab for Novel Software Technology Nanjing University Nanjing China Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkata India

ISBN: (纸本)9781479961009

Keyword spotting in video document images is challenging due to low resolution and complex background of video images. We propose the combination of Texture-Spatial-Features (TSF) for keyword spotting in video images without recognizing them. First, a segmentation method extracts words from text lines in each video image. Then we propose the set of texture features for identifying text candidates in the word image with the help of k-means clustering. The proposed method finds proximity between text candidates to study the spatial arrangement of pixels that result in feature vectors for spotting words in the input frame. The proposed method is evaluated on word images of different fonts, contrasts, backgrounds and font sizes, which are chosen from standard databases such as ICDAR 2013 video and our video data. Experimental results show that the proposed method outperforms the existing method in terms of recall, precision and f-measure. © 2015 IEEE.

关键词： Image segmentation

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：