Search Results - Inner Mongolia University Library

International Workshop on Frontiers in Handwriting Recognition

Authors: Sangheeta Roy, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Chew Lim Tan. Affiliations: Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur, Malaysia; Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India; National Key Lab for Novel Software Technology, Nanjing University, Nanjing, China; Department of Computer Science, National University of Singapore

ISBN: (print) 9781509009824

The presence of both caption (graphics/superimposed) text and scene text in video frames is a major cause of the poor accuracy of text recognition methods. This paper proposes an approach for identifying tampered information by analyzing the spatial distribution of DCT coefficients in a new way to classify caption and scene text. Because caption text is edited or superimposed, it is artificially created, whereas scene text exists naturally in frames. We exploit this fact to identify the presence of caption and scene texts in video frames using DCT coefficients. The proposed method analyzes the distributions of both zero and non-zero coefficients (only positive values) locally by moving a window, and studies histogram operations over each input text line image. This generates line graphs for the respective zero and non-zero coefficient coordinates. We further study the behavior of text lines, namely linearity and smoothness based on centroid location analysis, and the principal axis direction of each text line for classification. Experimental results on standard datasets, namely ICDAR 2013 video, 2015 video, YVT video and our own data, show that the performance of text recognition methods improves significantly after classification compared with before classification.

Keywords: Discrete cosine transforms; Text recognition; Image color analysis; Linearity; Bars; Character recognition; Mathematical model
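As a rough illustration of the coefficient counting described in the abstract above, the following Python sketch computes an unnormalized 2D DCT-II of a small pixel block and counts its zero and positive coefficients per window. Rounding the coefficients to integers before counting, the 4 x 4 window size, and all function names are assumptions made for this example; the abstract does not specify them, and the histogram, centroid and principal-axis steps are not reproduced here.

```python
import math

def dct_1d(values):
    """Unnormalized DCT-II of a sequence of numbers."""
    n = len(values)
    return [
        sum(v * math.cos(math.pi * (i + 0.5) * k / n) for i, v in enumerate(values))
        for k in range(n)
    ]

def dct_2d(block):
    """Separable 2D DCT: transform every row, then every column."""
    rows = [dct_1d(row) for row in block]
    cols = [dct_1d(col) for col in zip(*rows)]
    return [list(row) for row in zip(*cols)]

def count_zero_and_positive(block):
    """Count zero and positive coefficients after rounding (rounding is an assumption)."""
    coeffs = [round(c) for row in dct_2d(block) for c in row]
    zeros = sum(1 for c in coeffs if c == 0)
    positives = sum(1 for c in coeffs if c > 0)
    return zeros, positives

def sliding_counts(image, size=4, step=4):
    """Apply the count to each size x size window of a grayscale image (list of rows)."""
    counts = []
    for top in range(0, len(image) - size + 1, step):
        for left in range(0, len(image[0]) - size + 1, step):
            window = [row[left:left + size] for row in image[top:top + size]]
            counts.append(count_zero_and_positive(window))
    return counts
```

A perfectly flat window concentrates all of its energy in the single DC coefficient, so it yields 15 zeros and 1 positive value; textured windows spread energy over more coefficients, which is the kind of difference a per-window distribution can expose.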

Using electromagnetic input for multi-user or two-handed spatial gestural interaction based on the digital compass 15

17th International Conference on Human-computer Interaction with Mobile Devices and Services, MobileHCI 2015

Authors: Yuksel, Kamer Ali; Baz, Ipek; Ozduman, Haluk. Affiliations: Computer Vision and Pattern Recognition Laboratory, Sabanci University, Turkey; Research Center for ICT German Turkish Advanced Technical University of Berlin, Germany

ISBN: (print) 9781450336529

Multiple researchers recently proposed the use of the digital compass embedded in mobile devices for touchless interaction in the 3D space around them. These methods overcome several limits imposed by other interaction techniques and were evaluated for a variety of uses. However, they do not support collaborative settings and are prone to dynamic noise caused by external conditions, as with most other sensor-based interaction techniques. In this paper, we propose the use of frequency-modulated electromagnets as an input medium for magnetic interaction to overcome its various constraints and further enable multi-user and two-handed input. Furthermore, we present the hardware design specifications of a novel input device, referred to as the electromagnetic stylus, which was prototyped to conduct a user study on the proposed method. Experimental results indicate that gestures performed simultaneously by four electromagnetic styli can be accurately recognized using a single magnetic field sensor, and that dynamic noise can be substantially reduced. © 2015 ACM.

Keywords: Timing circuits
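To make the idea of telling several electromagnets apart with one sensor more concrete, here is a minimal Python sketch that measures the signal power at each stylus frequency with the Goertzel algorithm. The abstract does not state how the authors separate the signals, so the algorithm choice, the frequencies, the threshold and the stylus names are all assumptions made for illustration.

```python
import math

def goertzel_power(samples, sample_rate, freq):
    """Return the squared DFT magnitude of `samples` at the bin nearest `freq`."""
    n = len(samples)
    k = round(n * freq / sample_rate)          # nearest DFT bin index
    coeff = 2.0 * math.cos(2.0 * math.pi * k / n)
    s_prev, s_prev2 = 0.0, 0.0
    for x in samples:
        s = x + coeff * s_prev - s_prev2       # second-order resonator recurrence
        s_prev2, s_prev = s_prev, s
    return s_prev ** 2 + s_prev2 ** 2 - coeff * s_prev * s_prev2

def active_styli(samples, sample_rate, stylus_freqs, threshold):
    """List the (hypothetical) styli whose assigned frequency exceeds the power threshold."""
    return [
        name
        for name, freq in stylus_freqs.items()
        if goertzel_power(samples, sample_rate, freq) > threshold
    ]
```

Because a constant offset such as a static background field falls into the zero-frequency bin, it does not register at the stylus frequencies in this sketch, which mirrors the abstract's point that modulation helps against noise.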

New Sharpness Features for Image Type Classification Based on Textual Information

IAPR International Workshop on Document Analysis Systems, DAS

Authors: K. S. Raghunandan, Palaiahnakote Shivakumara, G. Hemantha Kumar, Umapada Pal, Tong Lu. Affiliations: Department of Studies in Computer Science, University of Mysore, Karnataka, India; Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur, Malaysia; Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India; National Key Lab for Novel Software Technology, Nanjing University, Nanjing, China

Achieving good recognition results with a single method for text lines in video or natural scene images captured by high-resolution cameras or low-resolution mobile cameras, and in images from web pages, is often hard. In this paper, we propose new sharpness-based features of the textual portion of each input text line image, using the HSI color space, to classify an input image into one of four classes (video, scene, mobile or born digital). This helps in choosing an appropriate method based on the class of the input text, improving its recognition rate. For a given input text line image, the proposed method obtains H, S and I images. Canny edge images are then obtained for the H, S and I spaces, which yields text candidates. We perform a sliding window operation over the text candidate image of each text line in each color space to estimate the new sharpness by calculating stroke width and gradient information. The sharpness values of the text lines in the three color spaces are then fed to k-means clustering with maximum, minimum and average guesses, which results in three respective clusters. The means of the clusters for the respective color spaces form a feature vector of nine values for image classification with an SVM classifier. Experimental results on standard datasets, namely ICDAR 2013, ICDAR 2015 video, ICDAR 2015 natural scene data, ICDAR 2013 born digital data and images captured by a mobile camera (our own data), show that the proposed classification method helps in improving recognition results.

Keywords: Mobile communication; Text recognition; Image edge detection; Image resolution; Image color analysis; Digital images; Optical character recognition software
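The clustering step in the abstract above can be sketched as a one-dimensional k-means seeded with the maximum, minimum and average of the sharpness values, run once per color channel to give a nine-value feature vector. This is a minimal Python sketch under stated assumptions: the sharpness values are taken as given (the stroke-width and gradient computation is not shown), the "guesses" are read as initial cluster centers, and the function names are hypothetical.

```python
def kmeans_1d(values, seeds, iterations=20):
    """Plain 1D k-means starting from the given seed centers."""
    centers = list(seeds)
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        for v in values:
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            clusters[nearest].append(v)
        # Keep the old center when a cluster receives no values.
        updated = [
            sum(c) / len(c) if c else centers[i]
            for i, c in enumerate(clusters)
        ]
        if updated == centers:
            break
        centers = updated
    return centers

def channel_features(sharpness):
    """Three cluster means for one channel, seeded with max, min and average."""
    seeds = [max(sharpness), min(sharpness), sum(sharpness) / len(sharpness)]
    return kmeans_1d(sharpness, seeds)

def feature_vector(h_sharpness, s_sharpness, i_sharpness):
    """Concatenate the per-channel cluster means into nine features."""
    return (
        channel_features(h_sharpness)
        + channel_features(s_sharpness)
        + channel_features(i_sharpness)
    )
```

The resulting list could then be passed to any SVM implementation; that step is left out to keep the sketch free of third-party dependencies.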

Fourier Coefficients for Fraud Handwritten Document Classification through Age Analysis

International Workshop on Frontiers in Handwriting Recognition

Authors: K.S. Raghunandan, Palaiahnakote Shivakumara, B.J. Navya, G. Pooja, Navya Prakash, G. Hemantha Kumar, Umapada Pal, Tong Lu. Affiliations: Department of Studies in Computer Science, University of Mysore, Karnataka, India; Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur, Malaysia; Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India; National Key Lab for Novel Software Technology, Nanjing University, Nanjing, China

ISBN: (print) 9781509009824

As new digital technologies emerge to improve living standards, they also lead to an increase in crime. Unlike existing approaches that use the content of handwriting for fraud/forged document identification, in this paper we propose a novel approach that explores the quality of handwritten documents by considering both foreground and background information to identify whether a document is old or new. The proposed approach relies on the fact that if a fraud document is created some time after the original one, the fraud document is treated as the new one and the original as the old one in this work. To identify whether a given handwritten document is old or new, we propose to divide the Fourier coefficients of the input image into positive and negative coefficient images, and then reconstruct the respective images to obtain two reconstructed ones (the conquer step). The contrast of the reconstructed images obtained before and after divide-conquer is studied to analyze the age of the document based on image quality. The proposed approach finds a unique relationship between the reconstructed images, obtained before and after divide-conquer, to identify the input image as old or new. To evaluate the proposed approach, we conduct experiments on our own handwritten dataset and a standard database, namely Google-LIFE magazine. Comparative studies show that the proposed approach outperforms the existing approaches in terms of classification rate.

Keywords: Image reconstruction; Image edge detection; Ink; Printers; Forgery; Distortion; Feature extraction

Evaluation of feature sensitivity to training data inaccuracy in detection of retinal lesions

Workshops on Image Processing Theory, Tools and Applications, IPTA

Authors: Lauri Laaksonen, Antti Hannuksela, Ela Claridge, Pauli Fält, Markku Hauta-Kasari, Hannu Uusitalo, Lasse Lensu. Affiliations: Machine Vision and Pattern Recognition Laboratory, Lappeenranta University of Technology, Lappeenranta, Finland; School of Computer Science, The University of Birmingham, United Kingdom; School of Computing, University of Eastern Finland, Finland; Department of Ophthalmology, University of Tampere, Finland; TAUH Eye Center, Tampere University Hospital, Finland

Computer-aided diagnostic and segmentation tools have become increasingly important in reducing the workload of medical experts performing diagnosis, monitoring and documentation of various eye diseases such as age-related macular degeneration (AMD), diabetic retinopathy (DR) and glaucoma. Supervised methods have been developed for the segmentation and detection of lesions, and the reported performance has been good. The supervised methods, however, need representative data to properly train the classifier. Inaccuracies in the ground truth may have a significant impact on the performance of a supervised method because the training data are then not representative. In this study, a quantitative evaluation of the sensitivity of different image features, including colour, texture, edge and higher-level features, to inaccuracy in the ground truth on exudates is presented. A mean decrease of approx. 20% in sensitivity and 13% in specificity was observed when using the most inaccurate training data.

Keywords: Training data; Sensitivity; Retina; Image color analysis; Image segmentation; Feature extraction; Lesions
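For readers unfamiliar with the two figures reported in the abstract above, the following Python sketch shows how sensitivity and specificity are commonly computed from binary predictions and ground truth. The per-pixel framing and the function name are assumptions for illustration; the abstract does not describe the authors' exact evaluation protocol.

```python
def sensitivity_specificity(predicted, truth):
    """Compute (sensitivity, specificity) from two equal-length binary sequences."""
    tp = sum(1 for p, t in zip(predicted, truth) if p and t)
    tn = sum(1 for p, t in zip(predicted, truth) if not p and not t)
    fp = sum(1 for p, t in zip(predicted, truth) if p and not t)
    fn = sum(1 for p, t in zip(predicted, truth) if not p and t)
    # Guard against empty classes so the sketch never divides by zero.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return sensitivity, specificity
```

Sensitivity is the fraction of true lesion samples that are detected, and specificity is the fraction of non-lesion samples that are correctly left unmarked.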

Boosted local classifiers for visual tracking

IEEE International Conference on Multimedia and Expo (ICME)

Authors: Weijian Ruan, Jun Chen, Jinqiao Wang, Bo Luo, Wenjun Huang, Ruimin Hu. Affiliations: National Engineering Research Center for Multimedia Software, Computer School of Wuhan Univ., China; The Key Laboratory of Multimedia and Network Communication Engineering, Wuhan University, China; National Laboratory of Pattern Recognition, Chinese Academy of Sciences, China; Collaborative Innovation Center of Geospatial Technology, China

ISBN: (print) 9781467372596

Most existing discriminative tracking methods model a target object as a whole and train a tracker based on holistic templates, which cannot effectively deal with partial occlusions. Instead, in this paper, by treating the target as a collection of local patches, we propose a novel tracking approach based on boosted local classifiers. Initially, a set of local patches is sampled to train a set of local classifiers, and the weight of each classifier is assigned based on its estimated error. In addition, positive and negative examples are sampled for the model update under two constraints during the tracking process, which helps obtain more negatives for updating the appearance model and improves the updating efficiency. By updating the weights of the local classifiers based on temporal stability, the tracker can effectively handle partial occlusions. Extensive experiments on various challenging image sequences demonstrate its superiority over several state-of-the-art methods.

Keywords: Target tracking; Training; Testing; Stability analysis; Object tracking; Visualization; Multimedia communication
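One common way to weight a classifier by its estimated error is the AdaBoost rule, sketched below in Python for a set of local patch classifiers. The abstract only says that weights depend on the estimated error, so the specific formula, the clamping constant and the function names are assumptions, not the authors' method.

```python
import math

def classifier_weight(error, eps=1e-6):
    """AdaBoost-style weight: large for low error, zero at chance level (0.5)."""
    error = min(max(error, eps), 1.0 - eps)    # clamp to avoid log(0)
    return 0.5 * math.log((1.0 - error) / error)

def combined_score(patch_scores, errors):
    """Weighted vote over local patch scores in [-1, 1]."""
    weights = [classifier_weight(e) for e in errors]
    return sum(w * s for w, s in zip(weights, patch_scores))
```

Under this rule a patch classifier that performs at chance level, for example because its patch is occluded, receives a weight of zero and stops influencing the combined score.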

LSSLP – Local structure sensitive label propagation

Information Sciences, 2016, vol. 332, pp. 19-32

Authors: Zhenfeng Zhu, Jian Cheng, Yao Zhao, Jieping Ye. Affiliations: Institute of Information Science, Beijing Jiaotong University, Beijing 100044, China; Beijing Key Laboratory of Advanced Information Science and Network Technology, Beijing 100044, China; National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences (CAS), 100190, China; Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI 48109-2218, USA

Label propagation is an approach that iteratively spreads the prior state of label confidence associated with each sample to its neighbors until a global convergence state is reached. This process has been shown to be closely connected with a general graph-based regularization framework. Within this framework, a closed-form linear system can be built to carry out label propagation. In this paper, to address several issues inherent in previous graph-based label propagation frameworks, we propose a reformulated one, i.e., local structure sensitive label propagation (LSSLP). By associating each graph vertex with a local structure sensitive tuning factor, the empirical loss error on each vertex can be better controlled to stay consistent with the commonly assumed 'cluster assumption' of data structure. For the sake of information conservation, we relax the state conservation constraint of label confidence from labeled samples proposed by Belkin et al. (2004) to a more general form. Meanwhile, an inverse-warping procedure is incorporated into the proposed local structure sensitive label propagation framework to maintain a sufficiently large and stable classification margin. Based on the inversion technique for block matrices, we extend LSSLP to its incremental and inductive versions and also present a computationally efficient implementation of it. Experimental results demonstrate that the performance of the reformulated regularization framework for label propagation is highly competitive.

Keywords: Graph model; Label propagation; Machine learning; Pattern classification; Semi-supervised learning
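The first sentence of the abstract above describes generic iterative label propagation, which can be sketched in Python as follows. This is the classic scheme on a row-normalized affinity matrix, not LSSLP itself: the local structure sensitive tuning factor, the relaxed conservation constraint and the inverse-warping step are not modeled, and the `alpha` parameter and function name are assumptions.

```python
def propagate_labels(weights, labels, alpha=0.8, iterations=100):
    """Spread label confidence over a graph and return one class index per node.

    weights: square affinity matrix (list of lists).
    labels:  one-hot rows for labeled nodes, all-zero rows for unlabeled ones.
    """
    n = len(weights)
    n_classes = len(labels[0])
    # Row-normalize so each node averages over its neighbors.
    norm = []
    for row in weights:
        total = sum(row)
        norm.append([w / total if total else 0.0 for w in row])
    scores = [list(row) for row in labels]
    for _ in range(iterations):
        # Mix neighbor confidence with the node's own prior label.
        scores = [
            [
                alpha * sum(norm[i][j] * scores[j][c] for j in range(n))
                + (1.0 - alpha) * labels[i][c]
                for c in range(n_classes)
            ]
            for i in range(n)
        ]
    return [max(range(n_classes), key=lambda c: row[c]) for row in scores]
```

On a chain of four nodes where the two ends are labeled and the middle link is weak, each unlabeled node takes the label of the end it is strongly connected to, which is the behavior the cluster assumption calls for.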

A glass image classification method based on multi-feature fusion

International Conference on Wavelet Analysis and Pattern Recognition (ICWAPR)

Authors: Liang Zhang, Jing Wen, Sheng-Zhou Xu, Hao-Yang Xing, Yu Zhu, Heng-Xin Chen. Affiliations: College of Computer Science, Chongqing University, Chongqing, China; Key Laboratory of Pattern Recognition and Intelligent Chengdu University, China; School of Computer Science, South-central University For Nationalities, Wuhan, China; Magnetic Resonance Imaging Research Centre, Huaxi Hospital, Chengdu, Sichuan, China; Chongqing University, Chongqing, Sichuan, CN

ISBN: (print) 9781509029181

In this work, a new glass classification method is proposed. First, images are enhanced by image preprocessing. Second, a series of glass features, including shape and texture features, is proposed. Finally, we employ a simple minimum distance classifier to classify the input glass images. The experimental results show that the proposed method has high classification efficiency and accuracy.

Keywords: Glass; Pattern recognition; Training; Shape; Wavelet analysis; Sorting; Production facilities
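A minimum distance classifier, as named in the abstract above, assigns a sample to the class whose mean feature vector is closest. The Python sketch below assumes Euclidean distance and already-extracted feature vectors; the class labels and function names are hypothetical, and the shape and texture feature extraction is not shown.

```python
import math

def class_means(samples):
    """Map each class label to the mean of its training feature vectors."""
    return {
        label: [sum(column) / len(vectors) for column in zip(*vectors)]
        for label, vectors in samples.items()
    }

def classify(feature, means):
    """Return the label whose class mean is nearest in Euclidean distance."""
    return min(means, key=lambda label: math.dist(feature, means[label]))
```

Training reduces to averaging, and classification to one distance computation per class, which is consistent with the efficiency the abstract emphasizes.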

Corrigendum to “Road detection algorithm for Autonomous Navigation Systems based on dark channel prior and vanishing point in complex road scenes” [Robot. Auton. Syst. 85 (2016) 1–11]

Robotics and Autonomous Systems, 2017, vol. 88, p. 202

Authors: Yong Li, Weili Ding, Xuguang Zhang, Zhaojie Ju. Affiliations: Laboratory of Pattern Recognition and Intelligent Systems, Key Laboratory of Industrial Computer Control Engineering of Hebei Province, Department of Automation, Institute of Electrical Engineering, Yanshan University, Qinhuangdao, Hebei 066004, China; College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning 110004, China; School of Computing, University of Portsmouth, PO1 3HE, UK

New texture-spatial features for keyword spotting in video images 3

3rd IAPR Asian Conference on Pattern Recognition, ACPR 2015

Authors: Shivakumara, Palaiahnakote; Liang, Guozhu; Roy, Sangheeta; Pal, Umapada; Lu, Tong. Affiliations: Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur, Malaysia; National Key Lab for Novel Software Technology, Nanjing University, Nanjing, China; Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India

ISBN: (print) 9781479961009

Keyword spotting in video document images is challenging due to the low resolution and complex backgrounds of video images. We propose a combination of Texture-Spatial-Features (TSF) for keyword spotting in video images without recognizing the words. First, a segmentation method extracts words from text lines in each video image. Then we propose a set of texture features for identifying text candidates in the word image with the help of k-means clustering. The proposed method finds the proximity between text candidates to study the spatial arrangement of pixels, which results in feature vectors for spotting words in the input frame. The proposed method is evaluated on word images of different fonts, contrasts, backgrounds and font sizes, chosen from standard databases such as ICDAR 2013 video and our own video data. Experimental results show that the proposed method outperforms the existing method in terms of recall, precision and F-measure. © 2015 IEEE.

Keywords: Image segmentation
