In this paper, German, Polish, Spanish, and Portuguese large vocabulary continuous speech recognition (LVCSR) systems developed by the RWth Aachen University are presented. All the above mentioned systems for the afor...
详细信息
ISBN:
(纸本)9781634394352
In this paper, German, Polish, Spanish, and Portuguese large vocabulary continuous speech recognition (LVCSR) systems developed by the RWth Aachen University are presented. All the above mentioned systems for the aforementioned languages are used for the Quaero and EU-Bridge project evaluations. the LVCSR systems developed for these competitive evaluations focus on various domains like broadcast news, podcasts and lecture domain. Transcription of the speech for these tasks is challenging due to huge variability in the acoustic conditions and a significant portion of audio data includes spontaneous speech. Good improvements are obtained using state-of-the-art multilingual bottleneck features, minimum phone error trained acoustic models, language model (LM) adaptation and confusion-network based system combination. In addition, an open vocabulary approach using morphemic units is investigated along withthe LM adaptation for the German LVCSR.
Development of an ITS (Intelligent Transport System) has drawn much attention from computervision community in recent years. In particular, various techniques for detecting pedestrians automatically have been propose...
详细信息
ISBN:
(纸本)9781479959563
Development of an ITS (Intelligent Transport System) has drawn much attention from computervision community in recent years. In particular, various techniques for detecting pedestrians automatically have been proposed by many researchers. Among them, the HOG feature proposed by Dalai & Triggs has gained much interest in the pedestrian detection. However, previous methods including the original HOG feature have not achieved satisfactory detection rates. In this paper, we propose an extension of the HOG feature, i.e., flexible choice of the number of bins and automatic definition of a cell size and a block size by parameterizing their scales. By comparative experiments, it was confirmed that the proposed method outperforms the previous methods in the performance of pedestrian detection.
Accurate evaluation of shrinkage characteristic for tobacco lamina during drying process is important for optimizing tobacco primary process. the present work developed a detection and characterization method of shrin...
详细信息
Abnormal activity detection plays an important role in many areas such as surveillance, military installations, and sports. Existing abnormal activity detectors mostly rely on motion data obtained over a number of fra...
详细信息
Abnormal activity detection plays an important role in many areas such as surveillance, military installations, and sports. Existing abnormal activity detectors mostly rely on motion data obtained over a number of frames to characterize abnormality. However, only motion may not be able to capture all forms of abnormality, in particular, poses that do not amount to motion "outliers". In this paper, we propose two different spatio-temporal descriptors, a silhouette and optic flow based method and a dense trajectory based method which additionally include trajectory shape descriptor, to detect abnormalities. these two descriptors enable us to classify abnormal versus non-abnormal activities using SVM. Comparison with existing methods, using five standard datasets, shows that dense trajectory based method outperforms state-of-the-art results in crowd dataset and silhouette and optic flow based method outperforms others in some datasets.
the intrinsic images of fingerprint, such as orientation field and frequency map, represent the particular and basic characteristics of fingerprint ridge/valley patterns, and play a key role in feature extraction and ...
详细信息
Recognizing human action from video sequences has lots of applications that make it an interesting research subject. Motion History Image (MHI) is a good spatio-temporal template to represent the distinctive profile o...
详细信息
Recognizing human action from video sequences has lots of applications that make it an interesting research subject. Motion History Image (MHI) is a good spatio-temporal template to represent the distinctive profile of an action using a single image. However, in this paper, we use Local Binary patterns (LBP) to extract the highlighted features from the spatio-temporal template and formulate them as a histogram to make the feature vector. Rather than MHL we use Directional MHI (DMHI) for this purpose. We also use shape feature taken from selective silhouettes and concatenate them with LBP histograms. We measured the performance of the proposed action representation method along with some variants of it by employing Weizmann action dataset and found reasonably higher accuracy for practical use.
this paper will discuss about combination of the genetic algorithm and the rough set theory to solve complex, multi-class super-deformed and multi-patternrecognition problems of the offline handwritten chinese charac...
详细信息
this paper will discuss about combination of the genetic algorithm and the rough set theory to solve complex, multi-class super-deformed and multi-patternrecognition problems of the offline handwritten chinese character recognition. It gives a genetic algorithm based offline handwritten chinese character feature simple algorithm, without loss of the original information, reducing the feature vector dimension, reducing the complexity of the recognition processing. It also presents a heuristic method of redundancy reduction samples and reduction redundant training samples, to further reduce the complexity of the recognition processing. It proposes a rule-based confidence offline handwritten chinese character integration recognition rule, the experimental results show that the proposed feature reduction method of the reduction effect of the multidimensional statistical features of offline handwritten chinese character is obvious;rule confidence fusion recognition method can improve the recognition rate of off-line handwritten chinese character recognition system.
Image segmentation is a fundamental process in computervision applications. this paper presents a novel method to deal withthe issue of image segmentation. Each image is first segmented coarsely, and represented as ...
详细信息
Image segmentation is a fundamental process in computervision applications. this paper presents a novel method to deal withthe issue of image segmentation. Each image is first segmented coarsely, and represented as a graph model. then, a semi-supervised algorithm is utilized to estimate the relevance between labeled nodes and unlabeled nodes to construct a relevance matrix. Finally, a normalized cut criterion is utilized to segment images into meaningful units. the experimental results conducted on Berkeley image databases and MSRC image databases demonstrate the effectiveness of the proposed strategy.
Face gender recognition is a challenging problem in the traditional field of patternrecognition. In this paper, we propose a deep learning model that can learn the joint high-level and low-level features of human fac...
详细信息
ISBN:
(纸本)9781479944187
Face gender recognition is a challenging problem in the traditional field of patternrecognition. In this paper, we propose a deep learning model that can learn the joint high-level and low-level features of human face to address this problem. Our deep neural networks apply convolution and subsampling in extracting the local and abstract features of human face, and reconstruct the raw input images to learn global and effective features as supplementary information at the same time. We also add a trainable weight in the networks when combining the two kinds of features to realize the final gender classification. Experiment results show that our method achieves the highest accuracy compared with existing methods, when test on the mixed face dataset. Further, in the generalization test, the average classification rate on 3 public datasets of our method is 5% higher than the joint Local Binary pattern (LBP) and Support Vector Machine (SVM) method, and is nearly 1% higher than the SVM with face pixels method. this proves our method outperforms the traditional methods in both learning ability and generalization ability.
In this paper, we propose LinedCut: a novel method for interactive image segmentation which requires only a single line drawing to identify the object of interest in the image. the handy interaction mode can address t...
详细信息
ISBN:
(纸本)9781479958368
In this paper, we propose LinedCut: a novel method for interactive image segmentation which requires only a single line drawing to identify the object of interest in the image. the handy interaction mode can address the problem of object scale very well. Our approach consists of the following three steps: first, a given image is over-segmented into superpixels using superpixel algorithm; secondly, a merging technique based on bag-of-color feature and k-means is applied to merge similar adjacent superpixel pairs; finally, a graph-cut based approach which exploits user interaction information is introduced to get the final segmentation result. Despite its simplicity, we show that the LinedCut method is able to achieve a performance comparable to the state-of-the-art. the method can be easily developed by replacing any one method among the three steps with other methods.
暂无评论