Prediction of gender and other demographic attributes of individuals from handwriting samples offers an interesting basic, as well as applied research problem. The correlation between gender and the visual appearance ...
详细信息
ISBN:
(纸本)9781509009824
Prediction of gender and other demographic attributes of individuals from handwriting samples offers an interesting basic, as well as applied research problem. The correlation between gender and the visual appearance of handwriting has been validated by a number of studies and the present study is based on the same idea. We exploit the textural measurements as the discriminating attribute between male and female writings. The textural information in a writing is captured by applying a bank of Gabor filters to the image of handwriting. The mean and standard deviation values of the filter responses are collected in matrix and the Fourier transform of the matrix is used as a feature. Classification is carried out using a feed forward neural network. The proposed technique evaluated on a subset of the QUWI database realized promising results under different experimental settings.
作者:
Grauman, K.Betke, M.Lombardi, J.Gips, J.Bradski, G.R.Vision Interface Group
AI Laboratory Massachusetts Institute of Technology 77 Massachusetts Avenue CambridgeMA02139 United States Computer Science Department
Boston University 111 Cummington St BostonMA02215 United States EagleEyes
Computer Science Department Boston College Fulton Hall Chestnut HillMA02467 United States Vision
Graphics and Pattern Recognition Microcomputer Research Laboratory Intel Corporation SC12-303 2200 Mission College Blvd Santa ClaraCA95054-1537 United States
Two video-based human-computer interaction tools are introduced that can activate a binary switch and issue a selection command. "BlinkLink," as the first tool is called, automatically detects a user's e...
详细信息
Egyptian Arabic (EA) is a colloquial version of Arabic. It is a low-resource morphologically rich language that causes problems in Large Vocabulary Continuous Speech recognition (LVCSR). Building LMs on morpheme level...
详细信息
Deep neural networks have achieved remarkable successes in learning feature representations for visual classification. However, deep features learned by the softmax cross-entropy loss generally show excessive intra-cl...
详细信息
Road networks are important features of satellite imagery. The main contribution of the present road detection method consists of an effective enhancement technique and an efficient segmentation technique that removes...
详细信息
Road networks are important features of satellite imagery. The main contribution of the present road detection method consists of an effective enhancement technique and an efficient segmentation technique that removes non-road pixels step by step from the image where parameters involved: in each step images are determined by the sensor characteristics (like spatial resolution and spectral range) of the satellite. Also, the segmentation process depends not only on the road contrast but also on the road length. Thus, a low contrast but long road segment does not get removed. We have tested the algorithm on a number of images from IRS and SPOT satellites and the results are satisfactory.
Recent research on face analysis has demonstrated the richness of information embedded in feature vectors extracted from a deep convolutional neural network. Even though deep learning achieved a very high performance ...
详细信息
Automatic recognition of handwritten texts in video lectures has important applications. In video lectures, the presenter usually writes on white / colored board. The video camera often captures the writing board alon...
详细信息
Automatic recognition of handwritten texts in video lectures has important applications. In video lectures, the presenter usually writes on white / colored board. The video camera often captures the writing board along with certain other objects possibly including the presenter itself. recognition of handwritten texts from such a video frame requires prior detection of the region of texts in the frame. In this article, we present our recent study of text localization in such video lecture frames. Here, we use Scale Invariant Feature Transform (SIFT) descriptors densely over the entire region of the frame. The descriptors are located on a regular grid of 5 pixels following the usual practice and considered a uniform patch size of 60 × 60 pixels as its support on the basis of an empirical study. This SIFT descriptor at each location (grid point) is fed as a 128-dimensional input feature vector to a Multilayer Perceptron (MLP) network which gives response for each grid point as either text or non-text. Depending on certain aggregate response at each pixel we localize text regions in the input video frame. Next, we employ K-means clustering to detect the text components present in the localized region of the video frame. Finally, two simple rules are applied to decide certain possible detected text components as noise. We obtained encouraging simulation results of this approach on a variety of video lecture frames.
Many crowd abnormal motion detection methods in video surveillance have been proposed in resent ***,most of them are based on low semantic features,such gray value,velocity and ***,low semantic features contain weak d...
详细信息
Many crowd abnormal motion detection methods in video surveillance have been proposed in resent ***,most of them are based on low semantic features,such gray value,velocity and ***,low semantic features contain weak discriminative information of the *** addition,these methods often ignore important information in time and space *** this work,a high semantic representation is *** feature analysis(SFA) is adopted to provide high semantic ***,a random walk model,which takes into account the spatio-temporal information,is used to detect the abnormal motions in *** conduct extensive experiments on two datasets to demonstrate the effectiveness of proposed *** results suggest that our method outperforms the state-of-the-art methods.
暂无评论