In this paper, we present a method for detecting individuals in crowd by clustering a group of feature points belonging to the same person. In our approach, a feature point is considered to contain three attributes: t...
详细信息
In this paper, we present a method for detecting individuals in crowd by clustering a group of feature points belonging to the same person. In our approach, a feature point is considered to contain three attributes: the motion trajectory in video sequence, the sparse local appearance around point in current frame, and the structure relationship with body center related with local appearance. We exploit these attributes to cluster them appearing on the same individual to achieve detection purpose. The algorithm does not require observing entire human body and could discriminate different individuals under overlap. Our experiments show that this approach advances the performance of detecting individuals in crowds.
Computational models of visual processes with biological inspiration - and even biological realism - are currently of great interest in the computervision community. This paper provides a biologically plausible model...
详细信息
Computational models of visual processes with biological inspiration - and even biological realism - are currently of great interest in the computervision community. This paper provides a biologically plausible model of 2D shape which incorporates intermediate layers of visual representation that have not previously been fully explored. We propose that endstopping and curvature cells are of great importance for shape selectivity and show how their combination can lead to shape selective neurons. This shape representation model provides a highly accurate fit with neural data from and provides comparable results with real-world images to current computervision systems. The conclusion is that such intermediate representations may no longer require a learning approach as a bridge between early representations based on Gabor or Difference of Gaussian filters (that are not learned since they are well-understood) and later representations closer to object representations that still can benefit from a learning methodology.
This book constitutes the refereed proceedings of the 22nd International Conference on Information Processing in Medical Imaging, IPMI 2011, held at Kloster Irsee, Germany, in July 2011. The 24 full papers and 39 post...
ISBN:
(数字)9783642220920
ISBN:
(纸本)9783642220913
This book constitutes the refereed proceedings of the 22nd International Conference on Information Processing in Medical Imaging, IPMI 2011, held at Kloster Irsee, Germany, in July 2011. The 24 full papers and 39 poster papers included in this volume were carefully reviewed and selected from 224 submissions. The papers are organized in topical sections on segmentation, statistical methods, shape analysis, registration, diffusion imaging, disease progression modeling, and computer aided diagnosis. The poster sessions deal with segmentation, shape analysis, statistical methods, image reconstruction, microscopic image analysis, computer aided diagnosis, diffusion imaging, functional brain analysis, registration and other related topics.
Research towards Indian handwritten document analysis achieved increasing attention in recent years. In patternrecognition and especially in handwritten document recognition, standard databases play vital roles for e...
详细信息
Research towards Indian handwritten document analysis achieved increasing attention in recent years. In patternrecognition and especially in handwritten document recognition, standard databases play vital roles for evaluating performances of algorithms and comparing results obtained by different groups of researchers. For Indian languages, there is a lack of standard database of handwritten texts to evaluate performance of different document recognition approaches and for comparison purpose. In this paper, an unconstrained Kannada handwritten text database (KHTD) is introduced. The KHTD contains 204 handwritten documents of four different categories written by 51 native speakers of Kannada. Total number of text-lines and words in the dataset are 4298 and 26115, respectively. In most of text-pages of the KHTD contains either an overlapping or a touching text-lines and the average number of text-lines in each document on the database is 21. Two types of ground truths based on pixels information and content information are generated for the database. Providing these two types of ground truths for the KHTD, it can be utilized in many areas of document image processing such as sentence recognition/understanding, text-line segmentation, word segmentation, word recognition, and character segmentation. To provide a framework for other researches, recent text-line segmentation results on this dataset are also reported. The KHTD is available for research purposes.
Pedestrian classification is addressed by T. Watanabe et al. using SVM with 34704 CoHOG features. This paper addresses the pedestrian classification using neural network with 1344 CoHOG features (feature size is 25 ti...
详细信息
Pedestrian classification is addressed by T. Watanabe et al. using SVM with 34704 CoHOG features. This paper addresses the pedestrian classification using neural network with 1344 CoHOG features (feature size is 25 times small) and still achieve comparable results.
Particle Swarm Optimization combines with Munkres algorithm is proposed for Point pattern Matching in three dimensions. Point pattern Matching is a fundamental aspect of many fields in computervision and pattern reco...
详细信息
ISBN:
(纸本)9781612847238
Particle Swarm Optimization combines with Munkres algorithm is proposed for Point pattern Matching in three dimensions. Point pattern Matching is a fundamental aspect of many fields in computervision and patternrecognition. The point pattern matching technique could be described as finding an optimal transformation for one point pattern to the other under some measures. The proposed algorithm can be used to perform reliable matching between two different views of an object or scene. A new formula is proposed and used as a reasonable fitness function. The advantages of Munkres and Particle Swarm optimization are used and combined together in this paper. The simulated results show that the new algorithm is very effective.
Human activity analysis based on video is a hot research topic in the field of computervision, in which segmentation of human activity sequence is a fundamental question. In this paper, we present a new unsupervised ...
详细信息
Human activity analysis based on video is a hot research topic in the field of computervision, in which segmentation of human activity sequence is a fundamental question. In this paper, we present a new unsupervised segmentation method for human activity sequence. The main idea of this method is that the most dramatic point, which is regarded as the split point, is detected by gaussian model according to human action's mutation in the video. Experimental results show the feasibility and effectiveness of the segmentation algorithm in this paper.
Person re-identification is an important problem in computervision, which involves matching appearance of individuals between non-overlapping camera views. In this paper we present a novel appearance-based method for...
详细信息
Person re-identification is an important problem in computervision, which involves matching appearance of individuals between non-overlapping camera views. In this paper we present a novel appearance-based method for person re-identification problem. Color feature, Gabor, local binary pattern (LBP) are utilized to form a covariance descriptor to handle the difficulties such as varying illumination, viewpoint angle and non-rigid body, then distances of these features are computed to match these individuals. Experimental results over the challenging dataset VIPeR demonstrate that our method obtains competitive performance.
We propose a framework for efficient storing and scalable browsing of surveillance video based on video synopsis. Our framework employs a novel synopsis analysis scheme named Detail-based video synopsis to generate a ...
详细信息
We propose a framework for efficient storing and scalable browsing of surveillance video based on video synopsis. Our framework employs a novel synopsis analysis scheme named Detail-based video synopsis to generate a set of object flags to store and browse surveillance video synopsis. The main contributions of our work are: 1) highlighting important contents of surveillance video; 2) improving the storage efficiency of original video and synopsis video; 3) realizing multi-scale scalable browsing of synopsis video while reserving essential information. The experiments of implementing the framework are shown compared with the previous independent storage method of original video and synopsis video.
Because of writing styles of different individuals, some of the text-lines may be curved in shape. For recognition of such text-lines, their proper alignment is necessary. In this paper, we propose a text-line alignme...
详细信息
Because of writing styles of different individuals, some of the text-lines may be curved in shape. For recognition of such text-lines, their proper alignment is necessary. In this paper, we propose a text-line alignment technique based on painting algorithm. Here at first, Piece-wise Painting Algorithm (PPA) is used to get a number of black and white rectangular patches all along the text-line for text-line alignment. Identifying the degree of oscillation of the input text-line, some candidate pixels are also obtained based on horizontal projection and center points of the black patches. Using the degree of oscillation of the input text image and the candidate pixels a curve or straight line is fit to trace the baseline. Subsequently, all components of the text-line are deskewed based on analyzing the characteristic of the fit curve or line to align the components with respect to the horizontal imaginary baseline. The proposed algorithm was evaluated with 128 Persian handwritten text-lines containing 4317 sub words. Experimental analysis showed that 92.31% of the sub words were accurately aligned. Further, the proposed algorithm was tested with another Persian handwritten text-lines dataset [6] and remarkable results were achieved.
暂无评论