Local Binary pattern (LBP) is a powerful texture descriptor for its tolerance against illumination changes and its computational simplicity. the basic LBP encodes 256 feature patterns in a 3×3 neighborhood, but n...
详细信息
In this paper, we introduce MMTrack, a hybrid single pedestrian tracking algorithm that puts together the advantages of descriptive and discriminative approaches for tracking. Specifically, we combine the idea of clus...
详细信息
this paper deals with a new registration method based on a specific level-line grouping. Because of its contrast-change invariance, our approach is an appropriate method for matching outdoor image sequences. Moreover,...
详细信息
Automatic facial expression recognition is a challenging problem in computervision, and has gained significant importance in applications of human-computer interaction. this paper presents a new appearance-based feat...
详细信息
We present a novel real-time computervision-based system for facilitating interactions between a single human and a multi-robot system: a user first selects an individual robot from a group of robots, by simply looki...
详细信息
this paper investigates the impact of camera separation on the performance of an H.264/AVC based stereo-vision video codec. To achieve this, the multi-frame referencing property of H.264/AVC has been employed and the ...
详细信息
ISBN:
(纸本)9780889868236
this paper investigates the impact of camera separation on the performance of an H.264/AVC based stereo-vision video codec. To achieve this, the multi-frame referencing property of H.264/AVC has been employed and the standard H.264/AVC reference software has been modified to support stereoscopic video coding. Experimental results were generated using two sets of wide baseline convergent multi-view test videos: Breakdancers and Ballet. To generate a set of synchronized stereo-videos from the same scene with different inter-camera angles, all possible camera pairs are generated and classified according to their inter-camera angles. the resulting sets of stereo videos are coded using a H.264/AVC based stereo-vision and simulcast coding schemes at different bitrates. Results indicate that the stereo-vision codec outperforms the simulcast coding by up to 3.9dB at lower inter-camera angles and it deteriorates as the inter-camera angle increases. Finally, a range of inter-camera angles for best use of either stereo-vision or simulcast coding is determined.
In this paper, we present a vision-based framework to manipulate the augmented reality (AR) objects robustly in a marker-less AR system. It is known that one of the promising ways to develop a marker-less AR system is...
详细信息
Besides the decorative purposes, vehicle manufacture logos can provide rich information for vehicle verification and classification in many applications such as security and information retrieval. Detection and recogn...
详细信息
ISBN:
(纸本)9783642123030
Besides the decorative purposes, vehicle manufacture logos can provide rich information for vehicle verification and classification in many applications such as security and information retrieval. Detection and recognition of vehicle manufacture logos are, however, very challenging because they might lack of discriminative features themselves. In this paper, we propose a method to detect vehicle manufacture logos using contextual information, i.e., the information of surrounding objects near vehicle manufacture logos such as license plates, headlights, and grilles. the experimental results demonstrate that the proposed method is more effective and robust than other methods.
All Han-based scripts (chinese, Japanese, and Korean) possess similar visual characteristics. Hence system development for identification of chinese, Japanese and Korean scripts from a single document page is quite ch...
详细信息
the SignSpeak project will be the first step to approach sign language recognition and translation at a scientific level already reached in similar research fields such as automatic speech recognition or statistical m...
详细信息
ISBN:
(纸本)9782951740860
the SignSpeak project will be the first step to approach sign language recognition and translation at a scientific level already reached in similar research fields such as automatic speech recognition or statistical machine translation of spoken languages. Deaf communities revolve around sign languages as they are their natural means of communication. Although deaf, hard of hearing and hearing signers can communicate without problems amongst themselves, there is a serious challenge for the deaf community in trying to integrate into educational, social and work environments. the overall goal of SignSpeak is to develop a new vision-based technology for recognizing and translating continuous sign language to text. New knowledge about the nature of sign language structure from the perspective of machine recognition of continuous sign language will allow a subsequent breakthrough in the development of a new vision-based technology for continuous sign language recognition and translation. Existing and new publicly available corpora will be used to evaluate the research progress throughout the whole project.
暂无评论