DEtection TRansformer (DETR) and its variant models degrade the object detection performance due to the inability to provide the object position a priori and the lack of shape deviation supervision between prediction ...
详细信息
Remote photoplethysmography (rPPG) aims to measure non-contact physiological signals from facial videos, which has shown great potential in many applications. Most existing methods directly extract video-based rPPG fe...
详细信息
Remote photoplethysmography (rPPG) aims to measure non-contact physiological signals from facial videos, which has shown great potential in many applications. Most existing methods directly extract video-based rPPG fe...
详细信息
Multiple researchers recently proposed the use of the digital compass embedded in mobile devices for touchless interaction in the 3D space around them. These methods overcome several limits imposed by other interactio...
详细信息
There are many video images where hand written text may appear. Therefore handwritten scene text detection in video is essential and useful for many applications for efficient indexing, retrieval etc. Also there are m...
详细信息
Embedding data into vector spaces is a very popular strategy of patternrecognition methods. When distances between embeddings are quantized, performance metrics become ambiguous. In this paper, we present an analysis...
详细信息
Optical Character recognition (OCR) is one of the continuously explored problems. Presently, commercial character recognizers are available reporting near to 100% recognition rates on text in a number of scripts. Desp...
详细信息
Optical Character recognition (OCR) is one of the continuously explored problems. Presently, commercial character recognizers are available reporting near to 100% recognition rates on text in a number of scripts. Despite these advancements, OCR systems however, have yet to mature for cursive scripts like Urdu. This study presents a holistic technique for recognition of Urdu text in Nastaliq font using "complete" ligatures as recognition units. The term "complete" refers to a partial word including its main body and secondary components (dots and diacritic marks). Discrete Wavelet Transform (DWT) is employed as feature extractor while a separate Hidden Markov Model (HMM) is trained for each ligature considered in our study. More than 2000 frequently used unique Urdu ligatures from the standard CLE (center of Language Engineering) dataset are considered in our evaluations. The system reads a promising accuracy of 88.87% on more than 10,000 partial words.
In this paper, we present a scheme towards recognition of English character in multi-scale and multi-oriented environments. Graphical document such as map consists of text lines which appear in different orientation. ...
详细信息
Three-dimensional rotational angiography (3DRA) is a promising imaging technique which yields high-resolution isotropic 3D images of vascular structures. Raw 3DRA images, however, usually suffer from a high noise leve...
详细信息
We propose an approach to 3-D non-rigid motion estimation from image sequence in this paper. First, with the establishment of feature point correspondence between consecutive image frames, the affine motion model and ...
详细信息
ISBN:
(纸本)0780384032
We propose an approach to 3-D non-rigid motion estimation from image sequence in this paper. First, with the establishment of feature point correspondence between consecutive image frames, the affine motion model and the central projection model are presented for local non-rigid motion. Then, in order to obtain the global motion parameters and overcome the ill-posed 3-D estimation problem, a framework of Markov random field (MRF) is proposed. By incorporating the motion prior constrains into the MRF, the motion smoothness feature between local regions is reflected. This converts the ill-posed problem into a well-posed one and guarantees a robust solution. Experimental results from a sequence of synthetic image sequence demonstrate the feasibility of the proposed approach.
暂无评论