We propose a text scanner, which detects wide text strings in a sequence of scene images. For scene text detection, we use a multiple-CAMShift algorithm on a text probability image produced by a multi-layer perceptron...
详细信息
An improved Zernike moment using a region-based shape descriptor is presented. The improved Zernike moment not only has rotation invariance, but also has scale invariance that the unimproved Zernike moment does not ha...
详细信息
An improved Zernike moment using a region-based shape descriptor is presented. The improved Zernike moment not only has rotation invariance, but also has scale invariance that the unimproved Zernike moment does not have. The experimental results show that the improved Zernike moment has better invariant properties than the unimproved Zernike moment using a region-based shape descriptor.
An improved Zernike moment using as a region-based shape descriptor is presented. The improved Zernike moment not only has rotation invariance, but also has scale invariance that the unimproved Zernike moment does not...
详细信息
An improved Zernike moment using as a region-based shape descriptor is presented. The improved Zernike moment not only has rotation invariance, but also has scale invariance that the unimproved Zernike moment does not have. The experimental results show that the improved Zernike moment has better invariant properties than unimproved Zernike moment using as region-based shape descriptor.
Head detection is an important, but difficult task, if no restrictions such as static illumination, frontal face appearance or uniform background can be assumed. We present a system that is able to perform head detect...
详细信息
Vertical Bell Laboratories Layered Space-Time (V-BLAST) is a promising system that realizes the enormous capacity of multiple-input multiple-output (MIMO) communications. We present an extension of V-BLAST, and propos...
详细信息
ISBN:
(纸本)0780375890
Vertical Bell Laboratories Layered Space-Time (V-BLAST) is a promising system that realizes the enormous capacity of multiple-input multiple-output (MIMO) communications. We present an extension of V-BLAST, and propose an effective transmit power allocation scheme for the extended system. The proposed transmit power allocation scheme minimizes the bit error rate (BER) averaged over all detection stages, and requires small feedback overhead from the receiver to the transmitter. Simulation results show that the extended V-BLAST system with the proposed transmit power allocation scheme provides a significant reduction in the BER compared to the conventional V-BLAST system. When the minimum mean square error (MMSE) nulling is adopted, the extended V-BLAST system is found to achieve the BER performance comparable to that of the maximum likelihood (ML) detection for the conventional V-BLAST architecture.
Methods for mobile robot localization that use eigenspaces of panoramic snapshots of the environment are in general sensitive to changes in the illumination of the environment. Therefore, we propose an approach which ...
详细信息
Methods for mobile robot localization that use eigen spaces of panoramic snapshots of the environment are in general sensitive to changes in the illumination of the environment. Therefore, we propose an approach which...
详细信息
This paper designs and implements a financial invoice recognition system based on the features of the Chinese financial invoice. By using the linear whole block moving method in each vertical segment, a new fast algor...
Methods for mobile robot localization that use eigenspaces of panoramic snapshots of the environment are in general sensitive to changes in the illumination of the environment. Therefore, we propose an approach which ...
详细信息
Methods for mobile robot localization that use eigenspaces of panoramic snapshots of the environment are in general sensitive to changes in the illumination of the environment. Therefore, we propose an approach which achieves a reliable localization under severe illumination conditions. The method uses gradient filtering of the eigenspace. After testing the approach on images obtained by a mobile robot, we show that it outperforms the standard eigenspace-based recognition method.
Video object (VO) is an important concept in MPEG-4. For objects can be easily manipulated without visible distortion, the copyright protection of video objects becomes an important issue. This paper presents a waterm...
详细信息
ISBN:
(纸本)0780377133
Video object (VO) is an important concept in MPEG-4. For objects can be easily manipulated without visible distortion, the copyright protection of video objects becomes an important issue. This paper presents a watermarking scheme for video objects. Different from other methods, the proposed scheme employed inertia ellipse to achieve fast synchronization recovery in case the object was manipulated. Shape adaptive DCT and visual mask were combined to embed the watermark into the arbitrarily shaped object, which was designed to achieve the trade-off between the invisibility and the robustness. Experiments show that our algorithm is robust to object manipulations such as rotations, translations, scaling and lossy compression. Our scheme can be easily incorporated into the object-based coding framework of MPEG-4.
暂无评论