The main goal of this work is to investigate a computational architecture for a document management environment. Its purpose is to digitalize and extract information from documents of any type, transforming them into ...
详细信息
The main goal of this work is to investigate a computational architecture for a document management environment. Its purpose is to digitalize and extract information from documents of any type, transforming them into structured electronic documents. The environment is divided into specification and extraction modules. In the first module, the user performs the document specification, capturing physical, semantic and process information from the document. The extraction module uses this information, in order to recognize documents of the same class as the specified one. This environment offers the possibility of visualizing the classification results and to correct eventual mistakes it has made. Also, it allows document reconstruction from physical and semantic information captured in the specification module.
Blocking artifacts are inherent and inevitable phenomena at low bit rates in current block-based video coding schemes. The paper proposes a deblocking technique to reduce blocking artifacts and improve visual quality....
详细信息
ISBN:
(纸本)0780374886
Blocking artifacts are inherent and inevitable phenomena at low bit rates in current block-based video coding schemes. The paper proposes a deblocking technique to reduce blocking artifacts and improve visual quality. Different from the post-processing technique adopted in MPEG-4, the proposed technique is directly applied to the reference image for effective motion prediction instead of just for display purposes. In other words, the deblocking filter is introduced into the motion compensation loop in the video coding. Because of the negative effects of the deblocking filter, such as blurring the details of the image, not every block boundary needs to be filtered. It has to be determined whether the boundaries between two blocks should be filtered or not. Therefore, we also propose a decision-making criterion based on the available motion information (vector and mode). Experimental results show that the proposed deblocking method can improve image quality both in subjective and objective aspects. It also outperforms the post-processing technique in MPEG-4.
visual communication is a subject area rich for exploration and discovery. A multi-perspective approach to learning about the various media that is inherent in modern society allows us to better understand what we see...
详细信息
visual communication is a subject area rich for exploration and discovery. A multi-perspective approach to learning about the various media that is inherent in modern society allows us to better understand what we see and what it means. In essence, it provides a framework within which visual literacy can be attained. This paper introduces some fundamental considerations of visual communication and encourages a multi-perspective approach to exploring visual media with the goal of persuading readers to recognize the importance of developing visual literacy.
We introduce a new approach for blind image watermarking: We derive robust semi-global features in wavelet domain and quantize them in order to embed the watermark. Quantization of statistics is carried out by adding ...
详细信息
We introduce a new approach for blind image watermarking: We derive robust semi-global features in wavelet domain and quantize them in order to embed the watermark. Quantization of statistics is carried out by adding scaled pseudo-random sequences that are visually unnoticeable. Our algorithm exhibits increased robustness against various attacks and it withstands standard benchmark attacks of wider trange than earlier methods (e.g., Stirmark, random bending etc.) and modifications such as compression, provided they do not cause too severe visual distortions.
Feature extraction is one of the key techniques in automatic fingerprint identification. Today, most feature extractions are based on the structure analysis of fingerprint ridges and valleys. The structure analysis de...
详细信息
ISBN:
(纸本)0780374908
Feature extraction is one of the key techniques in automatic fingerprint identification. Today, most feature extractions are based on the structure analysis of fingerprint ridges and valleys. The structure analysis depends on the accuracy of image segmentation. image segmentation is one of the most difficult techniques in imageprocessing. Therefore, the accuracy and correctness of the feature extraction are often not reliable. For this reason, we propose a method for feature extraction in fingerprint identification. In this method, the intensity gradient vector image is first generated using the expanded Prewitt operator. Then the orientation consistency of the gradient vector image is calculated. The feature points of the fingerprints are extracted from the orientation consistency by classification. This method is fast, simple and accurate.
In recent years, there has been a huge growth in the areas of multimedia and wireless communications. image transmission through bandwidth limited and high bit error rate wireless channel requires both compression and...
详细信息
ISBN:
(纸本)0780374029
In recent years, there has been a huge growth in the areas of multimedia and wireless communications. image transmission through bandwidth limited and high bit error rate wireless channel requires both compression and error resilient capabilities. We propose an error resilient image coding and smart error concealment schemes for DCT-based [1] image compression, such as JPEG and MPEG standards. It can successfully prevent error propagation between the transmitted data blocks with little overhead. In addition, we present a novel post-processing error concealment scheme, called Smart-IDCT (SIDCT). It tries to retain error free low frequency DCT information and discarding highly suspicious high frequency information. Then, we can retrieve low-resolution information instead of totally corrupted image block during the decoding process. The required computation power is much less than conventional error detection and correction schemes. Simulation results show that the overhead of ERIC is about 5% compared to the JPEG sequential DCT-based mode without restart marker. With the SIDCT post-processing, it can achieve excellent image quality (PSNR=31.78dB).
This note describes a synchronization method for use of the multi-channel display and the associated clustered PC rendering systems, within multi user Virtual Reality online games with real time frame rates. It provid...
详细信息
ISBN:
(纸本)9781581136203
This note describes a synchronization method for use of the multi-channel display and the associated clustered PC rendering systems, within multi user Virtual Reality online games with real time frame rates. It provides a real time surround game display to render the very complex 3D graphics objects. Our system is composed of Multi-channel Distributor, and visualization Clients. visual client system can be connected to multi monitors or projectors. With Multi-channel distributor, an application can be configured, at runtime, to span any number of visuals to makeup a complete surround and to manage the user game event. All visualization clients produced are frame synchronized to ensure visual uniformity. However, tightly synchronized PC cluster may cause a draw back in its rendering speed. So, we propose a dual scene graph management method to adaptively synchronize. One scene graph is conventionally organizes and control the rendering of its constituent objects. And the other is spatially grouping on the geometric objects found at the edge of two adjacent channels between of the clustered PCs and or not. In this paper describes the method of a construction dual-scene graph based clustered rendering system for scalable display in order to generate high-resolution images at real time frame rates. And also we are planning to develop a multi-projector display system which calibrates itself automatically regardless its shape of display surface and which provides a user with a seamless game display. Our results are well suited for surround 3D game displays such as multi-projector or multi-monitor running on a PC cluster.
A large body of human imageprocessing techniques use skin detection as a first step for subsequent feature extraction. The objective of this work is to provide an efficient tool to detect human skin in color images. ...
详细信息
A large body of human imageprocessing techniques use skin detection as a first step for subsequent feature extraction. The objective of this work is to provide an efficient tool to detect human skin in color images. Well-established methods of color modeling, such as histograms and Gaussian mixture models have enabled the construction of suitably accurate skin detectors. However such techniques are not ideal for use in various environments. We describe a method of skin detection using a back propagation neural network, and show considerable good performance for a large variety of color images. We also introduce genetic algorithms into the weights and biases optimization of the neural network. The paper focuses on the novel approach to design a neural network based skin detector, which is later used to retrieve skin-like homogeneous regions in color face images.
A visually-optimal quantization and rate-control strategy based on results of recent contrast sensitivity and suprathreshold summation experiments is proposed. At suprathreshold contrasts, masked detection thresholds ...
详细信息
A visually-optimal quantization and rate-control strategy based on results of recent contrast sensitivity and suprathreshold summation experiments is proposed. At suprathreshold contrasts, masked detection thresholds for wavelet subband quantization distortions were approximately equal for scale-3, 4, and 5 distortions; approximately 52% greater for scale-2 distortions; and approximately 84% greater for scale-1 distortions. Based on a suprathreshold error-pooling model, contrasts for individual subbands are selected to match these contrast ratios, and are adjusted to account for changes in relative sensitivity at suprathreshold contrasts. Quantization step sizes are then computed from the adjusted base contrasts. A target contrast is estimated from the desired rate, and rate control is performed by adjusting this contrast until the rate is met. images compressed with the proposed algorithm show improved visual quality at low bit rates.
This paper proposes a distance metric for motion vector histograms. Of a number of existing distance metrics for histograms, the weighted quadratic form distance is known to be efficient because of its flexibility. In...
详细信息
This paper proposes a distance metric for motion vector histograms. Of a number of existing distance metrics for histograms, the weighted quadratic form distance is known to be efficient because of its flexibility. In order for the distance to be efficient, however, a suitable selection for its weighting matrix is necessary. Proposed in this paper is a determination technique for the weighting matrix which can reflect human perceptual characteristics for motion similarity. Experimental results have confirmed that the proposed technique is useful as a similarity measure in video retrieval.
暂无评论