We study the problem of representing images within a multimedia Database Management System (DBMS), in order to support fast retrieval operations without compromising storage efficiency. To achieve this goal, we propos...
We study the problem of representing images within a multimedia Database Management System (DBMS), in order to support fast retrieval operations without compromising storage efficiency. To achieve this goal, we propose new image coding techniques which combine a wavelet representation, embedded coding of the wavelet coefficients, and segmentation of image-domain regions in the wavelet domain. A bitstream is generated in which each image region is encoded independently of other regions, without having to explicitly store information describing the regions. Simulation results show that our proposed algorithms achieve coding performance which compares favorably, both perceptually and objectively, to that achieved using state-of-the-art image/video coding techniques while additionally providing region-based support.
Smooth interpolants defined over tetrahedra are currently being developed for they have many applications in geography, solid modeling, finite element analysis, etc. In this paper, we will characterize a certain class...
详细信息
Smooth interpolants defined over tetrahedra are currently being developed for they have many applications in geography, solid modeling, finite element analysis, etc. In this paper, we will characterize a certain class of C-1 discrete tetrahedral interpolants with only C-1 data required. As special cases of the class characterized, we give two C-1 discrete tetrahedral interpolants which have concise expressions.
This paper is concerned with algorithms for removal additive noise from images. The proposed (α,ß)-trimmed mean filtering is suitable for application to real images corrupted by Gaussian, uniform and impulsive n...
详细信息
This paper is concerned with algorithms for removal additive noise from images. The proposed (α,ß)-trimmed mean filtering is suitable for application to real images corrupted by Gaussian, uniform and impulsive noise. The developed technique is a generalization of α-trimmed mean filter and have the same basic properties as rank-order filters. The actual performance of proposed technique was compared with that of average, median and midpoint filters and evaluated on noisy images by the error of restoration. The illustrative examples are given.
Concept generalization under incomplete domain theory is a very important research aspect in artificial intelligence. Current multilayer perceptron and EBL (explanation based learning) approaches cannot deal with it e...
详细信息
ISBN:
(纸本)0780342534
Concept generalization under incomplete domain theory is a very important research aspect in artificial intelligence. Current multilayer perceptron and EBL (explanation based learning) approaches cannot deal with it effectively. We present a new method, hybrid multilayer perceptron/EBL approach for concept generalization, which can deal with concept generalization more effectively.
The paper presents an image registration method based on a two-dimensional Hopfield neural network, where the problem of image matching is treated with the minimization of the energy function of the Hopfield neural ne...
详细信息
ISBN:
(纸本)0818682035
The paper presents an image registration method based on a two-dimensional Hopfield neural network, where the problem of image matching is treated with the minimization of the energy function of the Hopfield neural network. The input data used for registration are the locations of the corner points extracted from the images. In order to improve and expedite the matching process, a fast block-based algorithm is put forward, together with the laboratory results obtained, which show the effectiveness of the algorithm.
The framework of constructing a distributed multimedia system based on the server/client architecture is described in this paper. We focus our attention on the realization of synchronization presentation of different ...
详细信息
ISBN:
(纸本)0818678763
The framework of constructing a distributed multimedia system based on the server/client architecture is described in this paper. We focus our attention on the realization of synchronization presentation of different media in a multimedia application, and a set of QoS (qualify of service) parameters is given as a criterion to make a trade-off between overall performance of the system and the synchronization presentation in each multimedia application.
By studying the MPEG-2 based output rate control strategy in detail, the mechanism of accurate bit-rate control and rational bit allocation is analyzed. To solve the difficulties of bit-rate control from the drastic c...
详细信息
ISBN:
(纸本)0780343719
By studying the MPEG-2 based output rate control strategy in detail, the mechanism of accurate bit-rate control and rational bit allocation is analyzed. To solve the difficulties of bit-rate control from the drastic changes of coded bits, a bit-rate control strategy under scene change is proposed.
Matching of appearance-based object representations using eigenimages is computationally very demanding. Most commonly, to recognize an object in an image, parts of the input image are projected onto the eigenspace an...
详细信息
This paper introduces an approach to the regularization of the iterative image restoration methods based on the median filtering. The comparative analysis of low-pass and median regularization is performed. The median...
详细信息
This paper introduces an approach to the regularization of the iterative image restoration methods based on the median filtering. The comparative analysis of low-pass and median regularization is performed. The median filtering is shown to be more efficient as the regularization in the case of noise with mixed distribution (i.e. Gaussian + impulsive). The nonlinearity of the iterative method is provided by the constrain on nonnegativity that makes possible to solve the problem of band-limited extrapolation. The use of median regularization does not require to choose the regularization parameter in contrast to Tikhonov regularization. However, the window size is to be chosen according to the noise level and could be considered as a parameter for the adaptive regularization to preserve edges according to the masking effect of human vision system.
When an Automatic Speech recognition (ASR) system is applied in noisy environments, Voice Activity Detection (VAD) is crucial to the performance of the overall system. The employment of the VAD for ASR on embedded mob...
When an Automatic Speech recognition (ASR) system is applied in noisy environments, Voice Activity Detection (VAD) is crucial to the performance of the overall system. The employment of the VAD for ASR on embedded mobile systems will minimize physical distractions and make the system convenient to use. Conventional VAD algorithm is of high complexity, which makes it unsuitable for embedded mobile devices; or of low robustness, which holds back its application in mobile noisy environments. In this paper, we propose a robust VAD algorithm specifically designed for ASR on embedded mobile devices. The architecture of the proposed algorithm is based on a two-level decision making strategy, where there is an interaction between a lower features-based level and subsequent decision logic based on a finite-state machine. Many discriminating features are employed in the lower level to improve the robustness of the VAD. The two-level decision strategy allows different features to be used in different states and reduces the cost of the algorithm, which makes the proposed algorithm suitable for embedded mobile devices. The evaluation experiments show the proposed VAD algorithm is robust and contribute to the overall performance gain of the ASR system in various acoustic environments.
暂无评论