In this paper, a fast-search algorithm is introduced to reduce the complexity of LSF quantization in speech coding. A new inequality between the weighted mean and the weighted Euclidean distance is derived. Using this...
This paper describes building statistical language models for Persian from a corpus and incorporating them into a Persian continuous speech recognition (CSR) system. We used the Persian Text Corpus to build the language models. First, we preprocessed the corpus texts by correcting the differing orthography of words. The number of POS tags was also reduced by clustering the tags manually. We then extracted word-based monogram and POS-based bigram and trigram language models from the corpus. We also present the procedure for incorporating the language models into a Persian CSR system. Using the language models, a 274% reduction in word error rate was achieved in the best case.
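As a rough illustration of what a POS-based bigram model involves, the sketch below estimates bigram probabilities over tag sequences by maximum likelihood. The tag names, the `<s>` start marker, and the function names `train_bigram` and `bigram_prob` are assumptions made for this example; the abstract does not specify the tag set, the smoothing method, or the estimation procedure the authors used.

```python
from collections import Counter


def train_bigram(tag_sequences):
    """Count tag bigrams and their left contexts (hypothetical helper)."""
    context_counts = Counter()  # how often each tag appears as the preceding tag
    bigram_counts = Counter()   # how often each (previous, current) pair appears
    for tags in tag_sequences:
        padded = ["<s>"] + list(tags)  # "<s>" is an assumed sentence-start marker
        for prev, cur in zip(padded, padded[1:]):
            context_counts[prev] += 1
            bigram_counts[(prev, cur)] += 1
    return context_counts, bigram_counts


def bigram_prob(context_counts, bigram_counts, prev, cur):
    """Unsmoothed maximum-likelihood estimate of P(cur | prev)."""
    if context_counts[prev] == 0:
        return 0.0  # unseen context; a real system would apply smoothing here
    return bigram_counts[(prev, cur)] / context_counts[prev]


# Toy tag sequences; these are illustrative, not taken from the Persian Text Corpus.
corpus = [["N", "V"], ["N", "ADJ"], ["V", "N"]]
context_counts, bigram_counts = train_bigram(corpus)
p_v_given_n = bigram_prob(context_counts, bigram_counts, "N", "V")
p_n_given_start = bigram_prob(context_counts, bigram_counts, "<s>", "N")
```

A trigram model follows the same pattern with two tags of history. Unseen tag pairs get probability zero in this sketch, so a recognizer would normally pair it with some form of smoothing or back-off.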
ISBN: 9780780393110 (print)
Multiple reference frames are recommended for motion estimation and compensation to provide coding efficiency and error concealment in video codecs. However, multiple reference frames incur a higher hardware cost than the single reference frame used in conventional video codecs. Hence, there is a need to reduce the memory size required for storing multiple frames. In this work, we look closely at each MacroBlock (MB) of the currently decoded frame and determine, according to the coding parameters of the blocks in that MB, whether to store it as compressed or reconstructed data. In addition, the computational Complexity and Memory-required Distribution (CMD) of the currently decoded frame is explored within a Group Of Pictures (GOP). According to the CMD, the computational complexity is minimized by making full use of the given memory size. Furthermore, when the given memory size is insufficient, a signal concealment scheme is used to enable MBs to be stored as near-end motion vectors without residuals. Experimental results reveal that the proposed scheme can effectively constrain memory usage under a given memory size while storing five reference frames as fully as possible. In addition, because the buffer size for storing the currently decoded frame is allocated adequately, fairly good video quality can be achieved compared with conventional work that uses re-compression schemes for storing reference frames.
ISBN: 9781424404681 (print)
In this paper we introduce a new error measure, integrated reconstruction error (IRE), the minimization of which leads to the principal eigenvectors (without rotational ambiguity) of the data covariance matrix. We then present iterative algorithms for IRE minimization through the projection approximation. The proposed algorithm is referred to as the COnstrained Projection Approximation (COPA) algorithm, and its limiting case is called COPAL. We also discuss regularized algorithms, referred to as R-COPA and R-COPAL. Numerical experiments demonstrate that these algorithms successfully find the exact principal eigenvectors of the data covariance matrix.
This paper presents and implements an approach to parallel ACO algorithms. The principal idea is to make multiple ant colonies share and utilize only one pheromone matrix. We call it SHOP (SHaring One Pheromone matrix...
After analyzing the disadvantages of traditional text clustering method based on keywords set, a novel approach for clustering of Chinese text based on concept hierarchy is presented. It introduces a Chinese topic cla...
ISBN: 9781604234497 (print)
The quality and intelligibility of narrowband telephone speech can be improved by artificial bandwidth expansion (ABE), which expands the speech bandwidth using only information available in the narrowband speech signal. This paper describes an ABE method that generates a high-band expansion using spectral folding and then modifies the magnitude spectrum of the expansion band with spline curves. The performance of the ABE algorithm was evaluated by formal listening tests in three languages: American English, Russian, and Mandarin Chinese. The results of the listening tests indicate that ABE-processed speech was preferred to narrowband speech in all tested languages.
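The spectral folding step can be sketched as zero insertion: placing a zero after every sample doubles the sampling rate and mirrors the narrowband spectrum into the new upper band. The example below is a minimal pure-Python illustration under that assumption; the function names, the 8-sample frame, and the test tone are invented for the example, and the spline-based shaping of the expansion band described in the abstract is not implemented.

```python
import cmath
import math


def fold_upsample(samples):
    """Insert a zero after every sample, doubling the sampling rate."""
    widened = []
    for s in samples:
        widened.extend([s, 0.0])
    return widened


def dft_magnitudes(samples):
    """Magnitude of a direct (O(n^2)) discrete Fourier transform."""
    n = len(samples)
    return [
        abs(sum(samples[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)))
        for k in range(n)
    ]


# Assumed toy input: one cosine cycle across an 8-sample narrowband frame.
narrow = [math.cos(2 * math.pi * t / 8) for t in range(8)]
wide = fold_upsample(narrow)   # 16 samples at twice the original rate
mags = dft_magnitudes(wide)

# In the 16-point spectrum, bin 4 corresponds to the old Nyquist frequency.
# The original tone sits at bin 1 and its folded mirror image appears at bin 7.
original_tone = mags[1]
folded_image = mags[7]
```

The folded image has the same magnitude as the original component, which is why a method of this kind goes on to reshape the magnitude spectrum of the expansion band rather than using the mirrored band unchanged.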
Steganographic fingerprints are concerned with the individuality of the steganographic methods. The basic goals of this article are: 1. To evaluate sequential and randomly embedded steganographic evidence within digital...
In opportunistic SDMA, the base station forms several random beams simultaneously and relies on fast but limited feedback for scheduling users on the beams. In this paper, opportunistic SDMA is applied to an OFDMA dow...
Barcodes are widely used in the modern world. This paper presents a fast and robust recognition method for noisy Code 39 barcodes. The proposed method can be divided into two steps: search and decoding. In the fi...