This paper considers the problem of lossy source coding with side information at the decoder only, for Gaussian sources, when the joint statistics of the sources are partly unknown. We propose a practical universal co...
详细信息
ISBN:
(纸本)9781479903566
This paper considers the problem of lossy source coding with side information at the decoder only, for Gaussian sources, when the joint statistics of the sources are partly unknown. We propose a practical universal coding scheme based on scalar quantization and non-binary LDPC codes, which avoids the binarization of the quantized coefficients. We first explain how to choose the rate and to construct the EDPC coding matrix. Then, a decoding algorithm that jointly estimates the source sequence and the joint statistics of the sources is proposed. The proposed coding scheme suffers no loss compared to the practical coding scheme with same rate hut known variance.
A computationally efficient method of three-dimensional (3-D) localization of underwater acoustic sources using an acoustic vector sensor (AVS) array is presented in this paper. Noise is modeled as a Gaussian mixture ...
详细信息
ISBN:
(纸本)9789380095455
A computationally efficient method of three-dimensional (3-D) localization of underwater acoustic sources using an acoustic vector sensor (AVS) array is presented in this paper. Noise is modeled as a Gaussian mixture (GM), and an expectationmaximization (EM) algorithm is used to estimate all the unknown parameters. The proposed method consists of an initialization stage followed by iterative updates which converge in a small number of iterations. Performance of the proposed method is better than that of existing methods, and performance of the AVS array is much better than that of the acoustic pressure sensor (APS) array.
Background: Many problems in computational biology require alignment-free sequence comparisons. One of the common tasks involving sequence comparison is sequence clustering. Here we apply methods of alignment-free com...
详细信息
Background: Many problems in computational biology require alignment-free sequence comparisons. One of the common tasks involving sequence comparison is sequence clustering. Here we apply methods of alignment-free comparison (in particular, comparison using sequence composition) to the challenge of sequence clustering. Results: We study several centroid based algorithms for clustering sequences based on word counts. Study of their performance shows that using k-means algorithm with or without the data whitening is efficient from the computational point of view. A higher clustering accuracy can be achieved using the soft expectationmaximization method, whereby each sequence is attributed to each cluster with a specific probability. We implement an open source tool for alignment-free clustering. It is publicly available from github: https://***/luscinius/afcluster. Conclusions: We show the utility of alignment-free sequence clustering for high throughput sequencing analysis despite its limitations. In particular, it allows one to perform assembly with reduced resources and a minimal loss of quality. The major factor affecting performance of alignment-free read clustering is the length of the read.
Background: Highly mutable RNA viruses exist in infected hosts as heterogeneous populations of genetically close variants known as quasispecies. Next-generation sequencing (NGS) allows for analysing a large number of ...
详细信息
Background: Highly mutable RNA viruses exist in infected hosts as heterogeneous populations of genetically close variants known as quasispecies. Next-generation sequencing (NGS) allows for analysing a large number of viral sequences from infected patients, presenting a novel opportunity for studying the structure of a viral population and understanding virus evolution, drug resistance and immune escape. Accurate reconstruction of genetic composition of intra-host viral populations involves assembling the NGS short reads into whole-genome sequences and estimating frequencies of individual viral variants. Although a few approaches were developed for this task, accurate reconstruction of quasispecies populations remains greatly unresolved. Results: Two new methods, AmpMCF and ShotMCF, for reconstruction of the whole-genome intra-host viral variants and estimation of their frequencies were developed, based on Multicommodity Flows (MCFs). AmpMCF was designed for NGS reads obtained from individual PCR amplicons and ShotMCF for NGS shotgun reads. While AmpMCF, based on covering formulation, identifies a minimal set of quasispecies explaining all observed reads, ShotMCS, based on packing formulation, engages the maximal number of reads to generate the most probable set of quasispecies. Both methods were evaluated on simulated data in comparison to Maximum Bandwidth and ViSpA, previously developed state-of-the-art algorithms for estimating quasispecies spectra from the NGS amplicon and shotgun reads, respectively. Both algorithms were accurate in estimation of quasispecies frequencies, especially from large datasets. Conclusions: The problem of viral population reconstruction from amplicon or shotgun NGS reads was solved using the MCF formulation. The two methods, ShotMCF and AmpMCF, developed here afford accurate reconstruction of the structure of intra-host viral population from NGS reads. The implementations of the algorithms are available at http://***/vira.h
This paper is concerned with identification of nonlinear systems with multiple and correlated scheduling variables. Multiple auto regressive exogenous (ARX) models are identified on different process operating conditi...
详细信息
This paper is concerned with identification of nonlinear systems with multiple and correlated scheduling variables. Multiple auto regressive exogenous (ARX) models are identified on different process operating conditions, and a normalized exponential function as the probability density function associated with each of the local ARX models taking effect is then used to combine all the local models to represent the complete dynamics of a nonlinear system. The parameters of the local ARX models and the exponential functions are estimated simultaneously under the framework of the expectationmaximization (EM) algorithm. A numerical example is applied to demonstrate the proposed identification method.
This paper presents an unsupervised satellite color image segmentation approach based on Bivariate Beta type-II. Such a method could be considered as original since it uses a K-Means clustering algorithm in order to i...
详细信息
ISBN:
(纸本)9781479900626
This paper presents an unsupervised satellite color image segmentation approach based on Bivariate Beta type-II. Such a method could be considered as original since it uses a K-Means clustering algorithm in order to initialize the image class number. Moreover, it exploits a Bivariate Beta type-II for statistical distributions applying it for each class. Satellite image exploitation requires the use of different approaches, especially those based on the unsupervised statistical segmentation principle. Such approaches necessitate the definition of several parameters such as image class number, class variables estimation and mixture distributions. The use of statistical image attributes has allowed us to get convincing results, provided that we ensure under the condition of having an initialization step with appropriate statistical distributions. Bivariate Beta type-II associated with a K-means clustering algorithm and expectation-maximization (EM) algorithm could be adapted to such a problem. For each image class, Bivariate Beta type-II attributes a specific distribution type according to different parameters. Different adapted algorithms (namely K-Means clustering algorithm, EM algorithm and Bivariate Beta type-II algorithm) are then applied to the satellite image segmentation problem. The efficiency of those combined algorithms is validated with the Mean Squared Errors (MSE), Signal to Noise Ratio (SNR) and Maximum Distance (MD).
This paper considers the problem of lossy source coding with side information at the decoder only, for Gaussian sources, when the joint statistics of the sources are partly unknown. We propose a practical universal co...
详细信息
ISBN:
(纸本)9781479903573
This paper considers the problem of lossy source coding with side information at the decoder only, for Gaussian sources, when the joint statistics of the sources are partly unknown. We propose a practical universal coding scheme based on scalar quantization and non-binary LDPC codes, which avoids the binarization of the quantized coefficients. We first explain how to choose the rate and to construct the LDPC coding matrix. Then, a decoding algorithm that jointly estimates the source sequence and the joint statistics of the sources is proposed. The proposed coding scheme suffers no loss compared to the practical coding scheme with same rate but known variance.
Term suggestion is a kind of information retrieval technique that attempts to suggest relevant terms to help users formulate more effective queries and reduce unnecessary search steps. In this paper, we apply two sema...
详细信息
Term suggestion is a kind of information retrieval technique that attempts to suggest relevant terms to help users formulate more effective queries and reduce unnecessary search steps. In this paper, we apply two semantic analysis methods, the probabilistic analysis model and semantic analysis graph, to design a term suggestion system that can effectively deal with the problems of synonymy and polysemy. The main contributions of this paper are the following. First, we apply two semantic analysis methods to design a high-performance term suggestion system. Second, we design an intelligent mechanism that can effectively balance cost and performance to minimize the number of iterations required for our system. (C) 2012 Elsevier B.V. All rights reserved.
This paper proposes a novel graph-based transductive learning algorithm based on manifold regularization. First, the manifold regularization was introduced to probabilistic discriminant model for semi-supervised class...
详细信息
This paper proposes a novel graph-based transductive learning algorithm based on manifold regularization. First, the manifold regularization was introduced to probabilistic discriminant model for semi-supervised classification task. And then a variation of the expectationmaximization (EM) algorithm was derived to solve the optimization problem, which leads to an iterative algorithm. Although our method is developed in probabilistic framework, there is no need to make assumption about the specific form of data distribution. Besides, the crucial updating formula has closed form. This method was evaluated for text categorization on two standard datasets, 20 news group and Reuters-21578. Experiments show that our approach outperforms the state-of-the-art graph-based transductive learning methods.
Reliable detection of primary user activity increases the opportunity to access temporarily unused bands and prevents harmful interference to the primary system. By extracting a global decision from local sensing resu...
详细信息
Reliable detection of primary user activity increases the opportunity to access temporarily unused bands and prevents harmful interference to the primary system. By extracting a global decision from local sensing results, cooperative sensing achieves high reliability against multipath fading. For the effective combining of sensing results, which is generalized by a likelihood ratio test, the fusion center should learn some parameters, such as the probabilities of primary transmission, false alarm, and detection at the local sensors. During the training period in supervised learning, the on/off log of primary transmission serves as the output label of decision statistics from the local sensor. In this paper, we extend unsupervised learning techniques with an expectation maximization algorithm for cooperative spectrum sensing, which does not require an external primary transmission log. Local sensors report binary hard decisions to the fusion center and adjust their operating points to enhance learning performance. Increasing the number of sensors, the joint-expectation step makes a confident classification on the primary transmission as in the supervised learning. Thereby, the proposed scheme provides accurate parameter estimates and a fast convergence rate even in low signal-to-noise ratio regimes, where the primary signal is dominated by the noise at the local sensors.
暂无评论