Phylo-grammars, probabilistic models combining Markov chain substitution models with stochastic grammars, are powerful models for annotating structured features in multiple sequence alignments and analyzing the evolut...
详细信息
Phylo-grammars, probabilistic models combining Markov chain substitution models with stochastic grammars, are powerful models for annotating structured features in multiple sequence alignments and analyzing the evolution of those features. In the past, these methods have been cumbersome to implement and modify. xrate provides means for the rapid development of phylo-grammars (using a simple file format) and automated parameterization of those grammars from training data (via the expectation Maximization algorithm). xREI (pron. 'X-ray') is an intuitive, flexible AJAX (Asynchronous Javascript And XML) web interface to xrate providing grammar visualization tools as well as access to xrate's training and annotation functionality. It is hoped that this application will serve as a valuable tool to those developing phylo-grammars, and as a means for the exploration and dissemination of such models. xREI is available at http://***/xrei/.
An iterative Bayesian reconstruction algorithm for limited view angle tomography, or ectomography, based on the three-dimensional total variation (TV) norm has been developed. The TV norm has been described ill the li...
详细信息
An iterative Bayesian reconstruction algorithm for limited view angle tomography, or ectomography, based on the three-dimensional total variation (TV) norm has been developed. The TV norm has been described ill the literature as a method for reducing noise in two-dimensional images while preserving edges, without introducing ringing or edge artefacts. It has also been proposed as a 2D regularization function in Bayesian reconstruction, implemented in an expectation maximization algorithm (TV-EM). The TV-EM was developed for 2D single photon emission computed tomography imaging, and the algorithm is capable of smoothing noise while maintaining edges without introducing artefacts. The TV norm was extended from 2D to 3D and incorporated into an ordered subsets expectation maximization algorithm for limited view angle geometry. The algorithm, called TV3D-EM, was evaluated using a modelled point spread function and digital phantoms. Reconstructed images were compared with those reconstructed with the 2D filtered backprojection algorithm currently used in ectomography. Results show a substantial reduction in artefacts related to the limited view angle geometry, and noise levels were also improved. Perhaps most important, depth resolution was improved by at least 45%. In conclusion, the proposed algorithm has been shown to improve the perceived image quality.
Reinforcement learning models generally assume that a stimulus is presented that allows a learner to unambiguously identify the state of nature, and the reward received is drawn from a distribution that depends on tha...
详细信息
Reinforcement learning models generally assume that a stimulus is presented that allows a learner to unambiguously identify the state of nature, and the reward received is drawn from a distribution that depends on that state. However, in any natural environment, the stimulus is noisy. When there is state uncertainty, it is no longer immediately obvious how to perform reinforcement learning, since the observed reward cannot be unambiguously allocated to a state of the environment. This letter addresses the problem of incorporating state uncertainty in reinforcement learning models. We show that simply ignoring the uncertainty and allocating the reward to the most likely state of the environment results in incorrect value estimates. Furthermore, using only the information that is available before observing the reward also results in incorrect estimates. We therefore introduce a new technique, posterior weighted reinforcement learning, in which the estimates of state probabilities are updated according to the observed rewards ( e. g., if a learner observes a reward usually associated with a particular state, this state becomes more likely). We show analytically that this modified algorithm can converge to correct reward estimates and confirm this with numerical experiments. The algorithm is shown to be a variant of the expectation-maximization algorithm, allowing rigorous convergence analyses to be carried out. A possible neural implementation of the algorithm in the cortico-basal-ganglia-thalamic network is presented, and experimental predictions of our model are discussed.
The usual algorithm for internal preference mapping requires a complete set of observations, meaning the technique cannot be used to analyse trials based on incomplete block designs. A simulation study was carried out...
详细信息
The usual algorithm for internal preference mapping requires a complete set of observations, meaning the technique cannot be used to analyse trials based on incomplete block designs. A simulation study was carried out to compare techniques for imputing missing values under various conditions. Sets of simulated preference data with different characteristics were constructed. Monte Carlo simulation was used to create missing observations in these sets;the imputation techniques were applied to the data;and the results of preference mapping based on the imputed data compared to those from the complete data set. Convergence problems were found with two techniques. Analysis of variance revealed that effects on performance were dominated by the proportion of data missing, the level of noise in the data, and the size of the data set. Differences in performance among the three convergent imputation techniques were small;mean substitution is recommended, as it performed as well as more complex iterative techniques. The results were broadly confirmed by a similar study on a genuine set of preference data.
Beta regression has been extensively used by statisticians and practitioners to model bounded continuous data without a strong competitor having the same main features. A class of normalised inverse-Gaussian (N-IG) pr...
详细信息
Beta regression has been extensively used by statisticians and practitioners to model bounded continuous data without a strong competitor having the same main features. A class of normalised inverse-Gaussian (N-IG) process was introduced in the literature and has been explored in the Bayesian context as a powerful alternative to the Dirichlet process. Until this moment, no attention has been paid to the univariate N-IG distribution in the classical inference. In this paper, we propose the bessel regression based on the univariate N-IG distribution, which is an alternative to the beta model. The estimation of the parameters is done through an expectation-maximisation (EM) algorithm and the paper discusses how to perform inference. A useful and practical discrimination procedure is proposed for model selection between bessel and beta regressions. A new R package called bbreg is developed for fitting both bessel and beta regression models based on the EM-algorithm and further providing graphical tools for model adequacy and model selection as well. Proper documentation for this package is available. The performances of the models are evaluated under misspecification in a simulation study. An empirical illustration is explored to confront results from bessel and beta regressions by using the new R package bbreg.
The rate of change in a continuous variable, measured serially over time, is often used as an outcome in longitudinal studies or clinical trials. When patients terminate the study before the scheduled end of the study...
详细信息
The rate of change in a continuous variable, measured serially over time, is often used as an outcome in longitudinal studies or clinical trials. When patients terminate the study before the scheduled end of the study, there is a potential for bias in estimation of rate of change using standard methods which ignore the missing data mechanism. These methods include the use of unweighted generalized estimating equations methods and likelihood-based methods assuming an ignorable missing data mechanism. We present a model for analysis of informatively censored data, based on an extension of the two-stage linear random effects model, where each subject's random intercept and slope are allowed to be associated with an underlying time to event. The joint distribution of the continuous responses and the time-to-event variable are then estimated via maximum likelihood using the EM algorithm, and using the bootstrap to calculate standard errors. We illustrate this methodology and compare it to simpler approaches and usual maximum likelihood using data from a multi-centre study of the effects of diet and blood pressure control on progression of renal disease, the Modification of Diet in Renal Disease (MDRD) Study. Sensitivity analyses and simulations are used to evaluate the performance of this methodology in the context of the MDRD data, under various scenarios where the drop-out mechanism is ignorable as well as non-ignorable. Copyright (C) 2001 John Wiley & Sons, Ltd.
Case-parent triad data are considered a robust basis for studying association between variants of a gene and a disease. Methods evaluating statistical significance of association, like the TDT-test and its extensions,...
详细信息
Case-parent triad data are considered a robust basis for studying association between variants of a gene and a disease. Methods evaluating statistical significance of association, like the TDT-test and its extensions, are frequently used. When there are prior hypotheses of a causal effect of the gene under study, however, methods measuring penetrance of alleles or haplotypes as relative risks will be more informative. Log-linear models have been proposed as a flexible tool for such relative risk estimation. We demonstrate an extension of the log-linear model to a natural framework for also estimating effects of multiple alleles or haplotypes, incorporating both single- and double-dose effects. The model also incorporates effects of single- and double-dose maternal haplotypes on a fetus during pregnancy. Unknown phase of haplotypes as well as missing parents are accounted for by the EM algorithm. A number of numerical improvements to maximum likelihood estimation are also implemented to facilitate a larger number of haplotypes. Software for these analyses, HAPLIN, is publicly available through our web site. As an illustration we have re-analyzed data on the MSX1 homeobox-gene on chromosome 4 to show how haplotypes may influence the risk of oral clefts.
A realistic numerical brain phantom, developed by Zubal et al, was used for a region-of-interest evaluation of the accuracy and noise variance of the following SPECT reconstruction methods: 1) Maximum-Likelihood recon...
详细信息
A realistic numerical brain phantom, developed by Zubal et al, was used for a region-of-interest evaluation of the accuracy and noise variance of the following SPECT reconstruction methods: 1) Maximum-Likelihood reconstruction using the expectation-Maximization (ML-EM) algorithm;2) an EM algorithm using ordered-subsets (OS-EM);3) a re-scaled block iterative EM algorithm (RBI-EM);and 4) a filtered backprojection algorithm that uses a combination of the Bellini method for attenuation compensation and an iterative spatial blurring correction method using the frequency-distance principle (FDP). The Zubal phantom was made from segmented MRI slices of the brain, so that neuro-anatomical structures are well defined and indexed. Small regions-of-interest (ROIs) from the white matter, grey matter in the center of the brain and grey matter from the peripheral area of the brain were selected for the evaluation. Photon attenuation and distance-dependent collimator blurring were modeled. Multiple independent noise realizations were generated for two different count levels. The simulation study showed that the ROI bias measured for the EM-based algorithms decreased as the iteration number increased, and that the OS-EM and RBI-EM algorithms (16 and 64 subsets were used) achieved the equivalent accuracy of the ML-EM algorithm at about the same noise variance, with much fewer number of iterations. The Bellini-FDP restoration algorithm converged fast and required less computation per iteration. The ML-EM algorithm had a slightly better ROI bias vs. variance trade-off than the other algorithms.
Interval-censored failure time data occur in many medical investigations as well as other studies such as demographical and sociological studies. They include the usual right-censored failure time data as a special ca...
详细信息
Interval-censored failure time data occur in many medical investigations as well as other studies such as demographical and sociological studies. They include the usual right-censored failure time data as a special case but provide much more complex structure and less relevant information than the right-censored data. This article reviews some basic concepts, issues and the corresponding statistical approaches related to the analysis of interval-censored data as well as recent advances. In particular, we discuss estimation of a survival function, comparison of several treatments and regression analysis as well as competing risks analysis and truncation in the presence of interval censoring. A well-known example of interval-censored data is described and analysed to illustrate some of the statistical procedures discussed..
We propose a multivariate regime switching model based on a Student-t$$ t $$ copula function with parameters controlling the strength of correlation between variables and that are governed by a latent Markov process. ...
详细信息
We propose a multivariate regime switching model based on a Student-t$$ t $$ copula function with parameters controlling the strength of correlation between variables and that are governed by a latent Markov process. To estimate model parameters by maximum likelihood, we consider a two-step procedure carried out through the expectation-maximisation algorithm. To address the main computational burden related to the estimation of the matrix of dependence parameters and the number of degrees of freedom of the Student-t$$ t $$ copula, we show a novel use of the Lagrange multipliers, which simplifies the estimation process. The simulation study shows that the estimators have good finite sample properties and the estimation procedure is computationally efficient. An application concerning log-returns of five cryptocurrencies shows that the model permits identifying bull and bear market periods based on the intensity of the correlations between crypto assets.
暂无评论