Data augmentation, sometimes known as the method of auxiliary variables, is a powerful tool for constructing optimisation and simulation algorithms. In the context of optimisation, Meng & van Dyk (1997, 1998) reported several successes of the 'working parameter' approach for constructing efficient data-augmentation schemes for fast and simple EM-type algorithms. This paper investigates the use of working parameters in the context of Markov chain Monte Carlo, in particular in the context of Tanner & Wong's (1987) data augmentation algorithm, via a theoretical study of two working-parameter approaches, the conditional augmentation approach and the marginal augmentation approach. Posterior sampling under the univariate t model is used as a running example, which particularly illustrates how the marginal augmentation approach obtains a fast-mixing positive recurrent Markov chain by first constructing a nonpositive recurrent Markov chain in a larger space.
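To fix ideas, here is a minimal sketch of the standard data-augmentation (Gibbs) sampler for the univariate t model that serves as the paper's running example, assuming known degrees of freedom nu and a flat prior p(mu, sigma^2) proportional to 1/sigma^2. The latent precisions q_i are the auxiliary variables; the marginal augmentation scheme studied in the paper would further rescale them by an unidentified working parameter, which this sketch omits.

```python
import numpy as np

rng = np.random.default_rng(0)

def t_gibbs(y, nu=4.0, n_iter=2000):
    """Data-augmentation sampler for y_i ~ t_nu(mu, sigma^2).

    Augmentation: y_i | q_i ~ N(mu, sigma^2 / q_i), q_i ~ Gamma(nu/2, rate=nu/2).
    Flat prior p(mu, sigma^2) ~ 1/sigma^2 is assumed for illustration.
    """
    n = len(y)
    mu, sig2 = np.mean(y), np.var(y)
    draws = np.empty((n_iter, 2))
    for it in range(n_iter):
        # Draw latent weights q_i given (mu, sigma^2).
        q = rng.gamma(shape=(nu + 1) / 2,
                      scale=2.0 / (nu + (y - mu) ** 2 / sig2))
        # Draw mu given (q, sigma^2): normal with precision sum(q)/sigma^2.
        w = q.sum()
        mu = rng.normal((q * y).sum() / w, np.sqrt(sig2 / w))
        # Draw sigma^2 given (q, mu): inverse gamma via a gamma draw.
        sig2 = (q * (y - mu) ** 2).sum() / 2 / rng.gamma(n / 2)
        draws[it] = mu, sig2
    return draws

y = rng.standard_t(df=4, size=200) * 1.5 + 3.0   # synthetic t data
samples = t_gibbs(y)
print(samples[500:].mean(axis=0))                # posterior means of (mu, sigma^2)
```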
In likelihood-based approaches to robustifying state space models, Gaussian error distributions are replaced by non-normal alternatives with heavier tails. Robustified observation models are appropriate for time series with additive outliers, while state or transition equations with heavy-tailed error distributions lead to filters and smoothers that can cope with structural changes in trend or slope caused by innovations outliers. As a consequence, however, conditional filtering and smoothing densities become analytically intractable. Various attempts have been made to deal with this problem, ranging from approximate conditional mean type estimation to fully Bayesian analysis using MCMC simulation. In this article we consider penalized likelihood smoothers, that is, estimators which maximize penalized likelihoods or, equivalently, posterior densities. Filtering and smoothing for additive and innovations outlier models can be carried out by computationally efficient Fisher scoring steps or iterative Kalman-type filters. Special emphasis is on the Student family, for which EM-type algorithms to estimate unknown hyperparameters are developed. Operational behaviour is illustrated by simulation experiments and by real data applications.
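As a rough illustration of the iterative Kalman-type filters mentioned above, the sketch below implements an iteratively reweighted RTS smoother for a local level model with Student-t observation errors (the additive-outlier case). The hyperparameters sigma2, q and nu are treated as known here, whereas the paper estimates them with EM-type algorithms; the weight formula is the standard one for the t family.

```python
import numpy as np

def kalman_smooth(y, obs_var, q):
    """RTS smoother for the local level model x_t = x_{t-1} + eta_t (var q),
    y_t = x_t + eps_t, with per-observation noise variances obs_var[t]."""
    n = len(y)
    a = np.empty(n); p = np.empty(n)          # filtered means / variances
    x, v = 0.0, 1e7                           # diffuse initial state
    for t in range(n):
        xp, vp = x, v + q                     # predict
        k = vp / (vp + obs_var[t])            # Kalman gain
        x, v = xp + k * (y[t] - xp), (1 - k) * vp
        a[t], p[t] = x, v
    s = a.copy()
    for t in range(n - 2, -1, -1):            # backward (RTS) pass
        g = p[t] / (p[t] + q)
        s[t] = a[t] + g * (s[t + 1] - a[t])
    return s

def robust_smooth(y, sigma2=1.0, q=0.1, nu=4.0, n_iter=20):
    """Iteratively reweighted smoothing for t_nu observation errors:
    downweight observations with large residuals, re-smooth, repeat."""
    s = np.asarray(y, float).copy()
    for _ in range(n_iter):
        r = y - s                                        # current residuals
        w = (nu + 1) / (nu + r ** 2 / sigma2)            # t-family weights
        s = kalman_smooth(y, obs_var=sigma2 / w, q=q)    # reweighted pass
    return s
```

On the first pass the residuals are zero and the weights are constant, so the procedure starts from an ordinary Kalman smooth and then progressively discounts outlying observations.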
Airborne laser scanner data collected over forests provide a canopy height. To obtain tree heights from airborne laser scanner data one needs a recovery model. Two such models, one (A) assuming that observations are sampled with probability proportional to displayed crown area, and the other (B) derived from the probability that a laser beam penetrates to a given canopy depth, were developed and applied to laser scanner data obtained over stands of Douglas-fir. Model estimates of recovered arithmetic mean tree heights and quantiles (75%, 85%, and 95%) were not significantly (P > 0.24) different from ground-based equivalents. An overall mean bias of -3 m in the laser canopy heights was eliminated by both methods. The median absolute differences between observed and predicted plot means and quantiles were reduced by 40 to 60%. Three alternative recovery procedures are presented for model B. For a single plot, the predictions varied significantly among the models and estimation procedures with no consistent pattern. Predictions of arithmetic mean heights were best for plots with no understory, while predictions of upper quantiles were consistent in all plots.
We show that under reasonable conditions the nonparametric maximum likelihood estimate (NPMLE) of the distribution function from left-truncated and case 1 interval-censored data is inconsistent, in contrast to the consistency properties of the NPMLE from only left-truncated data or only interval-censored data. However, the conditional NPMLE is shown to be consistent. Numerical examples are provided to illustrate their finite sample properties.
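For context, with case 1 interval-censored (current status) data and no truncation, the NPMLE of F at the ordered inspection times is the isotonic regression of the censoring indicators, computable by the pool-adjacent-violators algorithm; a minimal sketch follows. The conditional NPMLE that the paper shows to be consistent additionally conditions on the truncation event, which this sketch does not handle.

```python
import numpy as np

def current_status_npmle(c, delta):
    """NPMLE of F from current status data (c_i, delta_i), where
    delta_i = 1{T_i <= c_i}: isotonic regression of delta on c via
    pool-adjacent-violators."""
    order = np.argsort(c)
    d = np.asarray(delta, float)[order]
    vals, wts = [], []
    for x in d:
        vals.append(x); wts.append(1.0)
        # Merge adjacent blocks until block means are nondecreasing.
        while len(vals) > 1 and vals[-2] > vals[-1]:
            w = wts[-2] + wts[-1]
            v = (wts[-2] * vals[-2] + wts[-1] * vals[-1]) / w
            vals[-2:] = [v]; wts[-2:] = [w]
    f = np.repeat(vals, np.array(wts, dtype=int))
    return np.sort(c), f          # F-hat evaluated at the ordered inspection times
```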
Mixed-distribution modeling was proposed in order to analyze heterogeneity of costs and lengths of stay within Diagnosis Related Groups (DRGs). A mixed-distribution model based on Weibull distributions was applied to 791 discharge abstracts of French DRG no. 450 (Health Care Financing Administration 3 DRG no. 316, "Renal failure") from a national database. Three subgroups of cost and length of stay were identified. Except for age, clinical criteria significantly linked with the long-stay subgroup were the same as those associated with the high-cost subgroup: acute renal failure, intensive care, infectious complications, and vascular investigations. The identification of factors associated with high costs, based on the proposed model, will allow physicians to understand more accurately how their choice of specific procedures influences hospital costs. J Clin Epidemiol 52(3):251-258, 1999. © 1999 Elsevier Science Inc.
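The abstract does not spell out the fitting method, but a finite mixture of Weibulls of the kind described is typically fitted by EM; the sketch below, with three components and numerically optimized M-steps, is one plausible reading rather than the paper's exact procedure.

```python
import numpy as np
from scipy.optimize import minimize
from scipy.stats import weibull_min

def weibull_mixture_em(x, k=3, n_iter=100, seed=0):
    """EM for a k-component Weibull mixture, e.g. for length-of-stay data."""
    rng = np.random.default_rng(seed)
    pi = np.full(k, 1 / k)
    shape = rng.uniform(0.8, 2.0, k)
    scale = np.quantile(x, (np.arange(k) + 1) / (k + 1))   # spread initial scales
    for _ in range(n_iter):
        # E-step: posterior probability that x_i came from component j.
        dens = np.stack([weibull_min.pdf(x, shape[j], scale=scale[j])
                         for j in range(k)], axis=1)
        r = pi * dens
        r /= r.sum(axis=1, keepdims=True)
        # M-step: mixing weights in closed form, Weibull parameters numerically.
        pi = r.mean(axis=0)
        for j in range(k):
            def nll(theta, w=r[:, j]):
                a, b = np.exp(theta)          # positivity via log-parameters
                return -(w * weibull_min.logpdf(x, a, scale=b)).sum()
            res = minimize(nll, x0=np.log([shape[j], scale[j]]),
                           method="Nelder-Mead")
            shape[j], scale[j] = np.exp(res.x)
    return pi, shape, scale
```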
This paper reviews estimation problems with missing, or hidden data. We formulate this problem in the context of Markov models and consider two interrelated issues, namely, the estimation of a state given measured data and model parameters, and the estimation of model parameters given the measured data alone. We also consider situations where the measured data is, itself, incomplete in some sense. We deal with various combinations of discrete and continuous states and observations.
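The first of the two interrelated issues, estimating the state given measured data and model parameters, is solved for a discrete hidden Markov model by the forward-backward recursions, which also form the E-step of the EM (Baum-Welch) estimator addressing the second issue. A minimal scaled implementation, as a sketch:

```python
import numpy as np

def forward_backward(obs, A, B, pi):
    """Smoothed state posteriors P(x_t | y_1..T) for a discrete HMM.

    A: transition matrix, B[state, symbol]: emission probabilities,
    pi: initial distribution. Scaling keeps the recursions numerically stable.
    """
    T, S = len(obs), len(pi)
    alpha = np.zeros((T, S)); beta = np.ones((T, S)); c = np.zeros(T)
    alpha[0] = pi * B[:, obs[0]]
    c[0] = alpha[0].sum(); alpha[0] /= c[0]
    for t in range(1, T):                     # forward pass (scaled)
        alpha[t] = (alpha[t - 1] @ A) * B[:, obs[t]]
        c[t] = alpha[t].sum(); alpha[t] /= c[t]
    for t in range(T - 2, -1, -1):            # backward pass (same scaling)
        beta[t] = (A @ (B[:, obs[t + 1]] * beta[t + 1])) / c[t + 1]
    gamma = alpha * beta                      # smoothed posteriors
    return gamma / gamma.sum(axis=1, keepdims=True)
```

The scaling constants c[t] also yield the log-likelihood, sum(log c), which is what an outer EM loop over the model parameters would monitor.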
Principal component analysis (PCA) is a ubiquitous technique for data analysis and processing, but one which is not based on a probability model. We demonstrate how the principal axes of a set of observed data vectors may be determined through maximum likelihood estimation of parameters in a latent variable model that is closely related to factor analysis. We consider the properties of the associated likelihood function, giving an EM algorithm for estimating the principal subspace iteratively, and discuss, with illustrative examples, the advantages conveyed by this probabilistic approach to PCA.
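A compact sketch of the EM iteration for this latent variable model (probabilistic PCA): x = W z + mu + eps with z ~ N(0, I) and isotropic noise eps ~ N(0, sigma^2 I), whose maximum likelihood solution for W spans the principal subspace. The updates are the standard closed-form ones; initialization and convergence checking are simplified here.

```python
import numpy as np

def ppca_em(X, q=2, n_iter=200, seed=0):
    """EM for probabilistic PCA. Returns W (d x q) and the noise variance."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    Xc = X - X.mean(axis=0)
    W = rng.standard_normal((d, q))
    sigma2 = 1.0
    for _ in range(n_iter):
        # E-step: posterior moments of the latent variables z_n.
        M = W.T @ W + sigma2 * np.eye(q)
        Minv = np.linalg.inv(M)
        Ez = Xc @ W @ Minv                    # E[z_n], stacked as n x q
        Ezz = n * sigma2 * Minv + Ez.T @ Ez   # sum_n E[z_n z_n^T]
        # M-step: update W and sigma^2 in closed form.
        W = Xc.T @ Ez @ np.linalg.inv(Ezz)
        sigma2 = (np.sum(Xc ** 2)
                  - 2 * np.sum((Xc @ W) * Ez)
                  + np.trace(Ezz @ W.T @ W)) / (n * d)
    return W, sigma2
```

At convergence the columns of W span the principal subspace; an orthonormal basis can be recovered from the SVD of W.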
We consider the problem of modelling the failure-time distribution, where failure is due to two distinct causes. One approach is to adopt a two-component mixture model where the components correspond to the two different causes of failure. However, routine application of this approach with typical parametric forms for the component densities proves to be inadequate in modelling the time to a re-replacement operation or death after the initial replacement of the aortic valve in the heart by a prosthesis, such as a xenograft valve. Hence we consider modifications to the usual mixture model approach to handle situations where there exists a strong dependency between the failure times of the distinct causes. With these modifications, a suitable model can be provided for the distribution of the time to a re-replacement operation conditional on the age of the patient at the time of the initial replacement operation. The estimate so obtained of the probability that a patient of a given age will undergo a re-replacement operation provides a useful guide to heart surgeons on the type of valve to be used in view of the patient's age. Copyright © 1999 John Wiley & Sons, Ltd.
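For reference, the unmodified two-component mixture baseline that the paper starts from can be fitted by EM when failure times are right-censored, as they typically are in valve data; the sketch below uses Weibull components as an illustrative choice, not the paper's final dependence-adjusted model, and ignores the conditioning on patient age. The fitted mixing proportion pi plays the role of the probability that a patient eventually undergoes re-replacement (cause 1) rather than dies first (cause 2).

```python
import numpy as np
from scipy.optimize import minimize
from scipy.stats import weibull_min

def two_cause_mixture_em(t, event, n_iter=50):
    """EM for f(t) = pi * f1(t) + (1 - pi) * f2(t) with right-censoring.

    t: observed times; event: 1 if a failure was observed, 0 if censored.
    Weibull components (log-parameterized) are an illustrative choice.
    """
    params = [np.log([1.5, np.median(t)]), np.log([1.5, 2 * np.median(t)])]
    pi = 0.5
    for _ in range(n_iter):
        # E-step: responsibilities use densities for events and
        # survival functions for censored observations.
        lik = []
        for th in params:
            a, b = np.exp(th)
            lik.append(np.where(event,
                                weibull_min.pdf(t, a, scale=b),
                                weibull_min.sf(t, a, scale=b)))
        r1 = pi * lik[0] / (pi * lik[0] + (1 - pi) * lik[1])
        # M-step: mixing weight in closed form, components numerically.
        pi = r1.mean()
        for j, w in enumerate([r1, 1 - r1]):
            def nll(th, w=w):
                a, b = np.exp(th)
                ll = np.where(event,
                              weibull_min.logpdf(t, a, scale=b),
                              weibull_min.logsf(t, a, scale=b))
                return -(w * ll).sum()
            params[j] = minimize(nll, params[j], method="Nelder-Mead").x
    return pi, [np.exp(p) for p in params]
```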
Background In 1994 a small cluster of hepatitis-C cases in Rhesus-negative women in Ireland prompted a nationwide screening programme for hepatitis-C antibodies in all anti-D recipients. A total of 55 386 women presented for screening, and a history of exposure to anti-D was sought from all those testing positive and a sample of those testing negative. The resulting data comprised 620 antibody-positive and 1708 antibody-negative women with known exposure history, and interest was focused on using these data to estimate the infectivity of anti-D in the period 1970-1993. Methods Any exposure to anti-D provides an opportunity for infection, but the infection status at each exposure time is not observed. Instead, the available data from antibody testing only indicate whether at least one of the exposures resulted in infection. Using a simple Bernoulli model to describe the risk of infection in each year, the absence of information regarding which exposure(s) led to infection fits neatly into the framework of 'incomplete data'. Hence the expectation-maximization (EM) algorithm provides estimates of the infectiousness of anti-D in each of the 24 years studied. Results The analysis highlighted the 1977 anti-D as a source of infection, a fact which was confirmed by laboratory investigation. Other suspect batches were also identified, helping to direct the efforts of laboratory investigators. Conclusions We have presented a method to estimate the risk of infection at each exposure time from multiple exposure data. The method can also be used to estimate transmission rates and the risk associated with different sources of infection in a range of infectious disease applications.
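A minimal sketch of the EM scheme as described: each exposure in year t carries an independent Bernoulli(p_t) infection risk, a woman tests positive iff at least one exposure infected her, and the unobserved per-exposure infection indicators are the missing data. The array names are mine, and the sketch ignores the fact that the antibody-negative histories came from a sample rather than a census.

```python
import numpy as np

def anti_d_em(exposure, positive, n_iter=500):
    """EM for per-year infection probabilities from multiple-exposure data.

    exposure: (n_women, n_years) 0/1 matrix of anti-D exposures.
    positive: length-n_women 0/1 antibody test results.
    """
    n, T = exposure.shape
    p = np.full(T, 0.1)
    for _ in range(n_iter):
        # P(never infected) for each woman under the current p.
        q = np.prod(np.where(exposure == 1, 1 - p, 1.0), axis=1)
        # E-step: expected infection indicator for each (woman, year):
        # p_t / (1 - q_i) for positive women, 0 for negative women.
        z = exposure * p / (1 - q)[:, None]
        z *= positive[:, None]
        # M-step: expected infections in year t / exposures in year t
        # (guarding against years with no recorded exposures).
        p = z.sum(axis=0) / np.maximum(exposure.sum(axis=0), 1)
    return p
```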
The performance of an automatic speech recognizer degrades when there exists an acoustic mismatch between the training and the testing conditions in the data. Though it is certain that the mismatch is nonlinear, its exact form is unknown. Tackling the problem of nonlinear mismatches is a difficult task that has not been adequately addressed before. In this paper, we develop an approach that uses nonlinear transformations in the stochastic matching framework to compensate for acoustic mismatches. The functional form of the nonlinear transformation is modeled by neural networks. We develop a new technique to train neural networks using the generalized EM algorithm. This technique eliminates the need for stereo databases, which are difficult to obtain in practical applications. The new technique is data-driven and hence can be used under a wide variety of conditions without a priori knowledge of the environment. Using this technique, we show that we can provide improvement under various types of acoustic mismatch; in some cases a 72% reduction in word error rate is achieved.
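A toy sketch of the idea, assuming a diagonal-covariance Gaussian mixture as the clean-speech model rather than the recognizer's full HMMs: a small residual network transforms the mismatched features, the E-step computes component posteriors for the transformed features, and the M-step takes a few gradient ascent steps on the EM auxiliary function Q rather than maximizing it exactly, which is what makes the procedure a generalized EM. All network details (one tanh hidden layer, learning rate, and so on) are my illustrative choices, not the paper's.

```python
import numpy as np

def gem_feature_transform(X, means, var, n_hidden=8, n_em=20,
                          n_grad=50, lr=1e-2, seed=0):
    """Generalized-EM training of a residual net f(x) = x + g(x) so that
    transformed features fit a fixed diagonal GMM (means: K x d, var: K x d)."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W1 = rng.standard_normal((d, n_hidden)) * 0.1; b1 = np.zeros(n_hidden)
    W2 = rng.standard_normal((n_hidden, d)) * 0.1; b2 = np.zeros(d)

    def forward(Z):
        H = np.tanh(Z @ W1 + b1)
        return H, Z + H @ W2 + b2            # residual transform

    for _ in range(n_em):
        # E-step: component posteriors (equal mixture weights assumed).
        _, F = forward(X)
        logp = -0.5 * (((F[:, None, :] - means) ** 2 / var).sum(-1)
                       + np.log(var).sum(-1))
        gamma = np.exp(logp - logp.max(1, keepdims=True))
        gamma /= gamma.sum(1, keepdims=True)
        # Partial M-step: a few gradient ascent steps on Q (generalized EM).
        for _ in range(n_grad):
            H, F = forward(X)
            # dQ/dF pulls each f(x_n) toward the components, weighted by gamma.
            dF = np.einsum('nk,nkd->nd', gamma, (means - F[:, None, :]) / var)
            dW2 = H.T @ dF; db2 = dF.sum(0)
            dH = dF @ W2.T * (1 - H ** 2)    # backprop through tanh layer
            dW1 = X.T @ dH; db1 = dH.sum(0)
            W1 += lr * dW1 / n; b1 += lr * db1 / n
            W2 += lr * dW2 / n; b2 += lr * db2 / n
    return lambda Xnew: forward(Xnew)[1]     # the trained transform
```

No paired clean/noisy (stereo) data enters anywhere: the network is trained purely to raise the likelihood of the transformed features under the clean-speech model, which mirrors the stereo-free property claimed above.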