检索结果-内蒙古大学图书馆

Estimation of a residual distribution with small numbers of repeated measurements

CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE 2002年第3期30卷 383-400页

作者： Susko, E Nadon, R Dalhousie Univ Dept Math & Stat Halifax NS B3H 3J5 Canada

The authors consider the estimation of a residual distribution for different measurement problems with a common measurement error process. The problem is motivated by issues arising in the analysis of gene expression data but should have application in other similar settings. It is implicitly assumed throughout that there are large numbers of measurements but small numbers of repeated measurements. As a consequence, the distribution of the estimated residuals is a biased estimate of the residual distribution. The authors present two methods for the estimation of the residual distribution with some restriction on the form of the distribution. They give an upper bound for the rate of convergence for an estimator based on the characteristic function and compare its performance with that of another estimator with simulations.

关键词： characteristic function em algorithm Fourier transformation marginal likelihood measurement error microarray experiment normal mixture model residual distribution

来源：评论

学校读者我要写书评

暂无评论

Parameter estimation for hidden Markov chains

引用

JOURNAL OF STATISTICAL PLANNING AND INFERENCE 2002年第1-2期108卷 365-390页

作者： Archer, GEB Titterington, DM GlaxoSmithKline Pharmaceut Harlow Essex England Univ Glasgow Dept Stat Glasgow G12 8QQ Lanark Scotland

The problem of estimating parameters within hidden Markov models is not straightforward. In particular, calculation of maximum likelihood estimates (MLE) is nontrivial. Some variations on MLE are described that are computationally less burdensome, and detailed comparisons are drawn for the case of hidden binary isotropic Markov chains. (C) 2002 Elsevier Science B.V. All rights reserved.

关键词： em algorithm hidden Markov model incomplete data maxinnim likelihood maximum pseudo-likelihood mean-field approximations method of moments

来源：评论

学校读者我要写书评

暂无评论

Double-semi parametric method for missing covariates in cox regression models

引用

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION 2002年第458期97卷 565-576页

作者： Chen, HY Univ Illinois Sch Publ Hlth Div Epidemiol & Biostat Chicago IL 60612 USA

The problem of nuisance covariate model specification is considered in Cox regression where the maximum semiparametric likelihood method is used to handle the missing covariates. A component of the covariates is modeled nonparametric ally to achieve robustness against covariate model misspecification and to reduce the number of possibly intractable integrations involved in the parametric modeling of the covariates. The statistical properties of the proposed method are examined. It is found that in some important situations, the maximum semiparametric likelihood can be applied without making any additional parametric model assumptions on covariates. The proposed method can yield a more efficient estimator than the nonparametric imputation methods and does not require specification of the missingness mechanism when compared with the inverse probability weighting method. A real data example is analyzed to demonstrate use of the proposed method.

关键词： censoring em algorithm missing-data mechanism nuisance model semiparametric likelihood

来源：评论

学校读者我要写书评

暂无评论

Analysis of multivariate longitudinal outcomes with nonignorable dropouts and missing covariates: Changes in methadone treatment practices

引用

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION 2002年第457期97卷 40-52页

作者： Roy, J Lin, XH Brown Univ Ctr Stat Sci Providence RI 02912 USA Univ Michigan Dept Biostat Ann Arbor MI 48109 USA

This article analyzes changes in treatment practices in outpatient methadone treatment units from a national panel study. The analysis of this dataset is challenging due to several difficulties, including multiple longitudinal outcomes, nonignorable nonresponses, and missing covariates. Specifically, the data included several variables that measure the effectiveness of methadone treatment practices for each unit. A substantial percentage of units (33%) did not respond during the follow-up. These dropout units tended to be units with less effective treatment practices: the dropout mechanism thus may be nonignorable. Finally, the time-varying covariates for the units that dropped out were missing at the time of dropout. A valid analysis hence needs to address these three issues simultaneously. Our approach assumes that the observed outcomes measure a latent variable (e.g., treatment practice effectiveness) with error. We model the relationship between this latent variable and covariates using a linear mixed model. To account for nonignorable dropouts, we apply a selection model in which die dropout probability depends on the latent variable. Finally, we accommodate missing time-varying covariates by modeling them using a transition model. In view of multidimensional integration in full-likelihood estimation, we develop the em algorithm to estimate the model parameters, We apply the proposed approach to the methadone treatment practices data. Our results show that methadone treatment practices have improved in the last decade. Our results are also useful for identifying the types of methadone treatment units that need improvement.

关键词： em algorithm latent variable model missing covariate multiple outcomes nonignorable dropout panel data random-effects model

来源：评论

学校读者我要写书评

暂无评论

Regression models for allele sharing: analysis of accumulating data in affected sib pair studies

引用

STATISTICS IN MEDICINE 2002年第3期21卷 431-444页

作者： Bull, SB Greenwood, CMT Mirea, L Morgan, K Mt Sinai Hosp Samuel Lunenfeld Res Inst Toronto ON M5G 1X5 Canada Univ Toronto Dept Publ Hlth Sci Toronto ON M5G 1X5 Canada McGill Univ Dept Epidemiol & Biostat Montreal PQ Canada McGill Univ Ctr Hlth Res Inst Montreal Genome Ctr Montreal PQ Canada McGill Univ Dept Human Genet Montreal PQ Canada McGill Univ Dept Med Montreal PQ Canada

Advances in human genome mapping have led to the identification of large numbers of genetic markers that allow systematic searches for multiple disease susceptibility genes for complex traits. A common design involves the recruitment of families with at least two children affected with the disease of interest. The objective is to find chromosomal regions that harbour susceptibility genes for the disease. The affected children, their parents if available, and sometimes other, unaffected, siblings are genotyped using sets of microsatellite DNA markers representing chromosomal sites distributed across the genome. Each marker can occur in several different variants known as alleles, and a pair of alleles constitutes the marker genotype. Each child randomly inherits one of their mother's two alleles and one of their father's two alleles. If a marker is close to a disease susceptibility gene, then affected siblings are expected to have more sharing of the same maternal and/or paternal marker alleles. Statistical methods are used to estimate the distribution of allele sharing in each affected sib pair (ASP) using the set of markers typed across each chromosome, and to test for the presence of excess sharing in the families as a group at each point across the genome. Regression models that allow the allele sharing proportions to depend on characteristics of the family such as diagnostic subtype or ethnic background have been developed to address the heterogeneity that is characteristic of complex disease, but these have not yet been widely applied. In this paper, we apply regression modelling to investigate variation associated with family-level covariates and with the order in which families are recruited and genotyped. We also discuss how some of the concepts of group sequential analysis apply to accumulating data from genome scans of complex disease. Copyright (C) 2002 John Wiley Sons, Ltd.

关键词： complex disease em algorithm family data genetic linkage hidden Markov model identical by descent

来源：评论

学校读者我要写书评

暂无评论

Asymptotic approximations to posterior distributions via conditional moment equations

引用

BIOMETRIKA 2002年第4期89卷 755-767页

作者： Yee, JL Johnson, WO Samaniego, FJ US Geol Survey Western Ecol Res Ctr Sacramento CA 95826 USA Univ Calif Davis Dept Stat Davis CA 95616 USA

We consider asymptotic approximations to joint posterior distributions in situations where the full conditional distributions referred to in Gibbs sampling are asymptotically normal. Our development focuses on problems where data augmentation facilitates simpler calculations, but results hold more generally. Asymptotic mean vectors are obtained as simultaneous solutions to fixed point equations that arise naturally in the development. Asymptotic covariance matrices flow naturally from the work of Arnold & Press (1989) and involve the conditional asymptotic covariance matrices and first derivative matrices for conditional mean functions. When the fixed point equations admit an analytical solution, explicit formulae are subsequently obtained for the covariance structure of the joint limiting distribution, which may shed light on the use of the given statistical model. Two illustrations are given.

关键词： Bayesian approach data augmentation em algorithm fixed point theorem Gibbs sampling latent data screening data

来源：评论

学校读者我要写书评

暂无评论

Bayesian estimation of dynamical systems: An application to fMRI

引用

NEUROIMAGE 2002年第2期16卷 513-530页

作者： Friston, KJ Inst Neurol Wellcome Dept Cognit Neurol London WC1N 3BG England

This paper presents a method for estimating the conditional or posterior distribution of the parameters of deterministic dynamical systems. The procedure conforms to an em implementation of a Gauss-Newton search for the maximum of the conditional or posterior density. The inclusion of priors in the estimation procedure ensures robust and rapid convergence and the resulting conditional densities enable Bayesian inference about the model parameters. The method is demonstrated using an input-state-output model of the hemodynamic coupling between experimentally designed causes or factors in fMRI studies and the ensuing BOLD response. This example represents a generalization of current fMRI analysis models that accommodates nonlinearities and in which the parameters have an explicit physical interpretation. Second, the approach extends classical inference, based on the likelihood of the data given a null hypothesis about the parameters, to more plausible inferences about the parameters of the model given the data. This inference provides for confidence intervals based on the conditional density. (C) 2002 Elsevier Science (USA).

关键词： fMRI Bayesian inference nonlinear dynamics model identification hemodynamics Volterra series em algorithm Gauss-Newton method

来源：评论

学校读者我要写书评

暂无评论

Likelihood analysis and flexible structural modeling for measurement error model regression

引用

JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION 2002年第1期72卷 33-45页

作者： Schafer, DW Oregon State Univ Dept Stat Corvallis OR 97331 USA

A computational approach is presented for likelihood analysis of regression models with measurement errors in explanatory variables. If y, x, and w represent the response, an unobservable true value of an explanatory variable, and an observable measurement of x, then the likelihood function is based on the density of the observable variables: f(y, w) = integralf(y, w\x)f(x)dx. For realistic model specifications the integral must be approximated numerically. While one could conceivably use a general-purpose optimization routine for finding estimates that maximize the approximate likelihood, that tends not to work very well. The approximate density, however, has the form of a finite mixture model so that the standard em algorithm for that problem can be applied, The resulting approach is practically important since it easily permits realistic distributional modeling and can be accomplished through iterative application of readily available routines.

关键词： em algorithm errors-in-variables generalized linear models linear regression mixture model nonlinear regression

来源：评论

学校读者我要写书评

暂无评论

A type of restricted maximum likelihood estimator of variance components in generalised linear mixed models

引用

BIOMETRIKA 2002年第2期89卷 401-409页

作者： Liao, JG Lipsitz, SR Univ Med & Dent New Jersey Div Biometr New Brunswick NJ 08903 USA Med Univ S Carolina Dept Biometry & Epidemiol Charleston SC 29425 USA

The maximum likelihood estimator of the variance components in a linear model can be biased downwards. Restricted maximum likelihood (RemL) corrects this problem by using the likelihood of a set of residual contrasts and is generally considered superior. However, this original restricted maximum likelihood definition does not directly extend beyond linear models. We propose a RemL-type estimator for generalised linear mixed models by correcting the bias in the profile score function of the variance components. The proposed estimator has the same consistency properties as the maximum likelihood estimator if the number of parameters in the mean and variance components models remains fixed. However, the estimator of the variance components has a smaller finite sample bias. A simulation study with a logistic mixed model shows that the proposed estimator is effective in correcting the downward bias in the maximum likelihood estimator.

关键词： bias correction em algorithm logistic mixed model

来源：评论

学校读者我要写书评

暂无评论

Testing for multivariate outliers in the presence of missing data

引用

PURE AND APPLIED GEOPHYSICS 2002年第4期159卷 889-903页

作者： Woodward, WA Sain, SR Grah, HL Zhao, BJ Fisk, MD So Methodist Univ Dallas TX 75275 USA Mission Res Corp Santa Barbara CA 93101 USA

We consider the problem of multivariate outlier testing for purposes of distinguishing seismic signals of underground nuclear events from training samples based on non-nuclear seismic events when certain data are missing. We consider the case in which the training data follow a multivariate normal distribution. Assume a potential outlier is observed on which k features of interest are measured. Assume further that the available training set of n observations on these k features is available but that some of the observations in the training data have missing features. The approach currently used in practice is to perform the outlier testing using a generalized likelihood ratio test procedure based only on the data vectors in the training data with complete data. When there is a substantial amount of missing data within the training set, use of this strategy may lead to a loss of valuable information. An alternative procedure is to incorporate all n of the data vectors in the training data using the em algorithm to appropriately handle the missing data in the training set. Resampling methods are used to find appropriate critical regions. We use simulation results and analysis of models fit to Pg/Lg ratios for the WMQ station in China to compare these two strategies for dealing with missing data.

关键词： outlier testing nuclear monitoring em algorithm multivariate normal missing data

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：