Search Results - Inner Mongolia University Library

Estimating successive cancer risks in Lynch Syndrome families using a progressive three-state model

STATISTICS IN MEDICINE, 2014, Vol. 33, No. 4, pp. 618-638

Authors: Choi, Yun-Hee; Briollais, Laurent; Green, Jane; Parfrey, Patrick; Kopciuk, Karen. Affiliations: Univ Western Ontario Dept Epidemiol & Biostat London ON N6A 5C1 Canada; Mt Sinai Hosp Samuel Lunenfeld Res Inst Toronto ON M5G 1X5 Canada; Univ Toronto Dalla Lana Sch Publ Hlth Toronto ON M5S 1A1 Canada; Mem Univ Newfoundland Discipline Genet St John NF A1B 3X9 Canada; Mem Univ Newfoundland Clin Epidemiol Unit St John NF A1B 3X9 Canada; Alberta Hlth Serv Canc Care Populat Hlth Res Calgary AB T2S 3C3 Canada; Univ Calgary Dept Math & Stat Calgary AB T2N 1N2 Canada

Lynch Syndrome (LS) families harbor mutated mismatch repair genes, which predispose them to specific types of cancer. Because individuals within LS families can experience multiple cancers over their lifetime, we developed a progressive three-state model to estimate the disease risk from a healthy state (state 0) to a first cancer (state 1) and then to a second cancer (state 2). Ascertainment correction of the likelihood was made to adjust for complex sampling designs, with carrier probabilities for family members with missing genotype information estimated using their family's observed genotype and phenotype information in a one-step expectation-maximization algorithm. A sandwich variance estimator was employed to overcome possible model misspecification. The main objective of this paper is to estimate the disease risk (penetrance) for age at a second cancer after someone has experienced a first cancer that is also associated with a mutated gene. Simulation study results indicate that our approach generally provides unbiased risk estimates and low root mean squared errors across different family study designs, proportions of missing genotypes, and risk heterogeneities. An application to 12 large LS families from Newfoundland demonstrates that the risk for a second cancer was substantial and that the age at a first colorectal cancer significantly impacted the age at any LS subsequent cancer. This study provides new insights for developing more effective management of mutation carriers in LS families by providing more accurate multiple cancer risk estimates. Copyright (c) 2013 John Wiley & Sons, Ltd.

Keywords: ascertainment correction; expectation-maximization algorithm; family study designs; Lynch Syndrome; missing genotypes; penetrance


Addressing misclassification for binary data: probit and t-link regressions

JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2014, Vol. 84, No. 10, pp. 2187-2213

Authors: Naranjo, L.; Martin, J.; Perez, C. J.; Rufo, M. J. Affiliation: Univ Extremadura Dept Matemat Badajoz 06006 Spain

Generalized linear models are addressed to describe the dependence of data on explanatory variables when the binary outcome is subject to misclassification. Both probit and t-link regressions for misclassified binary data under Bayesian methodology are proposed. The computational difficulties are avoided by using data augmentation. The idea of using a data augmentation framework (with two types of latent variables) is exploited to derive efficient Gibbs sampling and expectation-maximization algorithms. In addition, this formulation allows the probit model to be obtained as a particular case of the t-link model. Simulation examples are presented to illustrate the model performance in comparison with standard methods that do not consider misclassification. To show the potential of the proposed approaches, a real data problem arising in the study of hearing loss caused by exposure to occupational noise is analysed.

Keywords: Bayesian methods; binary regression; data augmentation; expectation-maximization algorithm; generalized linear models; Markov chain Monte Carlo methods; misclassification


Missing data in principal component analysis of questionnaire data: a comparison of methods

JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2014, Vol. 84, No. 11, pp. 2298-2315

Authors: Van Ginkel, Joost R.; Kroonenberg, Pieter M.; Kiers, Henk A. L. Affiliations: Leiden Univ Fac Social & Behav Sci NL-2300 RB Leiden Netherlands; Univ Groningen Fac Behav & Social Sci NL-9712 TS Groningen Netherlands

Principal component analysis (PCA) is a widely used statistical technique for determining subscales in questionnaire data. As with any other statistical technique, missing data may complicate both its execution and its interpretation. In this study, six methods for dealing with missing data in the context of PCA are reviewed and compared: listwise deletion (LD), pairwise deletion, the missing data passive approach, regularized PCA, the expectation-maximization algorithm, and multiple imputation. Simulations show that, except for LD, all methods give about equally good results for realistic percentages of missing data. Therefore, the choice of a procedure can be based on ease of application or simply on the availability of a technique.

Keywords: expectation-maximization algorithm; least-squares fitting; multiple imputation; missing data; missing data passive; principal component analysis; regularized principal component analysis


Identifying multiple change points in a linear mixed effects model

STATISTICS IN MEDICINE, 2014, Vol. 33, No. 6, pp. 1015-1028

Authors: Lai, Yinglei; Albert, Paul S. Affiliations: George Washington Univ Dept Stat Washington DC 20052 USA; George Washington Univ Ctr Biostat Washington DC 20052 USA; Eunice Kennedy Shriver Natl Inst Child Hlth & Hum Biostat & Bioinformat Branch Div Epidemiol Stat & Prevent Res Rockville MD 20852 USA

Although change-point analysis methods for longitudinal data have been developed, it is often of interest to detect multiple change points in longitudinal data. In this paper, we propose a linear mixed effects modeling framework for identifying multiple change points in longitudinal Gaussian data. Specifically, we develop a novel statistical and computational framework that integrates the expectation-maximization and the dynamic programming algorithms. We conduct a comprehensive simulation study to demonstrate the performance of our method. We illustrate our method with an analysis of data from a trial evaluating a behavioral intervention for the control of type I diabetes in adolescents with HbA1c as the longitudinal response variable. Copyright (c) 2013 John Wiley & Sons, Ltd.

Keywords: change point; longitudinal data; linear mixed effects model; expectation-maximization algorithm; dynamic programming algorithm


Maximum-likelihood estimation of a log-concave density based on censored data

ELECTRONIC JOURNAL OF STATISTICS, 2014, Vol. 8, No. 1, pp. 1405-1437

Authors: Duembgen, Lutz; Rufibach, Kaspar. Affiliations: Univ Bern Inst Math Stat & Actuarial Sci CH-3012 Bern Switzerland; F Hoffmann La Roche & Cie AG Biostat Oncol CH-4070 Basel Switzerland

We consider nonparametric maximum-likelihood estimation of a log-concave density in case of interval-censored, right-censored and binned data. We allow for the possibility of a subprobability density with an additional mass at +infinity, which is estimated simultaneously. The existence of the estimator is proved under mild conditions and various theoretical aspects are given, such as certain shape and consistency properties. An EM algorithm is proposed for the approximate computation of the estimator and its performance is illustrated in two examples.

Keywords: Active set algorithm; binning; cure parameter; expectation-maximization algorithm; interval-censoring; qualitative constraints; right-censoring


Spatial Matern Fields Driven by Non-Gaussian Noise

SCANDINAVIAN JOURNAL OF STATISTICS, 2014, Vol. 41, No. 3, pp. 557-579

Author: Bolin, David. Affiliation: Umea Univ Dept Math & Math Stat SE-90187 Umea Sweden

The article studies non-Gaussian extensions of a recently discovered link between certain Gaussian random fields, expressed as solutions to stochastic partial differential equations (SPDEs), and Gaussian Markov random fields. The focus is on non-Gaussian random fields with Matern covariance functions, and in particular, we show how the SPDE formulation of a Laplace moving-average model can be used to obtain an efficient simulation method as well as an accurate parameter estimation technique for the model. This should be seen as a demonstration of how these techniques can be used, and generalizations to more general SPDEs are readily available.

Keywords: expectation-maximization algorithm; Laplace noise; Markov random fields; Matern covariances; non-Gaussian; stochastic partial differential equation


Morphometric variation of Seminavis pusilla (Bacillariophyceae) and its relationship to salinity in inter-dune lakes of the Badain Jaran Desert, Inner Mongolia, China

PHYCOLOGICAL RESEARCH, 2014, Vol. 62, No. 4, pp. 282-293

Authors: Rioual, Patrick; Lu, Yanbin; Chu, Guoqiang; Zhu, Bingqi; Yang, Xiaoping. Affiliation: Chinese Acad Sci Key Lab Cenozo Geol & Environm Inst Geol & Geophys Beijing Peoples R China

We used light and scanning electron microscope analyses to quantify morphometric features (valve length, width, stria density, lineola density and valve curvature) from the observation of valves representing Seminavis pusilla. Cluster analysis based on Gaussian mixture models and the expectation-maximization algorithm was used for delineating two species, Seminavis pusilla sensu stricto and Seminavis lata (Krammer) Rioual comb. et stat. nov. By comparison with S. pusilla, S. lata is characterized by wider valves and lower stria density. The two species also have markedly different ecologies. S. pusilla is most abundant in the most saline lakes of the dataset, while S. lata is most abundant in the less saline lakes. Our results indicate that combining the two species into S. pusilla sensu lato would lead to a loss of ecological information and a decrease in the performance of transfer functions developed for quantitative reconstruction of past salinity from fossil diatom assemblages in sediment cores.

Keywords: Badain Jaran Desert; expectation-maximization algorithm; morphometric analysis; model-based clustering; Navicymbula; salinity; Seminavis lata comb et stat nov; Seminavis pusilla
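
Illustrative note (not part of the catalog record): the abstract above describes clustering with Gaussian mixture models fitted by the expectation-maximization algorithm. The sketch below shows the general idea for a single measurement and two components. It is a minimal, generic example, not the authors' method or software. The function names, the initialisation at the data extremes, the variance floor, and the synthetic "valve width" values are all assumptions made for illustration only.

```python
import math
import random


def normal_pdf(x, mean, var):
    """Density of a univariate normal distribution."""
    return math.exp(-((x - mean) ** 2) / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def fit_two_component_gmm(data, n_iter=200):
    """Fit a two-component 1-D Gaussian mixture by EM (illustrative sketch).

    Returns (weights, means, variances), with components sorted by mean.
    Assumes the data are not all identical (non-zero overall variance).
    """
    n = len(data)
    overall_mean = sum(data) / n
    overall_var = sum((x - overall_mean) ** 2 for x in data) / n
    # Assumed initialisation: means at the data extremes, shared variance.
    weights = [0.5, 0.5]
    means = [min(data), max(data)]
    variances = [overall_var, overall_var]

    for _ in range(n_iter):
        # E-step: posterior probability of each component for each point.
        resp = []
        for x in data:
            p = [weights[k] * normal_pdf(x, means[k], variances[k]) for k in range(2)]
            total = p[0] + p[1]
            resp.append([p[0] / total, p[1] / total])

        # M-step: update weights, means and variances from the responsibilities.
        for k in range(2):
            nk = sum(r[k] for r in resp)
            weights[k] = nk / n
            means[k] = sum(r[k] * x for r, x in zip(resp, data)) / nk
            var_k = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, data)) / nk
            variances[k] = max(var_k, 1e-6)  # floor to avoid a collapsing component

    order = sorted(range(2), key=lambda k: means[k])
    return ([weights[k] for k in order],
            [means[k] for k in order],
            [variances[k] for k in order])


if __name__ == "__main__":
    # Synthetic data only: two hypothetical groups of valve widths.
    random.seed(0)
    widths = ([random.gauss(4.0, 0.3) for _ in range(200)]
              + [random.gauss(6.0, 0.4) for _ in range(200)])
    w, m, v = fit_two_component_gmm(widths)
    print("weights:", w)
    print("means:", m)
    print("variances:", v)
```

In practice a multivariate mixture over several morphometric features, with the number of components chosen by a model-selection criterion, would be closer to model-based clustering as usually applied; the one-dimensional case is shown here only because it keeps the E-step and M-step easy to read.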


Direct Calculation of the Variance of Maximum Penalized Likelihood Estimates via EM algorithm

AMERICAN STATISTICIAN, 2014, Vol. 68, No. 2, pp. 93-97

Authors: Lee, Woojoo; Pawitan, Yudi. Affiliations: Inha Univ Dept Stat Inchon 402751 South Korea; Karolinska Inst Dept Med Epidemiol & Biostat S-17177 Stockholm Sweden

The variance of the maximum penalized likelihood estimate obtained through the EM algorithm has not been explored in detail. We provide a simple and intuitive new representation for the variance that can be computed from the EM algorithm directly. For pedagogical purposes, we illustrate the new formula with two examples where analytical solutions are possible.

Keywords: expectation-maximization algorithm; Observed information; Standard errors


Modeling Longitudinal Data by Latent Markov Models with Application to Educational and Psychological Measurement

Joint International Meeting of the Japanese-Classification-Society and the Classification-and-Data-Analysis-Group of the Italian-Statistical-Society (JCS-CLADAG)

Author: Bartolucci, Francesco. Affiliation: Univ Perugia Dept Econ Perugia Italy

ISBN (digital): 9783319066929

ISBN (print): 9783319066929; 9783319066912

I review a class of models for longitudinal data, showing how it may be applied in a meaningful way to the analysis of data collected by administering a series of items intended for educational or psychological measurement. In this class of models, the unobserved individual characteristics of interest are represented by a sequence of discrete latent variables, which follows a Markov chain. Inferential problems involved in the application of these models are discussed, considering in particular maximum likelihood estimation based on the expectation-maximization algorithm, model selection, and hypothesis testing. Most of these problems are common to hidden Markov models for time-series data. The approach is illustrated by different applications in education and psychology.

Keywords: Forward and Backward recursions; expectation-maximization algorithm; Hidden Markov models; Rasch model


EM-based Phoneme Confusion Matrix Generation for Low-resource Spoken Term Detection

IEEE Workshop on Spoken Language Technology (SLT 2014)

Authors: Xu, Di; Wang, Yun; Metze, Florian. Affiliation: Carnegie Mellon Univ Sch Comp Sci Language Technol Inst Pittsburgh PA 15213 USA

ISBN (print): 9781479971299

The idea of using a data-driven phoneme confusion matrix (PCM) to enhance speech recognition and retrieval performance is not new to the speech community. Although empirical results show various degrees of improvement from introducing a PCM, the underlying data-driven processes described in most papers are rather ad hoc and lack rigorous statistical justification. In this paper we focus on the statistical aspects of PCM generation, and propose and justify a novel expectation-maximization based algorithm for data-driven PCM generation. We evaluate the performance of the generated PCMs in the context of low-resource spoken term detection, with a primary focus on out-of-vocabulary keywords.

Keywords: expectation-maximization algorithm; machine learning; information retrieval; spoken term detection; out-of-vocabulary words
