检索结果-内蒙古大学图书馆

Flexible regression modeling for censored data based on mixtures of student-t distributions

COMPUTATIONAL STATISTICS 2019年第1期34卷 123-152页

作者： Lachos, Victor H. Cabral, Celso R. B. Prates, Marcos O. Dey, Dipak K. Univ Connecticut Dept Stat Storrs CT 06269 USA Univ Fed Amazonas Dept Estat Ave Gen Rodrigo Octavio 6200Coroado 1 BR-69080900 Manaus Amazonas Brazil Univ Fed Minas Gerais Dept Estat Belo Horizonte MG Brazil

In some applications of censored regression models, the distribution of the error terms departs significantly from normality, for instance, in the presence of heavy tails, skewness and/or atypical observation. In this paper we extend the censored linear regression model with normal errors to the case where the random errors follow a finite mixture of Student-t distributions. This approach allows us to model data with great flexibility, accommodating multimodality, heavy tails and also skewness depending on the structure of the mixture components. We develop an analytically tractable and efficient em-type algorithm for iteratively computing maximum likelihood estimates of the parameters, with standard errors as a by-product. The algorithm has closed-form expressions at the E-step, that rely on formulas for the mean and variance of the truncated Student-t distributions. The efficacy of the method is verified through the analysis of simulated and real datasets. The proposed algorithm and methods are implemented in the new R package CensMixReg.

关键词： Censored regression model em-type algorithms Finite mixture models Heavy-tails Tobit model

来源：评论

学校读者我要写书评

暂无评论

Finite mixture modeling of censored data using the multivariate Student-t distribution

引用

JOURNAL OF MULTIVARIATE ANALYSIS 2017年 159卷 151-167页

作者： Lachos, Victor H. Lopez Moreno, Edgar J. Chen, Kun Barbosa Cabral, Celso Romulo Univ Estadual Campinas Dept Estat Campinas SP Brazil Univ Connecticut Dept Stat Mansfield CT USA Univ Fed Amazonas Dept Estat Manaus Amazonas Brazil

Finite mixture models have been widely used for the modeling and analysis of data from a heterogeneous population. Moreover, data of this kind can be subject to some upper and/or lower detection limits because of the restriction of experimental apparatus. Another complication arises when measures of each population depart significantly from normality, for instance, in the presence of heavy tails or atypical observations. For such data structures, we propose a robust model for censored data based on finite mixtures of multivariate Student-t distributions. This approach allows us to model data with great flexibility, accommodating multimodality, heavy tails and also skewness depending on the structure of the mixture components. We develop an analytically simple, yet efficient, em-type algorithm for conducting maximum likelihood estimation of the parameters. The algorithm has closed-form expressions at the E-step that rely on formulas for the mean and variance of the multivariate truncated Student-t distributions. Further, a general information-based method for approximating the asymptotic covariance matrix of the estimators is also presented. Results obtained from the analysis of both simulated and real datasets are reported to demonstrate the effectiveness of the proposed methodology. The proposed algorithm and methods are implemented in the new R package CensMixReg. (C) 2017 Elsevier Inc. All rights reserved.

关键词： Censored data Detection limit em-type algorithms Finite mixture models Multivariate Student-t

来源：评论

学校读者我要写书评

暂无评论

Learning from incomplete data via parameterized t mixture models through eigenvalue decomposition

引用

COMPUTATIONAL STATISTICS & DATA ANALYSIS 2014年 71卷 183-195页

作者： Lin, Tsung-I Natl Chung Hsing Univ Inst Stat Taichung 40227 Taiwan China Med Univ Dept Publ Hlth Taichung Taiwan

A framework of using t mixture models with fourteen eigen-decomposed covariance structures for the unsupervised learning of heterogeneous multivariate data with possible missing values is designed and implemented. Computationally flexible em-type algorithms are developed for parameter estimation of these models under a missing at random (MAR) mechanism. For ease of computation and theoretical developments, two auxiliary indicator matrices are incorporated into the estimating procedure for exactly extracting the location of observed and missing components of each observation. Computational aspects related to the specification of starting values, convergence assessment and model choice are also discussed. The practical usefulness of the proposed methodology is illustrated with real data examples and a simulation study with varying proportions of missing values. (C) 2013 Elsevier B.V. All rights reserved.

关键词： Eigenvalue decomposition em-type algorithms F-G algorithm Integrated completed likelihood Model-based clustering Multivariate t mixture models

来源：评论

学校读者我要写书评

暂无评论

Flexible mixture modelling using the multivariate skew-t-normal distribution

引用

STATISTICS AND COMPUTING 2014年第4期24卷 531-546页

作者： Lin, Tsung-I Ho, Hsiu J. Lee, Chia-Rong Natl Chung Hsing Univ Inst Stat Taichung 402 Taiwan China Med Univ Dept Publ Hlth Taichung 404 Taiwan

This paper presents a robust probabilistic mixture model based on the multivariate skew-t-normal distribution, a skew extension of the multivariate Student's t distribution with more powerful abilities in modelling data whose distribution seriously deviates from normality. The proposed model includes mixtures of normal, t and skew-normal distributions as special cases and provides a flexible alternative to recently proposed skew t mixtures. We develop two analytically tractable em-type algorithms for computing maximum likelihood estimates of model parameters in which the skewness parameters and degrees of freedom are asymptotically uncorrelated. Standard errors for the parameter estimates can be obtained via a general information-based method. We also present a procedure of merging mixture components to automatically identify the number of clusters by fitting piecewise linear regression to the rescaled entropy plot. The effectiveness and performance of the proposed methodology are illustrated by two real-life examples.

关键词： em-type algorithms Entropy Flow cytometry ICL MSTN distribution Skewness

来源：评论

学校读者我要写书评

暂无评论

Accelerating the quadratic lower-bound algorithm via optimizing the shrinkage parameter

引用

COMPUTATIONAL STATISTICS & DATA ANALYSIS 2012年第2期56卷 255-265页

作者： Tian, Guo-Liang Tang, Man-Lai Liu, Chunling Hong Kong Baptist Univ Dept Math Kowloon Tong Hong Kong Peoples R China Univ Hong Kong Dept Stat & Actuarial Sci Hong Kong Hong Kong Peoples R China Hong Kong Polytech Univ Dept Appl Math Kowloon Hong Kong Peoples R China

When the Newton-Raphson algorithm or the Fisher scoring algorithm does not work and the em-type algorithms are not available, the quadratic lower-bound (QLB) algorithm may be a useful optimization tool. However, like all em-type algorithms, the QLB algorithm may also suffer from slow convergence which can be viewed as the cost for having the ascent property. This paper proposes a novel 'shrinkage parameter' approach to accelerate the QLB algorithm while maintaining its simplicity and stability (i.e., monotonic increase in log-likelihood). The strategy is first to construct a class of quadratic surrogate functions Qr(theta vertical bar theta((t))) that induces a class of QLB algorithms indexed by a 'shrinkage parameter' r (r is an element of R) and then to optimize r over R under some criterion of convergence. For three commonly used criteria (i.e., the smallest eigenvalue, the trace and the determinant), we derive a uniformly optimal shrinkage parameter and find an optimal QLB algorithm. Some theoretical justifications are also presented. Next, we generalize the optimal QLB algorithm to problems with penalizing function and then investigate the associated properties of convergence. The optimal QLB algorithm is applied to fit a logistic regression model and a Cox proportional hazards model. Two real datasets are analyzed to illustrate the proposed methods. (C) 2011 Elsevier B.V. All rights reserved.

关键词： Cox proportional hazards model em-type algorithms Logistic regression Newton-Raphson algorithm Optimal QLB algorithm QLB algorithm

来源：评论

学校读者我要写书评

暂无评论

Robust mixture modeling using the skew t distribution

引用

STATISTICS AND COMPUTING 2007年第2期17卷 81-92页

作者： Lin, Tsung I. Lee, Jack C. Hsieh, Wan J. Natl Chung Hsing Univ Dept Appl Math Taichung 40227 Taiwan Natl Chiao Tung Univ Grad Inst Finance Hsinchu Taiwan Natl Chiao Tung Univ Inst Stat Hsinchu Taiwan

A finite mixture model using the Student's t distribution has been recognized as a robust extension of normal mixtures. Recently, a mixture of skew normal distributions has been found to be effective in the treatment of heterogeneous data involving asymmetric behaviors across subclasses. In this article, we propose a robust mixture framework based on the skew t distribution to efficiently deal with heavy-tailedness, extra skewness and multimodality in a wide range of settings. Statistical mixture modeling based on normal, Student's t and skew normal distributions can be viewed as special cases of the skew t mixture model. We present analytically simple em-type algorithms for iteratively computing maximum likelihood estimates. The proposed methodology is illustrated by analyzing a real data example.

关键词： em-type algorithms maximum likelihood outlying observations PX-em algorithm skew t mixtures truncated normal

来源：评论

学校读者我要写书评

暂无评论

Deconvolution in High-Energy Astrophysics: Science, Instrumentation, and Methods

引用

BAYESIAN ANALYSIS 2006年第2期1卷 189-235页

作者： van Dyk, David A. Connors, Alanna Esch, David N. Freeman, Peter Kang, Hosung Karovska, Margarita Kashyap, Vinay Siemiginowska, Aneta Zezas, Andreas Univ Calif Irvine Dept Stat Irvine CA 92717 USA Eureka Sci Oakland CA USA Harvard Univ Dept Stat Boston MA 02115 USA Harvard Smithsonian Ctr Astrophys Boston MA USA

In recent years, there has been an avalanche of new data in observational high-energy astrophysics. Recently launched or soon-to-be launched space-based telescopes that are designed to detect and map ultra-violet, X-ray, and gamma-ray electromagnetic emission are opening a whole new window to study the cosmos. Because the production of high-energy electromagnetic emission requires temperatures of millions of degrees and is an indication of the release of vast quantities of stored energy, these instruments give a completely new perspective on the hot and turbulent regions of the universe. The new instrumentation allows for very high resolution imaging, spectral analysis, and time series analysis;the Chandra X-ray Observatory, for example, produces images atleast thirty times sharper than any previous X-ray telescope. The complexity of the instruments, of the astronomical sources, and of the scientific questions leads to a subtle inference problem that requires sophisticated statistical tools. For example, data are subject to non-uniform stochastic censoring, heteroscedastic errors in measurement, and background contamination. Astronomical sources exhibit complex and irregular spatial structure. Scientists wish to draw conclusions as to the physical environment and structure of the source, the processes and laws which govern the birth and death of planets, stars, and galaxies, and ultimately the structure and evolution of the universe. The California-Harvard Astrostatistics Collaboration is a group of astrophysicists and statisticians working together to develop statistical methods, computational techniques, and freely available software to address outstanding inferential problems in high-energy astrophysics. We emphasize fully model-based statistical inference;we explicitly model the complexities of both astronomical sources and the data generation mechanisms inherent in new high-tech instruments, and fully utilize the resulting highly structured models in learning a

关键词： Background ontamination Censoring Chandra X-ray Observatory Chi Square Fitting Count Data Contingency Tables Deconvolution Differential emission Measure em-type algorithms Frequency Evaluations Richardson-Lucy Hardness Ratios Hubble Space Telescope Image Analysis Log-Linear Models Markov chain Monte Carlo Measurement Errors Multiscale Methods Sampling Distributions Smoothing Prior Distribution Point Spread Function Posterior Predictive Checks Power Law Poisson Models Spectral Analysis Timing Analysis

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：