Search Results - Inner Mongolia University Library

Mixture cure rate models with neural network estimated nonparametric components

COMPUTATIONAL STATISTICS, 2021, Vol. 36, No. 4, pp. 2467-2489

Authors: Xie, Yujing; Yu, Zhangsheng. Affiliations: Shanghai Jiao Tong Univ Sch Math Sci Shanghai Peoples R China; Shanghai Jiao Tong Univ SJTU Yale Joint Ctr Biostat Sch Math Sci Dept Bioinformat & Biostat Shanghai Peoples R China

Survival data including potentially cured subjects are common in clinical studies and mixture cure rate models are often used for analysis. The non-cured probabilities are often predicted by non-parametric, high-dimensional, or even unstructured (e.g. image) predictors, which is a challenging task for traditional nonparametric methods such as splines and local kernels. We propose to use a neural network to model the nonparametric or unstructured predictors' effect in cure rate models and retain the proportional hazards structure due to its explanatory ability. We estimate the parameters by the Expectation-Maximization algorithm. Estimators are shown to be consistent. Simulation studies show good performance in both prediction and estimation. Finally, we analyze Open Access Series of Imaging Studies data to illustrate the practical use of our methods.

Keywords: Consistency; Deep learning; EM algorithm; Survival analysis
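
The mixture cure structure mentioned in the abstract above can be illustrated with the E-step that is standard in the mixture cure literature. The sketch below is not taken from the paper. The function name `cure_e_step` and its arguments are hypothetical, and the non-cured probability and the survival function of the uncured are assumed to be supplied by whatever models are being fitted (in the paper's setting, a neural network and a proportional hazards model respectively).

```python
def cure_e_step(delta, pi_uncured, surv_uncured):
    """E-step weights for a generic mixture cure model (illustrative sketch).

    The population survival function is assumed to be
        S_pop(t) = 1 - pi + pi * S_u(t),
    where pi is the probability of being uncured and S_u is the survival
    function of the uncured subjects.

    delta[i]        : 1 if the event was observed, 0 if censored
    pi_uncured[i]   : current estimate of P(subject i is uncured)
    surv_uncured[i] : current estimate of S_u at subject i's observed time
    Returns the posterior probability that each subject is uncured.
    Assumes 1 - pi + pi * S_u > 0 for every censored subject.
    """
    weights = []
    for d, p, s in zip(delta, pi_uncured, surv_uncured):
        if d == 1:
            # An observed event implies the subject cannot be cured.
            weights.append(1.0)
        else:
            # Censored: Bayes' rule over "uncured and still event-free"
            # versus "cured".
            weights.append(p * s / (1.0 - p + p * s))
    return weights


if __name__ == "__main__":
    # Two subjects with identical current estimates; only the first had an event.
    print(cure_e_step([1, 0], [0.6, 0.6], [0.5, 0.5]))
```

In an M-step these weights would typically act as soft labels for the incidence model and as case weights for the latency model; how the paper organises that step is not stated in the abstract.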

Causal mediation analysis with latent subgroups

STATISTICS IN MEDICINE, 2021, Vol. 40, No. 25, pp. 5628-5641

Authors: Wang, WenWu; Xu, Jinfeng; Schwartz, Joel; Baccarelli, Andrea; Liu, Zhonghua. Affiliations: Qufu Normal Univ Sch Stat Qufu Shandong Peoples R China; Univ Hong Kong Dept Stat & Actuarial Sci Pokfulam Hong Kong Peoples R China; Harvard Univ Dept Environm Hlth Boston MA USA; Columbia Univ Dept Environm Hlth Sci New York NY USA

In biomedical studies, the causal mediation effect might be heterogeneous across individuals in the study population due to each study subject's unique characteristics. While individuals' mediation effects may differ from each other, it is often reasonable and more interpretable to assume that individuals belong to several distinct latent subgroups with similar attributes. In this article, we first show that the subgroup-specific mediation effect can be identified under the group-specific sequential ignorability assumptions. Then, we propose a simple mixture modeling approach to account for the latent subgroup structure where each mixture component corresponds to one latent subgroup in the linear structural equation model framework. Model parameters can be estimated using the standard expectation-maximization (EM) algorithm. Each individual's subgroup membership can be inferred based on the posterior probability. We propose to use the singular Bayesian information criterion to consistently select the number of latent subgroups by recognizing that the Fisher information matrix for mixture models might be singular. We then propose to use the nonparametric bootstrap method to compute standard errors and confidence intervals. We conducted simulation studies to evaluate the empirical performance of our proposed method named iMed. Finally, we reanalyzed a DNA methylation data set from the Normative Aging Study and found that the mediation effects of two well-documented DNA methylation CpG sites are heterogeneous across two latent subgroups in the causal pathway from smoking behavior to lung function. We also developed an R package iMed for public use.

Keywords: DNA methylation; EM algorithm; heterogeneous mediation effects; latent subgroups; mixture model; singular Bayesian information criterion

Estimation of parameters in multivariate wrapped models for data on a p-torus

COMPUTATIONAL STATISTICS, 2021, Vol. 36, No. 1, pp. 193-215

Authors: Nodehi, Anahita; Golalizadeh, Mousa; Maadooliat, Mehdi; Agostinelli, Claudio. Affiliations: Tarbiat Modares Univ Dept Stat Tehran Iran; Inst Res Fundamental Sci IPM Sch Biol Sci POB 19395-5746 Tehran Iran; Marquette Univ Dept Math & Stat Sci Milwaukee WI 53233 USA; Univ Trento Dept Math Trento Italy

Multivariate circular observations, i.e. points on a torus, arise frequently in fields where instruments such as compass, protractor, weather vane, sextant or theodolite are used. Multivariate wrapped models are often appropriate to describe data points scattered on a p-dimensional torus. However, the statistical inference based on such models is quite complicated since each contribution in the log-likelihood function involves an infinite sum of indices in Z^p, where p is the dimension of the data. To overcome this problem, for moderate dimension p, we propose two estimation procedures based on Expectation-Maximisation and Classification Expectation-Maximisation algorithms. We study the performance of the proposed techniques on a Monte Carlo simulation and further illustrate the advantages of the new procedures on three real-world data sets.

Keywords: CEM algorithm; EM algorithm; Estimation procedures; Multivariate wrapped distributions; Torus
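
The "infinite sum of indices in Z^p" mentioned in the abstract above can be written out using the usual definition of a wrapped density. The notation below (a density f_X on R^p with parameter vector psi, observations theta_1, ..., theta_n) is assumed here for illustration and is not taken from the paper.

```latex
% Wrapped density on the p-torus, for \theta \in [0, 2\pi)^p
\[
  f_w(\theta; \psi) \;=\; \sum_{k \in \mathbb{Z}^p} f_X(\theta + 2\pi k; \psi)
\]

% Log-likelihood for observations \theta_1, \dots, \theta_n:
% every term contains a sum over the integer lattice \mathbb{Z}^p
\[
  \ell(\psi) \;=\; \sum_{i=1}^{n} \log \sum_{k \in \mathbb{Z}^p}
      f_X(\theta_i + 2\pi k; \psi)
\]
```

Treating the integer vector k (the unobserved winding numbers) as missing data is what makes an EM-type scheme a natural fit for this likelihood.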

Mixture Modeling Using the Multivariate Restricted Skew-Normal Scale Mixture of Birnbaum-Saunders Distributions

IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY TRANSACTION A-SCIENCE, 2021, Vol. 45, No. 1, pp. 271-282

Authors: Samary, Hossaein; Khodadadi, Zahra; Jafarpour, Hedieh. Affiliations: Islamic Azad Univ Marvdasht Branch Dept Stat Marvdasht Iran; Shiraz Islamic Azad Univ Dept Stat Shiraz Iran

Mixture models are promising statistical tools for modeling and clustering data arising from a heterogeneous population. This paper presents a mixture model based on the assumption that the mixing components follow the multivariate restricted skew-normal scale mixture of Birnbaum-Saunders (SNBS) distributions. A computationally feasible expectation-maximization algorithm is developed to carry out maximum likelihood estimation of the new model. Simulation studies are carried out to check the clustering performance and classification accuracy. Finally, an illustrative example is presented by analyzing a real-world data set.

Keywords: Restricted skew-normal; Birnbaum-Saunders distribution; EM algorithm; GH distribution

Semiparametric estimation for proportional hazards mixture cure model allowing non-curable competing risk

JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2021, Vol. 211, pp. 171-189

Authors: Wang, Yijun; Zhang, Jiajia; Cai, Chao; Lu, Wenbin; Tang, Yincai. Affiliations: Zhejiang Gongshang Univ Sch Stat & Math Hangzhou 310018 Zhejiang Peoples R China; East China Normal Univ Sch Stat Key Lab Adv Theory & Applicat Stat & Data Sci MOE Shanghai 200062 Peoples R China; Univ South Carolina Dept Epidemiol & Biostat Columbia SC 29208 USA; North Carolina State Univ Dept Stat Raleigh NC 27695 USA

With advancements in medical research, a broader range of diseases may be curable, which indicates some patients may not die owing to the disease of interest. The mixture cure model, which can capture patients being cured, has received increasing attention in practice. However, the existing mixture cure models only focus on major events with potential cures while ignoring the potential risks posed by other non-curable competing events, which are commonly observed in the real world. The main purpose of this article is to propose a new mixture cure model allowing non-curable competing risk. A semiparametric estimation method is developed via an EM algorithm, the asymptotic properties of the parametric estimators are provided, and its performance is demonstrated through comprehensive simulation studies. Finally, the proposed method is applied to a prostate cancer clinical trial dataset. (C) 2020 Elsevier B.V. All rights reserved.

Keywords: PH mixture cure model; Competing risks; EM algorithm; Semiparametric estimation; Logistic regression

Mixed-effects models for censored data with autoregressive errors

JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2021, Vol. 31, No. 3, pp. 273-294

Authors: Olivari, Rommy C.; Garay, Aldo M.; Lachos, Victor H.; Matos, Larissa A. Affiliations: Univ Fed Pernambuco Dept Stat BR-50670901 Recife PE Brazil; Univ Connecticut Dept Stat Storrs CT 06269 USA; Univ Estadual Campinas Dept Stat Campinas SP Brazil

Mixed-effects models, with modifications to accommodate censored observations (LMEC/NLMEC), are routinely used to analyze measurements, collected irregularly over time, which are often subject to some upper and lower detection limits. This paper presents a likelihood-based approach for fitting LMEC/NLMEC models with autoregressive dependence of order p, AR(p), in the error term. An EM-type algorithm is developed for computing the maximum likelihood estimates, obtaining as a byproduct the standard errors of the fixed effects and the likelihood value. Moreover, the constraints on the parameter space that arise from the stationarity conditions for the autoregressive parameters in the EM algorithm are handled by a reparameterization scheme, as discussed in Lin and Lee (2007). To examine the performance of the proposed method, we present some simulation studies and analyze a real AIDS case study. The proposed algorithm and methods are implemented in the new R package ARpLMEC.

Keywords: Autoregressive AR(p) models; censored data; EM algorithm; HIV viral load; linear/nonlinear mixed-effects models

Change point detection in Cox proportional hazards mixture cure model

STATISTICAL METHODS IN MEDICAL RESEARCH, 2021, Vol. 30, No. 2, pp. 440-457

Authors: Wang, Bing; Li, Jialiang; Wang, Xiaoguang. Affiliations: Dalian Univ Technol Sch Math Sci Dalian Peoples R China; Natl Univ Singapore Duke Univ NUS Grad Med Sch Singapore Eye Res Inst Dept Stat & Appl Probabil Singapore Singapore

The mixture cure model has been widely applied to survival data in which a fraction of the observations never experience the event of interest, despite long-term follow-up. In this paper, we study the Cox proportional hazards mixture cure model where the covariate effects on the distribution of uncured subjects' failure time may jump when a covariate exceeds a change point. The nonparametric maximum likelihood estimation is used to obtain the semiparametric estimates. We employ a two-step computational procedure involving the Expectation-Maximization algorithm to implement the estimation. The consistency, convergence rate and asymptotic distributions of the estimators are carefully established under technical conditions and we show that the change point estimator is n-consistent. The m out of n bootstrap and the Louis algorithm are used to obtain the standard errors of the estimated change point and other regression parameter estimates, respectively. We also contribute a test procedure to check the existence of the change point. The finite sample performance of the proposed method is demonstrated via simulation studies and real data examples.

Keywords: Mixture cure model; change point detection; empirical processes; EM algorithm; subgroup identification

A Variational Approximations-DIC Rubric for Parameter Estimation and Mixture Model Selection Within a Family Setting

JOURNAL OF CLASSIFICATION, 2021, Vol. 38, No. 1, pp. 89-108

Authors: Subedi, Sanjeena; McNicholas, Paul D. Affiliations: SUNY Binghamton Dept Math Sci 4400 Vestal Pkwy East Binghamton NY 13902 USA; McMaster Univ Dept Math & Stat 1280 Main St W Hamilton ON L8S 4K1 Canada

Mixture model-based clustering has become an increasingly popular data analysis technique since its introduction over fifty years ago, and is now commonly utilized within a family setting. Families of mixture models arise when the component parameters, usually the component covariance (or scale) matrices, are decomposed and a number of constraints are imposed. Within the family setting, model selection involves choosing the member of the family, i.e., the appropriate covariance structure, in addition to the number of mixture components. To date, the Bayesian information criterion (BIC) has proved most effective for model selection, and the expectation-maximization (EM) algorithm is usually used for parameter estimation. In fact, this EM-BIC rubric has virtually monopolized the literature on families of mixture models. Deviating from this rubric, variational Bayes approximations are developed for parameter estimation and the deviance information criterion (DIC) for model selection. The variational Bayes approach provides an alternate framework for parameter estimation by constructing a tight lower bound on the complex marginal likelihood and maximizing this lower bound by minimizing the associated Kullback-Leibler divergence. The framework introduced, which we refer to as VB-DIC, is applied to the most commonly used family of Gaussian mixture models, and real and simulated data are used to compare it with the EM-BIC rubric.

Keywords: BIC; Clustering; DIC; EM algorithm; GPCM; Mixture models; Model-based clustering; Variational approximations; Variational Bayes; VB-DIC
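
The EM-BIC rubric that the abstract above takes as its baseline can be sketched on a toy problem: fit one-dimensional Gaussian mixtures by EM and pick the number of components by BIC. This is a minimal illustration under simplifying assumptions (univariate data, unconstrained variances, user-supplied starting means, BIC defined as -2 log-likelihood + parameters x log n so that lower is better). It does not implement the paper's family of covariance decompositions or its VB-DIC approach, and the names `em_gmm_1d` and `bic` are hypothetical.

```python
import math


def normal_pdf(x, mean, var):
    """Density of N(mean, var) at x."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def em_gmm_1d(data, init_means, n_iter=200):
    """Fit a 1D Gaussian mixture by EM, starting from the given means."""
    k, n = len(init_means), len(data)
    means = list(init_means)
    weights = [1.0 / k] * k
    overall_mean = sum(data) / n
    overall_var = sum((x - overall_mean) ** 2 for x in data) / n
    variances = [overall_var] * k
    for _ in range(n_iter):
        # E-step: posterior probability of each component for each point.
        resp = []
        for x in data:
            joint = [weights[j] * normal_pdf(x, means[j], variances[j]) for j in range(k)]
            total = sum(joint)
            resp.append([p / total for p in joint])
        # M-step: weighted updates of mixing weights, means and variances.
        for j in range(k):
            nj = sum(r[j] for r in resp)
            weights[j] = nj / n
            means[j] = sum(r[j] * x for r, x in zip(resp, data)) / nj
            var_j = sum(r[j] * (x - means[j]) ** 2 for r, x in zip(resp, data)) / nj
            variances[j] = max(var_j, 1e-6)  # floor to avoid degenerate components
    loglik = sum(
        math.log(sum(weights[j] * normal_pdf(x, means[j], variances[j]) for j in range(k)))
        for x in data
    )
    return weights, means, variances, loglik


def bic(loglik, n_params, n_obs):
    """BIC in the 'lower is better' convention."""
    return -2.0 * loglik + n_params * math.log(n_obs)


if __name__ == "__main__":
    data = [-2.1, -1.9, -2.0, -2.2, -1.8, 2.0, 2.1, 1.9, 2.2, 1.8]
    for starts in ([0.0], [-1.0, 1.0]):
        k = len(starts)
        _, means, _, ll = em_gmm_1d(data, starts)
        # A k-component 1D mixture has (k - 1) + k + k = 3k - 1 free parameters.
        print(k, means, bic(ll, 3 * k - 1, len(data)))
```

Some model-based clustering software reports BIC with the opposite sign (higher is better), so the convention should be checked before comparing values across tools.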

Regression analysis of mixed panel count data with informative indicator processes

STATISTICS IN MEDICINE, 2021, Vol. 40, No. 5, pp. 1262-1271

Authors: Ge, Lei; Zhu, Liang; Sun, Jianguo. Affiliations: Jilin Univ Ctr Appl Stat Res Sch Math Changchun Peoples R China; Univ Texas Hlth Sci Ctr Houston Div Clin & Translat Sci Dept Internal Med Houston TX 77030 USA; Univ Missouri Dept Stat Columbia MO 65211 USA

Panel count data occur often in event history studies and in these situations, one observes only incomplete information, the number of events rather than the occurrence times of each event, about the point processes of interest. Sometimes one may have to face a more complicated type of panel count data, mixed panel count data, in which instead of the number of events, one only knows if there is an occurrence of an event. Furthermore, this may depend on the underlying point process of interest or in other words, the point process of interest and the observation type process may be related. To address this, a sieve maximum likelihood estimation approach is proposed with the use of Bernstein polynomials, and for the implementation, an EM algorithm is developed. To assess the finite sample performance of the proposed approach, a simulation study is conducted and suggests that it works well for practical situations. The method is then applied to a motivating example about cancer survivors.

Keywords: Bernstein polynomial; EM algorithm; logistic model; proportional mean model

Matrix Completion, Counterfactuals, and Factor Analysis of Missing Data

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2021, Vol. 116, No. 536, pp. 1746-1763

Authors: Bai, Jushan; Ng, Serena. Affiliations: Columbia Univ Dept Econ 420 W 118 St MC 3308 New York NY 10025 USA; NBER New York NY USA

This article proposes an imputation procedure that uses the factors estimated from a tall block along with the re-rotated loadings estimated from a wide block to impute missing values in a panel of data. Assuming that a strong factor structure holds for the full panel of data and its sub-blocks, it is shown that the common component can be consistently estimated at four different rates of convergence without requiring regularization or iteration. An asymptotic analysis of the estimation error is obtained. An application of our analysis is estimation of counterfactuals when potential outcomes have a factor structure. We study the estimation of average and individual treatment effects on the treated and establish a normal distribution theory that can be useful for hypothesis testing.

Keywords: EM algorithm; Missing-at-random; Nuclear-norm regularization; Synthetic controls
