检索结果-内蒙古大学图书馆

Efficient estimation for the non-mixture cure model with current status data

STATISTICS 2020年第4期54卷 756-777页

作者： Wang, Xiaoguang Han, Bo Dalian Univ Technol Sch Math Sci Dalian Liaoning Peoples R China

Medical advances including the neoadjuvant anti-PD-1 immunotherapy play a role in promoting clinical outcomes such as improved overall and progression-free survival probabilities. This paper considers the regression analysis of current status data with a cured subgroup in the population using a semiparametric non-mixture cure model. We propose a sieve maximum likelihood estimation for the model with the Bernstein polynomials. Moreover, an expectation-maximization (em) algorithm is developed under the non-mixture cure model to calculate the estimators for both parametric and non-parametric components. Under some mild conditions, the asymptotic properties of the estimators are established, including the strong consistency, the convergence rate and the asymptotic normality. Simulation studies are conducted to investigate the finite sample performance of the proposed estimators. A real dataset from the tumorigenicity experiment is analysed for illustration.

关键词： Current status data non-mixture cure model sieve maximum likelihood estimation em algorithm asymptotic normality

来源：评论

学校读者我要写书评

暂无评论

Estimating the intervention effect in calibration substudies

引用

STATISTICS IN MEDICINE 2020年第3期39卷 239-251页

作者： Talitman, Michal Gorfine, Malka Steinberg, David M. Tel Aviv Univ Dept Stat & Operat Res Tel Aviv Israel

Exposure assessment is often subject to measurement errors. We consider here the analysis of studies aimed at reducing exposure to potential health hazards, in which exposure is the outcome variable. In these studies, the intervention effect may be estimated using either biomarkers or self-report data, but it is not common to combine these measures of exposure. Bias in the self-reported measures of exposure is a well-known fact;however, only few studies attempt to correct it. Recently, Keogh et al addressed this problem, presenting a model for measurement error in this setting and investigating how self-report and biomarker data can be combined. Keogh et al find the maximum likelihood estimate for the intervention effect in their model via direct numerical maximization of the likelihood. Here, we exploit an alternative presentation of the model that leads us to a closed formula for the MLE and also for its variance, when the number of biomarker replicates is the same for all subjects in the substudy. The variance formula enables efficient design of such intervention studies. When the number of biomarker replicates is not constant, our approach can be used along with the em-algorithm to quickly compute the MLE. We compare the MLE to Buonaccorsi's method (Buonaccorsi, 1996) and find that they have similar efficiency when most subjects have biomarker data, but that the MLE has clear advantages when only a small fraction of subjects has biomarker data. This conclusion extends the findings of Keogh et al (2016) and has practical importance for efficiently designing studies.

关键词： em algorithm intervention effect measurement error self-report data

来源：评论

学校读者我要写书评

暂无评论

Relationship validation among items and attributes

引用

JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION 2020年第18期90卷 3360-3375页

作者： Qin, Chunying Jia, Shuang Fang, Xingwu Yu, Xiaofeng Nanchang Normal Univ Dept Math & Comp Sci Nanchang Jiangxi Peoples R China Bozhou Univ Dept Elect & Informat Engn Bozhou Peoples R China Bozhou Univ Div Human Resources Bozhou Peoples R China Jiangxi Normal Univ Sch Psychol Nanchang Jiangxi Peoples R China

In the typical application of a cognitive diagnosis assessment, the Q matrix, which illustrates the relationship between skills and the items, is assumed to be known. However, the Q matrix is usually determined by subject matter experts and test developers, and so there may be misspecification of its elements. This paper proposes a data-driven approach to jointly estimate the Q matrix, model parameters, and the examinees' skill profiles. The key component is the likelihood ratio statistic that relates the Q matrix, responses, and the cognitive diagnosis model. Simulation studies show that the Q matrix validation based on the likelihood ratio statistic has a promising performance.

关键词： Formative assessment cognitive diagnosis Q matrix likelihood ratio DINA em algorithm

来源：评论

学校读者我要写书评

暂无评论

A skew factor analysis model based on the normal mean-variance mixture of Birnbaum-Saunders distribution

引用

JOURNAL OF APPLIED STATISTICS 2020年第16期47卷 3007-3029页

作者： Hashemi, Farzane Naderi, Mehrdad Jamalizadeh, Ahad Lin, Tsung-, I Shahid Bahonar Univ Kerman Fac Math & Comp Dept Stat Kermana Iran Univ Pretoria Fac Nat & Agr Sci Dept Stat Pretoria South Africa Natl Chung Hsing Univ Inst Stat Taichung Taiwan China Med Univ Dept Publ Hlth Taichung Taiwan

This paper presents a robust extension of factor analysis model by assuming the multivariate normal mean-variance mixture of Birnbaum-Saunders distribution for the unobservable factors and errors. A computationally analytical em-based algorithm is developed to find maximum likelihood estimates of the parameters. The asymptotic standard errors of parameter estimates are derived under an information-based paradigm. Numerical merits of the proposed methodology are illustrated using both simulated and real datasets.

关键词： Birnbaum-Saunders distribution em algorithm factor analysis skewness outliers

来源：评论

学校读者我要写书评

暂无评论

A bivariate inverse Weibull distribution and its application in complementary risks model

引用

JOURNAL OF APPLIED STATISTICS 2020年第6期47卷 1084-1108页

作者： Mondal, Shuvashree Kundu, Debasis Indian Inst Technol Dept Math & Stat Kanpur 208016 Uttar Pradesh India

In reliability and survival analysis the inverse Weibull distribution has been used quite extensively as a heavy tailed distribution with a non-monotone hazard function. Recently a bivariate inverse Weibull (BIW) distribution has been introduced in the literature, where the marginals have inverse Weibull distributions and it has a singular component. Due to this reason this model cannot be used when there are no ties in the data. In this paper we have introduced an absolutely continuous bivariate inverse Weibull (ACBIW) distribution omitting the singular component from the BIW distribution. A natural application of this model can be seen in the analysis of dependent complementary risks data. We discuss different properties of this model and also address the inferential issues both from the classical and Bayesian approaches. In the classical approach, the maximum likelihood estimators cannot be obtained explicitly and we propose to use the expectation maximization algorithm based on the missing value principle. In the Bayesian analysis, we use a very flexible prior on the unknown model parameters and obtain the Bayes estimates and the associated credible intervals using importance sampling technique. Simulation experiments are performed to see the effectiveness of the proposed methods and two data sets have been analyzed to see how the proposed methods and the model work in practice.

关键词： Marshall-Olkin bivariate distribution block and basu bivariate distribution maximum likelihood estimation em algorithm Gamma-Dirichlet prior complementary risk

来源：评论

学校读者我要写书评

暂无评论

Adaptive lasso for the Cox regression with interval censored and possibly left truncated data

引用

STATISTICAL METHODS IN MEDICAL RESEARCH 2020年第4期29卷 1243-1255页

作者： Li, Chenxi Pak, Daewoo Todem, David Michigan State Univ Dept Epidemiol & Biostat E Lansing MI 48824 USA Univ Texas MD Anderson Canc Ctr Dept Biostat Houston TX 77030 USA

We propose a penalized variable selection method for the Cox proportional hazards model with interval censored data. It conducts a penalized nonparametric maximum likelihood estimation with an adaptive lasso penalty, which can be implemented through a penalized em algorithm. The method is proven to enjoy the desirable oracle property. We also extend the method to left truncated and interval censored data. Our simulation studies show that the method possesses the oracle property in samples of modest sizes and outperforms available existing approaches in many of the operating characteristics. An application to a dental caries data set illustrates the method's utility.

关键词： Adaptive lasso Caries research em algorithm interval censoring left truncation oracle property semiparametric inference variable selection

来源：评论

学校读者我要写书评

暂无评论

SEASONAL WARRANTY PREDICTION BASED ON RECURRENT EVENT DATA

引用

ANNALS OF APPLIED STATISTICS 2020年第2期14卷 929-955页

作者： Shan, Qianqian Hong, Yili Meeker, William Q. Iowa State Univ Dept Stat Ames IA 50011 USA Virginia Tech Dept Stat Blacksburg VA USA

Warranty return data from repairable systems, such as home appliances, lawn mowers, computers and automobiles, result in recurrent event data. The nonhomogeneous Poisson process (NHPP) model is used widely to describe such data. Seasonality in the repair frequencies and other variabilities, however, complicate the modeling of recurrent event data. Not much work has been done to address the seasonality, and this paper provides a general approach for the application of NHPP models with dynamic covariates to predict seasonal warranty returns. The methods presented here, however, can be applied to other applications that result in seasonal recurrent event data. A hierarchical clustering method is used to stratify the population into groups that are more homogeneous than the overall population. The stratification facilitates modeling the recurrent event data with both time-varying and time-constant covariates. We demonstrate and validate the models using warranty claims data for two different types of products. The results show that our approach provides important improvements in the predictive power of monthly events compared with models that do not take the seasonality and covariates into account.

关键词： em algorithm hierarchical clustering missing data NHPP random effects seasonal dynamic covariates

来源：评论

学校读者我要写书评

暂无评论

A robust joint modeling approach for longitudinal data with informative dropouts

引用

COMPUTATIONAL STATISTICS 2020年第4期35卷 1759-1783页

作者： Zhang, Weiping Xie, Feiyue Tan, Jiaxin Univ Sci & Technol China Sch Management Dept Stat & Finance Hefei 230026 Peoples R China

This article proposes a robust method for analysing longitudinal continuous responses with informative dropouts and potential outliers by using the multivariatet-distribution. We specify a dropout mechanism and a missing covariate distribution and incorporate them into the complete data log-likelihood. Unlike the existing approaches which mainly focus on the inference of regression mean and dropouts process, our approach aims to reveal the dynamics in the location function, marginal scale function and association by joint parsimonious modeling the location and dependence structure. A parametric fractional imputation algorithm is developed to speed up the computation associated with the em algorithm for maximum likelihood estimation with missing data. The resulting estimators are shown to be consistent and asymptotically normally distributed. Data examples and simulations demonstrate the effectiveness of the proposed approach.

关键词： Robust estimation em algorithm Joint modeling

来源：评论

学校读者我要写书评

暂无评论

Imputation and low-rank estimation with Missing Not At Random data

引用

STATISTICS AND COMPUTING 2020年第6期30卷 1629-1643页

作者： Sportisse, Aude Boyer, Claire Josse, Julie Sorbonne Univ Lab Probabilites Stat & Modelisat Paris France Ecole Polytech Ctr Math Appl Palaiseau France Ecole Normale Super Dept Math & Applicat Paris France INRIA Saclay XPOP Palaiseau France

Missing values challenge data analysis because many supervised and unsupervised learning methods cannot be applied directly to incomplete data. Matrix completion based on low-rank assumptions are very powerful solution for dealing with missing values. However, existing methods do not consider the case of informative missing values which are widely encountered in practice. This paper proposes matrix completion methods to recover Missing Not At Random (MNAR) data. Our first contribution is to suggest a model-based estimation strategy by modelling the missing mechanism distribution. An em algorithm is then implemented, involving a Fast Iterative Soft-Thresholding algorithm (FISTA). Our second contribution is to suggest a computationally efficient surrogate estimation by implicitly taking into account the joint distribution of the data and the missing mechanism: the data matrix is concatenated with the mask coding for the missing values;a low-rank structure for exponential family is assumed on this new matrix, in order to encode links between variables and missing mechanisms. The methodology that has the great advantage of handling different missing value mechanisms is robust to model specification errors. The performances of our methods are assessed on the real data collected from a trauma registry (TraumaBase (R)) containing clinical information about over twenty thousand severely traumatized patients in France. The aim is then to predict if the doctors should administrate tranexomic acid to patients with traumatic brain injury, that would limit excessive bleeding.

关键词： Informative missing values Denoising Matrix completion Accelerated proximal gradient method em algorithm Nuclear norm penalty

来源：评论

学校读者我要写书评

暂无评论

Likelihood-based quantile autoregressive distributed lag models and its applications

引用

JOURNAL OF APPLIED STATISTICS 2020年第1期47卷 117-131页

作者： Tian, Yuzhu Wang, Liyong Tang, Manlai Zang, Yanchao Tian, Maozai Northwest Normal Univ Sch Math & Stat Lanzhou Gansu Peoples R China Henan Univ Sci & Technol Sch Math & Stat Luoyang Peoples R China Cent Univ Finance & Econ Sch Stat & Math Beijing Peoples R China Hang Seng Univ Hong Kong Dept Math & Stat Hong Kong Peoples R China Renmin Univ China Sch Stat Beijing Peoples R China

Time lag effect exists widely in the course of economic operation. Some economic variables are affected not only by various factors in the current period but also by various factors in the past and even their own past values. As a class of dynamical models, autoregressive distributed lag (ARDL) models are frequently used to conduct dynamic regression analysis. In this paper, we are interested in the quantile regression (QR) modeling of the ARDL model in a dynamic framework. By combining the working likelihood of asymmetric Laplace distribution (ALD) with the expectation-maximization (em) algorithm into the considered ARDL model, the iterative weighted least square estimators (IWLSE) are derived. Some Monte Carlo simulations are implemented to evaluate the performance of the proposed estimation method. A dataset of the consumption of electricity by residential customers is analyzed to illustrate the application.

关键词： Dynamic regression model QR analysis em algorithm information criterion electricity consumption

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：