检索结果-内蒙古大学图书馆

Backward joint model and dynamic prediction of survival with multivariate longitudinal data

STATISTICS IN MEDICINE 2021年第20期40卷 4395-4409页

作者： Shen, Fan Li, Liang Univ Texas Dallas Sch Publ Hlth Dept Biostat & Data Sci Dallas TX USA Univ Texas MD Anderson Canc Ctr Dept Biostat Houston TX 77030 USA

An important approach to dynamic prediction of time-to-event outcomes using longitudinal data is based on modeling the joint distribution of longitudinal and time-to-event data. The widely used joint model for this purpose is the shared random effect model. Presumably, adding more longitudinal predictors improves the predictive accuracy. However, the shared random effect model can be computationally difficult or prohibitive when a large number of longitudinal variables are used. In this paper, we study an alternative way of modeling the joint distribution of longitudinal and time-to-event data. Under this formulation, the log-likelihood involves no more than one-dimensional integration, regardless of the number of longitudinal variables in the model. Therefore, this model is particularly suitable in dynamic prediction problems with large number of longitudinal predictors. The model fitting can be implemented with tractable and stable computation by using a combination of pseudo maximum likelihood estimation, Expectation-Maximization algorithm, and convex optimization. We evaluate the proposed methodology and its predictive accuracy with varying number of longitudinal variables using simulations and data from a primary biliary cirrhosis study.

关键词： dynamic prediction em algorithm joint modeling multivariate longitudinal data predictive accuracy survival analysis

来源：评论

学校读者我要写书评

暂无评论

A mixture of linear-linear regression models for a linear-circular regression

引用

STATISTICAL MODELLING 2021年第3期21卷 220-243页

作者： Sikaroudi, Ali Esmaieeli Park, Chiwoo JP Morgan Chase & Co Jacksonville FL USA Florida State Univ Dept Ind & Mfg Engn 2525 Pottsdamer St Tallahassee FL 32310 USA

We introduce a new approach to a linear-circular regression problem that relates multiple linear predictors to a circular response. We follow a modelling approach of a wrapped normal distribution that describes angular variables and angular distributions and advances them for a linear-circular regression analysis. Some previous works model a circular variable as projection of a bivariate Gaussian random vector on the unit square, and the statistical inference of the resulting model involves complicated sampling steps. The proposed model treats circular responses as the result of the modulo operation on unobserved linear responses. The resulting model is a mixture of multiple linear-linear regression models. We present two em algorithms for maximum likelihood estimation of the mixture model, one for a parametric model and another for a nonparametric model. The estimation algorithms provide a great trade-off between computation and estimation accuracy, which was numerically shown using five numerical examples. The proposed approach was applied to a problem of estimating wind directions that typically exhibit complex patterns with large variation and circularity.

关键词： circular data em algorithm Gibbs sampling Mixture of regressions

来源：评论

学校读者我要写书评

暂无评论

Multiparameter one-sided tests for nonlinear mixed effects models with censored responses

引用

STATISTICS IN MEDICINE 2021年第13期40卷 3138-3152页

作者： Zhou, Guohai Wu, Lang Harvard Med Sch Brigham & Womens Hosp Ctr Clin Invest Boston MA 02115 USA Univ British Columbia Dept Stat Vancouver BC Canada

Nonlinear mixed-effects (NLME) models are commonly used in longitudinal studies such as pharmacokinetics and HIV viral dynamics studies. NLME models are often derived based on underlying data-generating mechanisms, therefore the parameters in these models often have natural physical interpretations that may suggest reasonable constraints on certain parameters. For example, the HIV viral decay rates for populations receiving anti-HIV treatments may be reasonably expected to be nonnegative. Hypothesis testing for these parameters should incorporate practically reasonable constraints to increase statistical power. Motivated from HIV viral dynamic models, in this article we propose multiparameter one-sided or constrained tests for NLME models with censored responses, for example, viral dynamic models with viral loads subject to lower detection limits. We propose approximate likelihood-based tests that are computationally efficient. We evaluate the tests via simulations and show that the proposed tests are more powerful than the corresponding two-sided or unrestricted tests. We apply the proposed tests to two AIDS datasets with new findings.

关键词： constrained test em algorithm likelihood linearization power

来源：评论

学校读者我要写书评

暂无评论

Regression analysis of arbitrarily censored survival data under the proportional odds model

引用

STATISTICS IN MEDICINE 2021年第16期40卷 3724-3739页

作者： Wang, Lu Wang, Lianming Western New England Univ Dept Math Springfield MA 01119 USA Univ South Carolina Dept Stat Columbia SC USA

Arbitrarily censored data are referred to as the survival data that contain a mixture of exactly observed, left-censored, interval-censored, and right-censored observations. Existing research work on regression analysis on arbitrarily censored data is relatively sparse and mainly focused on the proportional hazards model and the accelerated failure time model. This article studies the proportional odds (PO) model and proposes a novel estimation approach through an expectation-maximization (em) algorithm for analyzing such data. The proposed em algorithm has many appealing properties such as being robust to initial values, easy to implement, converging fast, and providing the variance estimate of the regression parameter estimate in closed form. An informal diagnosis plot is developed for checking the PO model assumption. Our method has shown excellent performance in estimating the regression parameters as well as the baseline survival function in a simulation study. A real-life dataset about metastatic colorectal cancer is analyzed for illustration. An R package regPO has been created for practitioners to implement our method.

关键词： arbitrarily censored data data augmentation em algorithm monotone spline proportional odds model semiparametric regression

来源：评论

学校读者我要写书评

暂无评论

Enhancing cure rate analysis through integration of machine learning models: a comparative study

引用

STATISTICS AND COMPUTING 2024年第4期34卷 142-142页

作者： Aselisewine, Wisdom Pal, Suvra Univ Texas Arlington Dept Math Arlington TX 76019 USA Univ Texas Arlington Coll Sci Div Data Sci Arlington TX 76019 USA

Cure rate models have been thoroughly investigated across various domains, encompassing medicine, reliability, and finance. The merging of machine learning (ML) with cure models is emerging as a promising strategy to improve predictive accuracy and gain profound insights into the underlying mechanisms influencing the probability of cure. The current body of literature has explored the benefits of incorporating a single ML algorithm with cure models. However, there is a notable absence of a comprehensive study that compares the performances of various ML algorithms in this context. This paper seeks to address and bridge this gap. Specifically, we focus on the well-known mixture cure model and examine the incorporation of five distinct ML algorithms: extreme gradient boosting, neural networks, support vector machines, random forests, and decision trees. To bolster the robustness of our comparison, we also include cure models with logistic and spline-based regression. For parameter estimation, we formulate an expectation maximization algorithm. A comprehensive simulation study is conducted across diverse scenarios to compare various models based on the accuracy and precision of estimates for different quantities of interest, along with the predictive accuracy of cure. The results derived from both the simulation study, as well as the analysis of real cutaneous melanoma data, indicate that the incorporation of ML models into cure model provides a beneficial contribution to the ongoing endeavors aimed at improving the accuracy of cure rate estimation.

关键词： Machine learning Mixture cure model em algorithm Proportional hazard Predictive accuracy

来源：评论

学校读者我要写书评

暂无评论

A prediction model for healthcare time-series data with a mixture of deep mixed effect models using Gaussian processes

引用

BIOMEDICAL SIGNAL PROCESSING AND CONTROL 2023年第1期84卷

作者： Hong, Jaehyoung Chun, Hyonho Korea Adv Inst Sci & Technol Dept Math Sci Daejeon 34141 South Korea

Healthcare outcomes such as blood pressure and heart rate are commonly tracked across time owing to technological advances in wearable devices. This advance then makes it possible to predict health risks and to practice personalized medicine. For this type of healthcare data, it is important to reflect huge variation among subjects where the subject becomes an experimental unit. The person-specific model becomes critical for accurate prediction, but it is not optimal due to the noisy nature of the data. It has been demonstrated that sharing information across subjects via a mixed effect model can improve the prediction of individual responses compared to a completely personalized model. However, sharing information across all patients can dilute signals when there are several different patterns present in the data. That is, subjects may form groups and each group behaves differently. To reflect this feature, we extend a deep mixed effect model via a mixture of deep mixed effect models. Our mixed effect model is based on Gaussian processes where the mean adopts the deep neural networks to capture flexible time trends. Our model finds a highly nonlinear trend shared among segments of patients while clustering patients with similar trends into groups. Our approach shows great performance in simulation studies as well as real data analysis, emphasizing the importance of modeling group-specific trends when making accurate predictions from healthcare time-series data.

关键词： Clustering em algorithm Gaussian mixture model Gaussian process Healthcare Mixed effect model

来源：评论

学校读者我要写书评

暂无评论

Multilevel superposition for deciphering the conformational variability of protein ensembles

引用

BRIEFINGS IN BIOINFORMATICS 2024年第3期25卷 bbae137-bbae137页

作者： Amisaki, Takashi Tottori Univ Fac Med Dept Biol Regulat 86 Nishi Cho Yonago Tottori 6838503 Japan

The dynamics and variability of protein conformations are directly linked to their functions. Many comparative studies of X-ray protein structures have been conducted to elucidate the relevant conformational changes, dynamics and heterogeneity. The rapid increase in the number of experimentally determined structures has made comparison an effective tool for investigating protein structures. For example, it is now possible to compare structural ensembles formed by enzyme species, variants or the type of ligands bound to them. In this study, the author developed a multilevel model for estimating two covariance matrices that represent inter- and intra-ensemble variability in the Cartesian coordinate space. Principal component analysis using the two estimated covariance matrices identified the inter-/intra-enzyme variabilities, which seemed to be important for the enzyme functions, with the illustrative examples of cytochrome P450 family 2 enzymes and class A $\beta$-lactamases. In P450, in which each enzyme has its own active site of a distinct size, an active-site motion shared universally between the enzymes was captured as the first principal mode of the intra-enzyme covariance matrix. In this case, the method was useful for understanding the conformational variability after adjusting for the differences between enzyme sizes. The developed method is advantageous in small ensemble-size problems and hence promising for use in comparative studies on experimentally determined structures where ensemble sizes are smaller than those generated, for example, by molecular dynamics simulations.

关键词： random effects model structural superposition em algorithm covariance matrix principal component analysis

来源：评论

学校读者我要写书评

暂无评论

Maximum likelihood estimation for semiparametric regression models with panel count data

引用

BIOMETRIKA 2021年第4期108卷 947-963页

作者： Zeng, Donglin Lin, D. Y. Univ N Carolina Dept Biostat 3101 McGavran Greenberg Hall Chapel Hill NC 27599 USA

Panel count data, in which the observation for each study subject consists of the number of recurrent events between successive examinations, are commonly encountered in industrial reliability testing, medical research and other scientific investigations. We formulate the effects of potentially time-dependent covariates on one or more types of recurrent events through nonhomogeneous Poisson processes with random effects. We employ nonparametric maximum likelihood estimation under arbitrary examination schemes, and develop a simple and stable em algorithm. We show that the resulting estimators of the regression parameters are consistent and asymptotically normal, with a covariance matrix that achieves the semiparametric efficiency bound and can be estimated using profile likelihood. We evaluate the performance of the proposed methods through simulation studies and analysis of data from a skin cancer clinical trial.

关键词： em algorithm Interval censoring Nonhomogeneous Poisson process Nonparametric likelihood Proportional means model Random effect Recurrent event Semiparametric efficiency Time-dependent covariate

来源：评论

学校读者我要写书评

暂无评论

Promotion time cure rate model with a neural network estimated nonparametric component

引用

STATISTICS IN MEDICINE 2021年第15期40卷 3516-3532页

作者： Xie, Yujing Yu, Zhangsheng Shanghai Jiao Tong Univ Sch Math Sci Shanghai Peoples R China Shanghai Jiao Tong Univ SJTU Yale Joint Ctr Biostat Dept Bioinformat & Biostat Shanghai Peoples R China

Promotion time cure rate models (PCM) are often used to model the survival data with a cure fraction. Medical images or biomarkers derived from medical images can be the key predictors in survival models. However, incorporating images in the PCM is challenging using traditional nonparametric methods such as splines. We propose to use neural network to model the nonparametric or unstructured predictors' effect in the PCM context. Expectation-maximization algorithm with neural network for the M-step is used for parameter estimation. Asymptotic properties of the proposed estimates are derived. Simulation studies show good performance in terms of both prediction and estimation. We finally apply our methods to analyze the brain images from open access series of imaging studies data.

关键词： convergence rate cure rate models em algorithm machine learning survival analysis

来源：评论

学校读者我要写书评

暂无评论

A new algorithm for fitting semi-parametric variance regression models

引用

COMPUTATIONAL STATISTICS 2021年第4期36卷 2313-2335页

作者： Robledo, Kristy P. Marschner, Ian C. Univ Sydney NHMRC Clin Trials Ctr Locked Bag 77 Camperdown NSW 1450 Australia

Variance regression allows for heterogeneous variance, or heteroscedasticity, by incorporating a regression model into the variance. This paper uses a variant of the expectation-maximisation algorithm to develop a new method for fitting additive variance regression models that allow for regression in both the mean and the variance. The algorithm is easily extended to allow for B-spline bases, thus allowing for the incorporation of a semi-parametric model in both the mean and variance. Although there are existing methods to fit these types of models, this new algorithm provides a reliable alternative approach that is not susceptible to numerical instability that can arise in this constrained estimation context. We utilise the developed algorithm with a series of simulation studies and analyse illustrative data. Various simulation studies show that the algorithm can recover the true model for a variety of scenarios. We also study automatic selection of model complexity based on information-based criteria, and show that the Akaike information criterion is useful for choosing the optimal number of knots in a B-spline model. An R package is available for implementing these methods.

关键词： Variance regression Semi-parametric regression em algorithm B-splines

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：