检索结果-内蒙古大学图书馆

SUBGROUP-EFFECTS MODELS FOR THE ANALYSIS OF PERSONAL TREATMENT EFFECTS

ANNALS OF APPLIED STATISTICS 2022年第1期16卷 80-103页

作者： Zhou, Ling Sun, Shiquan Fu, Haoda Song, Peter X-K Southwestern Univ Finance & Econ Ctr Stat Res Chengdu Peoples R China Xi An Jiao Tong Univ Sch Publ Hlth Xian Peoples R China Eli Lilly & Co Indianapolis IN 46285 USA Univ Michigan Dept Biostat Ann Arbor MI 48109 USA

The emerging field of precision medicine is transforming statistical analysis from the classical paradigm of population-average treatment effects into that of personal treatment effects. This new scientific mission has called for adequate statistical methods to assess heterogeneous covariate effects in regression analysis. This paper focuses on a subgroup analysis that consists of two primary analytic tasks: identification of treatment effect subgroups and individual group memberships, and statistical inference on treatment effects by subgroup. We propose an approach to synergizing supervised clustering analysis via alternating direction method of multipliers (ADMM) algorithm and statistical inference on subgroup effects via expectation-maximization (em) algorithm. Our proposed procedure, termed as hybrid operation for subgroup analysis (HOSA), enjoys computational speed and numerical stability with interpretability and reproducibility. We establish key theoretical properties for both proposed clustering and inference procedures. Numerical illustration includes extensive simulation studies and analyses of motivating data from two randomized clinical trials to learn subgroup treatment effects.

关键词： ADMM algorithm em algorithm maximum likelihood precision medicine supervised clustering

来源：评论

学校读者我要写书评

暂无评论

A multivariate zero-inflated binomial model for the analysis of correlated proportional data

引用

JOURNAL OF APPLIED STATISTICS 2022年第11期49卷 2740-2766页

作者： Deng, Dianliang Sun, Yiguang Tian, Guo-Liang Univ Regina Dept Math & Stat Regina SK S4S 0A2 Canada Southern Univ Sci & Technol Dept Stat & Data Sci Shenzhen Guangdong Peoples R China

In this paper, a new multivariate zero-inflated binomial (MZIB) distribution is proposed to analyse the correlated proportional data with excessive zeros. The distributional properties of purposed model are studied. The Fisher scoring algorithm and em algorithm are given for the computation of estimates of parameters in the proposed MZIB model with/without covariates. The score tests and the likelihood ratio tests are derived for assessing both the zero-inflation and the equality of multiple binomial probabilities in correlated proportional data. A limited simulation study is performed to evaluate the performance of derived em algorithms for the estimation of parameters in the model with/without covariates and to compare the nominal levels and powers of both score tests and likelihood ratio tests. The whitefly data is used to illustrate the proposed methodologies.

来源：评论

学校读者我要写书评

暂无评论

An incremental loss ratio method using prior information on calendar year effects

引用

EUROPEAN ACTUARIAL JOURNAL 2023年第1期13卷 91-123页

作者： Riegel, Ulrich Munich Reinsurance Co Koniginstr 107 D-80802 Munich Germany

In a run-off triangle external factors can have a similar influence on all incremental losses of the same calendar year. This can distort the triangle such that reserving methods like chain ladder or the loss ratio method do not work properly. A very recent example of such an external factor is the Covid-19 pandemic. In many countries, the insurance industry is in the process of establishing market knowledge about the impact of the pandemic on premiums and losses. We extend the additive claims reserving model to allow for calendar year effects and develop a variant of the incremental loss ratio method (also known as the additive method) that can make use of such market knowledge. We derive formulas for the mean squared error of prediction and provide a detailed numerical example.

关键词： Additive claims reserving model Loss development Calendar year effects modeling em algorithm

来源：评论

学校读者我要写书评

暂无评论

Sampling-based Gaussian Mixture Regression for Big Data

引用

Journal of Data Science 2023年第1期21卷 158-172页

作者： Lee, JooChul Schifano, Elizabeth D. Wang, HaiYing Department of Statistics University of Connecticut Storrs 06269 CT United States

This paper proposes a nonuniform subsampling method for finite mixtures of regression models to reduce large data computational tasks. A general estimator based on a subsample is investigated, and its asymptotic normality is established. We assign optimal subsampling probabilities to data points that minimize the asymptotic mean squared errors of the general estimator and linearly transformed estimators. Since the proposed probabilities depend on unknown parameters, an implementable algorithm is developed. We first approximate the optimal subsampling probabilities using a pilot sample. After that, we select a subsample using the approximated subsampling probabilities and compute estimates using the subsample. We evaluate the proposed method in a simulation study and present a real data example using appliance energy data. © 2023 The Author(s).

关键词： em algorithm massive data optimal probabilities supsampling

来源：评论

学校读者我要写书评

暂无评论

A multi-dimensional integrative scoring framework for predicting functional variants in the human genome

引用

AMERICAN JOURNAL OF HUMAN GENETICS 2022年第3期109卷 446-456页

作者： Li, Xihao Young, Godwin Zhou, Hufeng Sun, Ryan Li, Zilin Hou, Kangcheng Zhang, Martin Jinye Liu, Yaowu Arapoglou, Theodore Wang, Chen Ionita-Laza, Iuliana Lin, Xihong Harvard TH Chan Sch Publ Hlth Dept Biostat Boston MA 02115 USA Methods Collaborat & Outreach Grp Genent Roche San Francisco CA 94080 USA Univ Texas MD Anderson Canc Ctr Dept Biostat Houston TX 77030 USA Univ Calif Los Angeles David Geffen Sch Med Dept Pathol & Lab Med Los Angeles CA 90095 USA Harvard TH Chan Sch Publ Hlth Dept Epidemiol Boston MA 02115 USA Broad Inst Harvard & MIT Program Med & Populat Genet Cambridge MA 02142 USA SouthWestern Univ Finance & Econ Sch Stat Chengdu Sichuan Peoples R China Columbia Univ Dept Biostat Mailman Sch Publ Hlth New York NY 10032 USA Harvard Univ Dept Stat Cambridge MA 02138 USA

Attempts to identify and prioritize functional DNA elements in coding and non-coding regions, particularly through use of in silico functional annotation data, continue to increase in popularity. However, specific functional roles can vary widely from one variant to another, making it challenging to summarize different aspects of variant function with a one-dimensional rating. Here we propose multi-dimensional annotation-class integrative estimation (MACIE), an unsupervised multivariate mixed-model framework capable of integrating annotations of diverse origin to assess multi-dimensional functional roles for both coding and non-coding variants. Unlike existing one-dimensional scoring methods, MACIE views variant functionality as a composite attribute encompassing multiple characteristics and estimates the joint posterior functional probabilities of each genomic position. This estimate offers more comprehensive and interpretable information in the presence of multiple aspects of functionality. Applied to a variety of independent coding and non-coding datasets, MACIE demonstrates powerful and robust performance in discriminating between functional and non-functional variants. We also show an application of MACIE to fine-mapping and heritability enrichment analysis by using the lipids GWAS summary statistics data from the European Network for Genetic and Genomic Epidemiology Consortium.

关键词： functional annotations multi-dimensional integrated scores prediction of functional effect generalized linear mixed model em algorithm

来源：评论

学校读者我要写书评

暂无评论

MAP segmentation in Bayesian hidden Markov models: a case study

引用

JOURNAL OF APPLIED STATISTICS 2022年第5期49卷 1203-1234页

作者： Koloydenko, Alexey Kuljus, Kristi Lember, Juri Royal Holloway Univ London London England Univ Tartu Inst Math & Stat Tartu Estonia

We consider the problem of estimating the maximum posterior probability (MAP) state sequence for a finite state and finite emission alphabet hidden Markov model (HMM) in the Bayesian setup, where both emission and transition matrices have Dirichlet priors. We study a training set consisting of thousands of protein alignment pairs. The training data is used to set the prior hyperparameters for Bayesian MAP segmentation. Since the Viterbi algorithm is not applicable any more, there is no simple procedure to find the MAP path, and several iterative algorithms are considered and compared. The main goal of the paper is to test the Bayesian setup against the frequentist one, where the parameters of HMM are estimated using the training data.

关键词： Hidden Markov model Bayesian inference MAP sequence viterbi algorithm em algorithm

来源：评论

学校读者我要写书评

暂无评论

Nonparametric estimation for competing risks survival data subject to left truncation and interval censoring

引用

COMPUTATIONAL STATISTICS 2022年第1期37卷 29-42页

作者： Shen, Pao-sheng Tunghai Univ Dept Stat Taichung 40704 Taiwan

In this article, we consider nonparametric estimation of the cumulative incidence function (CIF) for left-truncated and interval-censored competing risks (LT-ICC) data. To reduce the bias of the pseudo-likelihood estimator (PLE) of CIF in the literature, we proposed two alternative estimators. The first estimator, called the modified PLE (MPLE), is obtained based on the modified NPMLE of F(t). The second estimator, called the modified maximum likelihood estimator (MMLE), is derived using modified likelihood functions for LT-ICC data, where the left endpoints of the intervals for left-censored observations with failure type j are the maximum of left-truncated variables and the estimated left endpoint of the support of the observations. Simulation studies show that the MPLE and MMLE are less biased than the PLE for most of the cases considered and their standard deviations are significantly smaller than that of the PLE.

关键词： NPMLE Cumulative incidence function Left truncation em algorithm

来源：评论

学校读者我要写书评

暂无评论

Statistical integration of heterogeneous omics data: Probabilistic two-way partial least squares (PO2PLS)

引用

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS 2022年第5期71卷 1451-1470页

作者： el Bouhaddani, Said Uh, Hae-Won Jongbloed, Geurt Houwing-Duistermaat, Jeanine UMC Utrecht Dept Data Sci & Biostat Utrecht Netherlands Delft Univ Technol Delft Inst Appl Math Delft Netherlands Univ Leeds Dept Stat Leeds W Yorkshire England Univ Bologna Dept Stat Sci Bologna Italy

The availability of multi-omics data has revolutionized the life sciences by creating avenues for integrated system-level approaches. Data integration links the information across datasets to better understand the underlying biological processes. However, high dimensionality, correlations and heterogeneity pose statistical and computational challenges. We propose a general framework, probabilistic two-way partial least squares (PO2PLS), that addresses these challenges. PO2PLS models the relationship between two datasets using joint and data-specific latent variables. For maximum likelihood estimation of the parameters, we propose a novel fast em algorithm and show that the estimator is asymptotically normally distributed. Aglobal test for the relationship between two datasets is proposed, specifically addressing the high dimensionality, and its asymptotic distribution is derived. Notably, several existing data integration methods are special cases of PO2PLS. Via extensive simulations, we show that PO2PLS performs better than alternatives in feature selection and prediction performance. In addition, the asymptotic distribution appears to hold when the sample size is sufficiently large. We illustrate PO2PLS with two examples from commonly used study designs: a large population cohort and a small case-control study. Besides recovering known relationships, PO2PLS also identified novel findings. The methods are implemented in our R-package PO2PLS.

关键词： em algorithm global test heterogeneity identifiability latent variable models probabilistic O2PLS

来源：评论

学校读者我要写书评

暂无评论

Quantile mixed hidden Markov models for multivariate longitudinal data: An application to children's Strengths and Difficulties Questionnaire scores

引用

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS 2022年第2期71卷 417-448页

作者： Merlo, Luca Petrella, Lea Tzavidis, Nikos Sapienza Univ Rome Dept Stat Sci Rome Italy Sapienza Univ Rome MEMOTEF Dept Rome Italy Univ Southampton Southampton Stat Sci Res Inst Dept Social Stat & Demog Southampton Hants England

The identification of factors associated with mental and behavioural disorders in early childhood is critical both for psychopathology research and the support of primary health care practices. Motivated by the Millennium Cohort Study, in this paper we study the effect of a comprehensive set of covariates on children's emotional and behavioural trajectories in England. To this end, we develop a quantile mixed hidden Markov model for joint estimation of multiple quantiles in a linear regression setting for multivariate longitudinal data. The novelty of the proposed approach is based on the multivariate asymmetric Laplace distribution which allows to jointly estimate the quantiles of the univariate conditional distributions of a multivariate response, accounting for possible correlation between the outcomes. Sources of unobserved heterogeneity and serial dependency due to repeated measures are modelled through the introduction of individual-specific, time-constant random coefficients and time-varying parameters evolving over time with a Markovian structure respectively. The inferential approach is carried out through the construction of a suitable expectation-maximization algorithm without parametric assumptions on the random effects distribution.

关键词： em algorithm finite mixtures multivariate asymmetric Laplace distribution non-parametric maximum likelihood quantile regression random effects model

来源：评论

学校读者我要写书评

暂无评论

Statistical inference for normal mixtures with unknown number of components

引用

ELECTRONIC JOURNAL OF STATISTICS 2022年第2期16卷 5149-5181页

作者： Huang, Mian Tang, Shiyi Yao, Weixin Shanghai Univ Finance & Econ Sch Stat & Management Shanghai Peoples R China Univ Calif Riverside Dept Stat Riverside CA 92521 USA

Statistical inference for normal mixture models with unknown number of components has long been challenging due to the issues of non-identifiability, degenerated Fisher matrix, and boundary parameters. In this paper, a penalized likelihood estimation procedure is proposed for mixtures of normals with unknown number of components to achieve both the order selection consistency and the root -n convergence rate for the component pa-rameters estimators. We show that the proposed new estimator could avoid being trapped in certain degenerated regions of the nonidentifiable subset of the parameter space for over-fitted normal mixture models so that a reg-ular asymptotic quadratic Taylor expansion of the mixture log-likelihood could be derived. With a suitable penalty function on mixing proportions, the new estimator is proved to be consistent on the order selection, and have an asymptotic normal distribution. Our derived sparsity conditions also reveal some surprising but interesting differences among some com-monly used penalty functions and explain why the performance of some popularly used penalty functions, such as Lasso and SCAD, provide un-satisfactory results in the order selection. Extensive simulations and a real data analysis are conducted to demonstrate the effectiveness of the newly proposed estimator.

关键词： Normal mixture model penalized estimation order selection em algorithm

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：