In real life, experimental units are often susceptible to more than one risk factor. Moreover, some experimental units may not fail even if they are observed over a long period of time. In statistical analysis, competing risks models handle the first scenario, while cure rate models have been introduced to analyze the long-term survivors in the population. In this paper, we consider a cure rate model in which the failure of a unit can be due to either of two competing causes. To analyze the competing risks data in the presence of long-term survivors, we adopt the latent failure times approach introduced by Cox (J R Stat Soc Ser B (Methodol) 21(2):411-421, 1959). The latent failure times are assumed to be independently exponentially distributed. Under this setup, a random censoring scheme is applied, and the observed data consist of either censored times or actual failure times along with the cause of failure. We derive the maximum likelihood estimators (MLEs) using the expectation-maximization (EM) algorithm based on the missing value principle. As the overall survival function is not a proper survival function, the asymptotic behavior of the MLEs is not immediate. We provide sufficient conditions for the existence, uniqueness, consistency and asymptotic normality of the MLEs. Monte Carlo simulations are performed to support the theoretical results numerically. For illustrative purposes, we analyze one real dataset, and the results are quite satisfactory.
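For orientation, here is a minimal sketch of the kind of improper population survival function such a setup produces, assuming the standard mixture cure formulation with cured proportion $p_0$ and two independent exponential latent failure times with rates $\lambda_1$ and $\lambda_2$ (the paper's exact parameterization may differ):

$$ S_{\mathrm{pop}}(t) \;=\; p_0 + (1 - p_0)\, e^{-(\lambda_1 + \lambda_2)\,t}, \qquad t \ge 0. $$

Because $S_{\mathrm{pop}}(t) \to p_0 > 0$ as $t \to \infty$, this is not a proper survival function, which is why the asymptotic behavior of the MLEs requires separate treatment.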
In this paper, we first illustrate the restricted empirical likelihood function as an alternative to the usual empirical likelihood. Then, we use this quasi-empirical likelihood function as a basis for Bayesian analysis of AR(r) time series models. The efficiency of both the posterior computation algorithm, when the estimating equations are linear functions of the parameters, and the EM algorithm for estimating hyper-parameters is an appealing property of our proposed approach. Moreover, the competitive finite-sample performance of the proposed method is illustrated via both a simulation study and the analysis of a real dataset. © 2021 The Authors. Published by Atlantis Press B.V.
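As background for the contrast drawn above, the usual empirical likelihood for a parameter $\theta$ identified by estimating equations $g$ is

$$ L_{EL}(\theta) \;=\; \max\Big\{ \prod_{i=1}^{n} p_i \;:\; p_i \ge 0,\; \sum_{i=1}^{n} p_i = 1,\; \sum_{i=1}^{n} p_i\, g(X_i, \theta) = 0 \Big\}; $$

for an AR model the $g$ would typically be least-squares or Yule-Walker-type moment conditions, which is what makes the estimating equations linear in the parameters. The restricted empirical likelihood proposed in the paper modifies this construction; its exact form is not reproduced here.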
In healthcare economics, count datasets often exhibit excessive zeros or right-skewed tails. When covariates are available, such datasets are typically modelled using zero-inflated (ZI) or finite mixture (FM) regression models. However, neither model performs adequately when the dataset has both excessive zeros and a long tail, which is often the case in practice. In this paper we combine these two models to create a more flexible, versatile class of ZIFM models. With this model we perform a comprehensive analysis of the number of visits to a physician's office using the US healthcare demand dataset that has been used in numerous healthcare studies in the literature. After comparison with existing models that have been reported to perform well on this dataset, we find that the ZIFM model substantially outperforms the alternatives. In addition, the model offers a new interpretation that contrasts with previous empirical findings regarding the factors associated with the demand for physicians, which can shed fresh light on healthcare utilisation policies.
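A hedged sketch of the general form such a combined model takes, writing $\pi_i$ for the covariate-dependent zero-inflation probability, $w_k$ for the mixing weights and $f_k$ for the $K$ count-component densities (the component families and link functions used in the paper may differ):

$$ P(Y_i = y \mid x_i) \;=\; \begin{cases} \pi_i + (1 - \pi_i) \sum_{k=1}^{K} w_k\, f_k(0 \mid x_i), & y = 0,\\[4pt] (1 - \pi_i) \sum_{k=1}^{K} w_k\, f_k(y \mid x_i), & y = 1, 2, \ldots \end{cases} $$

The zero-inflation part absorbs the excess zeros while the finite mixture accommodates the long right tail.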
The problem of fitting a folded normal distribution by maximum likelihood has been described as 'not straightforward', and alternatives such as the EM algorithm have been proposed. We suggest here that it is in fact straightforward to fit such a distribution by direct numerical maximization of the likelihood. We demonstrate this in an example. The relevant R code is included.
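The paper's R code is not reproduced here; the following is a hedged Python analogue of the same idea, maximizing the folded normal log-likelihood directly with a general-purpose optimizer on simulated data.

```python
# Direct numerical maximization of the folded normal likelihood (illustrative sketch).
import numpy as np
from scipy.optimize import minimize
from scipy.stats import norm

rng = np.random.default_rng(1)
x = np.abs(rng.normal(loc=2.0, scale=1.5, size=200))   # simulated folded normal sample

def neg_log_lik(params, x):
    mu, sigma = params
    # folded normal density on x >= 0: (1/sigma)[phi((x-mu)/sigma) + phi((x+mu)/sigma)]
    dens = norm.pdf(x, mu, sigma) + norm.pdf(x, -mu, sigma)
    return -np.sum(np.log(dens))

res = minimize(neg_log_lik, x0=[x.mean(), x.std()], args=(x,),
               method="L-BFGS-B", bounds=[(None, None), (1e-6, None)])
print("MLE (mu, sigma):", res.x)
```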
In this paper, the maximum likelihood estimates (MLEs) of the parameters of a finite mixture of modified Weibull (MW(alpha, beta, gamma)) distributions are obtained based on type-I and type-II censored samples using the EM algorithm. A simulation study is carried out to study the behavior of the mean squared errors. A real data set is introduced and analyzed using a mixture of two MW distributions and also using a mixture of two Weibull(alpha, beta) distributions. A comparison is carried out between these mixtures based on the corresponding Kolmogorov-Smirnov (K-S) test statistic to show that the MW mixture model fits the data better than the other mixture model.
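For orientation, the generic censored-data likelihood that such an EM fit maximizes, written with a censoring indicator $\delta_i$ (1 for an observed failure, 0 for a censored time) and a two-component mixture density; the specific MW parameterization and the type-I/type-II censoring details follow the paper:

$$ L(\Theta) \;=\; \prod_{i=1}^{n} f(t_i; \Theta)^{\delta_i}\, S(t_i; \Theta)^{1-\delta_i}, \qquad f(t; \Theta) = p\, f_{MW}(t; \theta_1) + (1-p)\, f_{MW}(t; \theta_2), $$

where $S$ is the corresponding survival function. The EM algorithm augments each observation with its latent component membership, which makes the M-step separate into component-wise updates.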
In lifetime studies, on many occasions, a proportion of individuals may experience the event of interest at the beginning of the study itself, while another group of individuals may not experience the event of interes...
Owing to the strong control bedrock geology may exert on the chemical composition of stream sediments, the determination of stream sediment geochemical anomalies is always affected by the lithological background in areas with variable lithologies. In this study, the expectation-maximization (EM) algorithm was used to separate lithologies of different chemical compositions in a 1:200 000 scale regional geochemical data set of stream sediments in a lithologically complex region of Hunan province, SE China. The data set included 1024 minerogenic stream sediment samples which were analysed for Cu, La, Li, Be, Cr, Ni, Sr, V, Th, Ti and Zr. A comparison was carried out between Cu anomalies determined with and without taking the separation of lithologies into account. The results show that stream sediment geochemical anomalies in lithologically complex regions can be determined more reasonably by applying the EM clustering method: strong but false or meaningless anomalies can be eliminated, and weak but important or meaningful anomalies are revealed more clearly.
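An illustrative sketch of the general workflow (not the authors' exact procedure): EM-based Gaussian mixture clustering of log-transformed multi-element data, followed by per-cluster anomaly thresholds. The synthetic data, number of components and 2-standard-deviation threshold below are assumptions.

```python
# Cluster multi-element stream sediment chemistry with an EM-fitted Gaussian mixture,
# then flag Cu anomalies relative to each cluster's own background rather than a
# single survey-wide threshold (illustrative sketch on synthetic data).
import numpy as np
import pandas as pd
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
elements = ["Cu", "La", "Li", "Be", "Cr", "Ni", "Sr", "V", "Th", "Ti", "Zr"]

# stand-in for the analysed samples: two synthetic "lithologies" with different backgrounds
bg1 = rng.lognormal(mean=3.0, sigma=0.4, size=(600, len(elements)))
bg2 = rng.lognormal(mean=3.8, sigma=0.4, size=(424, len(elements)))
df = pd.DataFrame(np.vstack([bg1, bg2]), columns=elements)

# EM clustering on log-transformed concentrations; each component proxies one lithology
X = np.log10(df[elements].to_numpy())
gmm = GaussianMixture(n_components=2, covariance_type="full", random_state=0).fit(X)
df["cluster"] = gmm.predict(X)

# Cu anomalies defined per cluster: beyond mean + 2 s.d. of that cluster's background
stats = df.groupby("cluster")["Cu"].agg(["mean", "std"])
threshold = df["cluster"].map(stats["mean"] + 2 * stats["std"])
df["Cu_anomaly"] = df["Cu"] > threshold
print(df.groupby("cluster")["Cu_anomaly"].mean())
```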
Conditional correlation networks, within Gaussian Graphical Models (GGM), are widely used to describe the direct interactions between the components of a random vector. In the case of an unlabelled heterogeneous popul...
Hidden Markov chain or Markov field models with observations in a Euclidean space play a major role across signal and image processing. The present work provides a statistical framework which can be used to extend these models, along with related, popular algorithms (such as the Baum-Welch algorithm), to the case where the observations lie in a Riemannian manifold. It is motivated by the potential use of hidden Markov chains and fields, with observations in Riemannian manifolds, as models for complex signals and images. Copyright © 2021 The Authors.
ISBN (print): 9781665403450
Incomplete data sets are a problem in most studies; however, few studies have realised that imputation is a solution to this problem. Incomplete data can have a significant effect on the conclusions drawn and decisions made. To solve the problem of incomplete data, one should use techniques to recover the missing values, depending on how much data is missing, how large the data set is, how the data went missing, and so on. In this report, we aimed to compare the performance of the EM algorithm and matrix completion when imputing missing values for varying degrees of missingness. Kullback-Leibler (KL) divergence was used as an evaluation metric to assess the performance of the expectation-maximization (EM) algorithm and matrix completion when estimating missing values relative to the ground-truth distribution. The findings of this research show that the EM algorithm outperformed matrix completion in both the theoretical model (simulated scenarios of learning from varying degrees of missing data) and the application model (application of the theoretical model to real-world data on credit card fraud). A few similarities between the algorithms were observed when recovering missing values, such as the increasing trend of error as the proportion of missing values increases and the impact of an increasing number of variables in a data set. Matrix completion only performed better when the proportion of missing values exceeded approximately 75%. Therefore, from our findings, we conclude that when less than 50% of the data is missing, the EM algorithm produces accurate predictions. The EM algorithm performed better than matrix completion because it first learned the data distribution and used maximum likelihood procedures to estimate the parameters of the model, whereas matrix completion analysed the existing patterns in rows and columns and imputed the missing values using the patterns learned from the data.
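A hedged sketch of the kind of comparison described above, restricted to the EM side: EM imputation under a multivariate Gaussian working model, evaluated with a histogram-based KL divergence against the ground truth. The data, missingness level and KL discretisation below are illustrative assumptions, and the matrix-completion baseline is not reproduced.

```python
# EM imputation for data assumed multivariate Gaussian, evaluated with a
# histogram-based KL divergence against the ground-truth data (illustrative sketch).
import numpy as np
from scipy.stats import entropy

rng = np.random.default_rng(0)

def em_gaussian_impute(X, n_iter=50):
    """Impute NaNs assuming rows are i.i.d. multivariate Gaussian."""
    X = X.copy()
    miss = np.isnan(X)
    n, d = X.shape
    # crude start: column means for missing cells, empirical mean/covariance
    col_means = np.nanmean(X, axis=0)
    X[miss] = np.take(col_means, np.where(miss)[1])
    mu, cov = X.mean(axis=0), np.cov(X, rowvar=False)
    for _ in range(n_iter):
        corr = np.zeros((d, d))          # accumulated conditional covariances
        for i in range(n):
            m = miss[i]
            if not m.any():
                continue
            o = ~m
            coo_inv = np.linalg.pinv(cov[np.ix_(o, o)])
            # E-step: conditional mean of the missing block given the observed block
            X[i, m] = mu[m] + cov[np.ix_(m, o)] @ coo_inv @ (X[i, o] - mu[o])
            cmm = cov[np.ix_(m, m)] - cov[np.ix_(m, o)] @ coo_inv @ cov[np.ix_(o, m)]
            corr[np.ix_(m, m)] += cmm
        # M-step: update mean and covariance, adding the conditional covariance term
        mu = X.mean(axis=0)
        diff = X - mu
        cov = (diff.T @ diff + corr) / n
    return X

def kl_to_truth(truth, imputed, bins=30):
    """Histogram-based KL divergence, averaged over columns (illustrative only)."""
    kls = []
    for j in range(truth.shape[1]):
        lo, hi = truth[:, j].min(), truth[:, j].max()
        p, edges = np.histogram(truth[:, j], bins=bins, range=(lo, hi), density=True)
        q, _ = np.histogram(imputed[:, j], bins=edges, density=True)
        kls.append(entropy(p + 1e-12, q + 1e-12))
    return float(np.mean(kls))

# toy experiment: 25% of entries missing completely at random
truth = rng.multivariate_normal([0, 1, -1],
                                [[1, .5, .2], [.5, 1, .3], [.2, .3, 1]], size=500)
X = truth.copy()
X[rng.random(X.shape) < 0.25] = np.nan
print("KL(truth || EM-imputed) ~", kl_to_truth(truth, em_gaussian_impute(X)))
```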