Tweedie's compound Poisson model is a popular method for modeling insurance claims with a probability mass at zero and a nonnegative, highly right-skewed distribution. In particular, it is not uncommon to have extremely unbalanced data with an excessively large proportion of zero claims, and even the traditional Tweedie model may not be satisfactory for fitting such data. In this paper, we propose a boosting-assisted zero-inflated Tweedie model, called EMTboost, that allows the zero probability mass to exceed that of a traditional Tweedie model. We make a nonparametric assumption on its Tweedie model component, which, unlike a linear model, is able to capture nonlinearities, discontinuities, and complex higher-order interactions among predictors. A specialized expectation-maximization (EM) algorithm is developed that integrates a blockwise coordinate descent strategy and a gradient tree-boosting algorithm to estimate key model parameters. We use extensive simulation and data analysis on synthetic zero-inflated auto-insurance claim data to illustrate our method's prediction performance.
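As a rough illustration of the EM structure behind such zero-inflated models (not the EMTboost procedure itself, whose M-step fits the Tweedie component by gradient tree boosting), the sketch below runs EM for a zero-inflated Poisson stand-in: the E-step computes the responsibility that each observed zero is a structural zero, and the M-step updates the zero-mass probability and the component mean in closed form. The Poisson simplification and all settings are illustrative assumptions.

```python
import numpy as np
from scipy.stats import poisson

def em_zero_inflated_poisson(y, n_iter=200, tol=1e-8):
    """EM for a zero-inflated Poisson: pi is the structural-zero probability,
    lam the Poisson mean (a simplified stand-in for the Tweedie component)."""
    pi, lam = 0.5, max(y.mean(), 1e-3)
    for _ in range(n_iter):
        # E-step: responsibility that a zero observation is a structural zero
        p0 = poisson.pmf(0, lam)
        r = np.where(y == 0, pi / (pi + (1 - pi) * p0), 0.0)
        # M-step: closed-form updates given the responsibilities
        pi_new, lam_new = r.mean(), ((1 - r) * y).sum() / (1 - r).sum()
        if abs(pi_new - pi) + abs(lam_new - lam) < tol:
            return pi_new, lam_new
        pi, lam = pi_new, lam_new
    return pi, lam

rng = np.random.default_rng(0)
z = rng.random(5000) < 0.3                       # structural zeros
y = np.where(z, 0, rng.poisson(2.0, 5000))       # observed "claims"
print(em_zero_inflated_poisson(y))               # roughly (0.3, 2.0)
```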
ISBN (print): 9780735412415
The Burr Type III distribution has been applied in the study of income, wage, and wealth. It is suitable for fitting lifetime data since it has flexible shape and controllable scale parameters. The Burr Type III distribution is popular in part because it encompasses the characteristics of other distributions such as the logistic and exponential. It comes in two forms: a two-parameter distribution with two shape parameters, and a three-parameter distribution with one scale and two shape parameters. The expectation-maximization (EM) algorithm is selected in this paper to estimate the two- and three-parameter Burr Type III distributions. Complete and censored data are simulated based on the derived parametric forms of the pdf and cdf of the Burr Type III distribution. The EM estimates are then compared with estimates from the maximum likelihood estimation (MLE) approach through the mean square error, to determine which approach yields estimates closer to the true parameters. The results show that the EM estimates perform better than the MLE estimates for the two- and three-parameter Burr Type III distributions for both complete and censored data.
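For context on the MLE side of such a comparison, here is a hedged sketch that fits the two-parameter Burr Type III distribution, with cdf F(x) = (1 + x^(-c))^(-k), to right-censored data by directly maximizing the censored log-likelihood; failures contribute log f(x) and censored points log(1 - F(x)). The sample size, censoring rule, and starting values are assumptions for illustration, not the paper's settings.

```python
import numpy as np
from scipy.optimize import minimize

def burr3_logpdf(x, c, k):
    # Two-parameter Burr Type III: F(x) = (1 + x**(-c))**(-k), x > 0
    return np.log(c) + np.log(k) - (c + 1) * np.log(x) - (k + 1) * np.log1p(x ** (-c))

def burr3_logsf(x, c, k):
    # log survival function, log(1 - F(x)), used for right-censored points
    return np.log1p(-(1.0 + x ** (-c)) ** (-k))

def negloglik(theta, x, censored):
    c, k = np.exp(theta)                          # log scale keeps c, k positive
    return -(burr3_logpdf(x[~censored], c, k).sum()
             + burr3_logsf(x[censored], c, k).sum())

# Simulate by inverse-CDF sampling, then right-censor at a fixed time
rng = np.random.default_rng(1)
c_true, k_true = 2.0, 1.5
u = rng.random(2000)
x = (u ** (-1.0 / k_true) - 1.0) ** (-1.0 / c_true)
t_cens = np.quantile(x, 0.8)                      # censor the largest 20%
censored = x > t_cens
x_obs = np.minimum(x, t_cens)

fit = minimize(negloglik, x0=np.log([1.0, 1.0]), args=(x_obs, censored),
               method="Nelder-Mead")
print(np.exp(fit.x))                              # should be near (2.0, 1.5)
```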
We study the strong consistency of the maximum likelihood estimator under a special finite mixture of two-parameter Gamma distributions. Somewhat surprisingly, the likelihood function under a Gamma mixture with a set of independent and identically distributed observations is unbounded. There exist many sets of nonsensical parameter values at which the likelihood value is arbitrarily large. This leads to an inconsistent, or arguably undefined, maximum likelihood estimator. Interestingly, when the scale or shape parameter in the finite Gamma mixture model is structural, the maximum likelihood estimator of the mixing distribution is well defined and strongly consistent. Establishing consistency when the shape parameter is structural is technically less challenging and has already been given in the literature. In this paper, we prove the consistency when the scale parameter is structural and provide some illustrative simulation experiments. We further include an application of the model with a structural scale parameter to salary potential data. We conclude that the Gamma mixture distribution with a structural scale parameter provides another flexible yet relatively parsimonious model for observations with intrinsic positive values.
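The unboundedness can be illustrated numerically: letting one Gamma component "spike" at a single observation (shape a growing while the scale is fixed at x1/a) drives the mixture likelihood arbitrarily high, which is exactly the degeneracy that a structural (common) scale parameter rules out. A minimal sketch, with all settings chosen purely for illustration:

```python
import numpy as np
from scipy.stats import gamma

def mixture_loglik(x, w, a1, s1, a2, s2):
    # Two-component Gamma mixture: shapes a1, a2; scales s1, s2; weight w
    comp = w * gamma.pdf(x, a1, scale=s1) + (1 - w) * gamma.pdf(x, a2, scale=s2)
    return np.log(comp).sum()

rng = np.random.default_rng(2)
x = gamma.rvs(2.0, scale=1.0, size=100, random_state=rng)

# Spike one component at the first observation: as the shape grows with
# scale = x[0]/shape, that component's density at x[0] grows without bound,
# so the mixture log-likelihood can be made arbitrarily large.
for a_spike in [1e2, 1e4, 1e6]:
    print(a_spike, mixture_loglik(x, 0.5, a_spike, x[0] / a_spike, 2.0, 1.0))
```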
Problem definition: We address the problem of how to estimate lost sales for substitutable products when there is no reliable on-shelf availability (OSA) information. Academic/practical relevance: We develop a novel approach to estimating lost sales using only sales data, a market share estimate, and an estimate of overall availability. We use the method to illustrate the negative consequences of using potentially inaccurate inventory records as indicators of availability. Methodology: We suggest a partially hidden Markov model of OSA to generate probabilistic choice sets and incorporate these probabilistic choice sets into the estimation of a multinomial logit demand model using a nested expectation-maximization (EM) algorithm. We highlight the importance of considering inventory reliability problems, first through simulation and then by applying the procedure to a data set from a major U.S. retailer. Results: The simulations show that the method converges in seconds and produces estimates with similar or lower bias than state-of-the-art benchmarks. For the product category under consideration at the retailer, our procedure finds lost sales of around 3.0%, compared with 0.2% when relying on the inventory record as an indicator of availability. Managerial implications: The method efficiently computes estimates that can be used to improve inventory management and guide managers on how to use their scarce resources to improve stocking execution. The research also shows that ignoring inventory record inaccuracies when estimating lost sales can produce substantially inaccurate estimates, which leads to incorrect parameters in supply chain planning.
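As a simplified sketch of how probabilistic choice sets can enter a multinomial logit demand model (leaving aside the paper's partially hidden Markov model of OSA and the nested EM estimation), the example below computes expected MNL choice probabilities by averaging over possible on-shelf availability patterns; the utilities and availability probabilities are hypothetical.

```python
import numpy as np
from itertools import product

def choice_probs_probabilistic_sets(v, p_avail):
    """Expected MNL choice probabilities when product j is on shelf independently
    with probability p_avail[j]; option 0 is the always-available no-purchase
    alternative with utility 0."""
    n = len(v)
    probs = np.zeros(n + 1)                                    # index 0 = no purchase
    for avail in product([0, 1], repeat=n):                    # enumerate choice sets
        a = np.array(avail)
        w = np.prod(np.where(a == 1, p_avail, 1 - p_avail))    # P(this choice set)
        expv = np.concatenate(([1.0], a * np.exp(v)))          # unavailable -> weight 0
        probs += w * expv / expv.sum()
    return probs

v = np.array([0.5, 0.2, -0.1])            # hypothetical MNL utilities
p_avail = np.array([0.95, 0.80, 0.60])    # hypothetical availability probabilities
print(choice_probs_probabilistic_sets(v, p_avail))   # sums to 1
```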
In field reliability analyses, a data collection period is given to monitor the failure events from the field. Left-truncation arises due to early failures occurring before the data collection period, and right-censoring arises for late failures occurring beyond the monitoring period. Naive analyses of left-truncated and right-censored data lead to biased estimation of the population lifetime of interest. A variety of models and methods have been developed to analyze the left-truncated and right-censored data for field reliability analyses. The goal of the paper is to review the existing models and methods for fitting left-truncated and right-censored data. Our review includes the existing statistical models, such as the exponential, Weibull, lognormal, gamma, Gompertz, Lomax, and spline models. We comprehensively review the statistical issues of maximum likelihood estimation, model selection, residual lifetime prediction, and Bayesian methods. Some of these methods are illustrated through the field reliability analysis of the electric power transformer dataset.
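As a minimal parametric example of the likelihood construction these methods share: with left truncation at tau, a failure at t contributes f(t)/S(tau) and a right-censored unit contributes S(t)/S(tau). The Weibull sketch below fits simulated left-truncated, right-censored data by direct maximization; the simulation settings are assumptions for illustration, not any of the reviewed case studies.

```python
import numpy as np
from scipy.optimize import minimize

def weibull_logpdf(t, k, lam):
    return np.log(k) - k * np.log(lam) + (k - 1) * np.log(t) - (t / lam) ** k

def weibull_logsf(t, k, lam):
    return -(t / lam) ** k

def negloglik_ltrc(theta, t, trunc, event):
    """Failures contribute log f(t) - log S(trunc); censored units contribute
    log S(t) - log S(trunc), i.e. everything is conditioned on surviving past
    the left-truncation time."""
    k, lam = np.exp(theta)
    ll = np.where(event, weibull_logpdf(t, k, lam), weibull_logsf(t, k, lam))
    return -(ll - weibull_logsf(trunc, k, lam)).sum()

rng = np.random.default_rng(3)
k_true, lam_true = 1.8, 10.0
t_all = lam_true * rng.weibull(k_true, size=20000)
trunc = rng.uniform(0.0, 5.0, size=20000)        # time already in service at study start
keep = t_all > trunc                             # early failures are never observed
t, trunc = t_all[keep], trunc[keep]
end = trunc + 8.0                                # fixed monitoring window
event = t <= end
t_obs = np.minimum(t, end)

fit = minimize(negloglik_ltrc, np.log([1.0, 5.0]), args=(t_obs, trunc, event),
               method="Nelder-Mead")
print(np.exp(fit.x))                             # close to (1.8, 10.0)
```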
Doubly censored data often arise in medical and epidemiological studies when observations are subject to both left censoring and right censoring. In this article, based on doubly censored data, we consider maximum likelihood estimation for the Cox-Aalen model with fixed covariates. By treating left-censored observations as missing, we propose expectation-maximization (EM) algorithms for obtaining the maximum likelihood estimators (MLE) of the regression coefficients of the Cox-Aalen model. We establish the asymptotic properties of the MLE. Simulation studies show that the MLE obtained via the EM algorithms performs well.
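A toy parametric analogue of the "treat left-censored observations as missing" idea, far simpler than the semiparametric Cox-Aalen estimator studied in the article: for an exponential lifetime, the E-step replaces each censored lifetime by its conditional expectation and the M-step updates the rate in closed form. All quantities below are illustrative assumptions.

```python
import numpy as np

def em_exponential_doubly_censored(t, status, n_iter=500, tol=1e-10):
    """status: 1 = exact time, 2 = right-censored at t, 3 = left-censored at t.
    EM treats the unobserved lifetimes as missing data."""
    lam = 1.0 / t.mean()
    for _ in range(n_iter):
        # E-step: conditional expectations of the latent lifetimes
        e_t = np.where(status == 2, t + 1.0 / lam, t)                   # E[T | T > t]
        e_left = 1.0 / lam - t * np.exp(-lam * t) / (1.0 - np.exp(-lam * t))
        e_t = np.where(status == 3, e_left, e_t)                        # E[T | T < t]
        lam_new = len(t) / e_t.sum()                                    # M-step
        if abs(lam_new - lam) < tol:
            return lam_new
        lam = lam_new
    return lam

rng = np.random.default_rng(6)
T = rng.exponential(2.0, 3000)                    # true mean lifetime 2.0
L, R = 0.5, 5.0                                   # left- and right-censoring limits
t = np.clip(T, L, R)
status = np.where(T < L, 3, np.where(T > R, 2, 1))
print(1.0 / em_exponential_doubly_censored(t, status))   # roughly 2.0
```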
Multivariate interval-censored data arise when each subject under study can potentially experience multiple events and the onset time of each event is not observed exactly but is known to lie in a certain time interval formed by adjacent examination times with changed statuses of the event. This type of incomplete and complex data structure poses a substantial challenge in practical data analysis. In addition, many potential risk factors exist in numerous studies. Thus, conducting variable selection for event-specific covariates simultaneously becomes useful in identifying important variables and assessing their effects on the events of interest. In this paper, we develop a variable selection technique for multivariate interval-censored data under a general class of semiparametric transformation frailty models. The minimum information criterion (MIC) method is embedded in the optimization step of the proposed expectation-maximization (EM) algorithm to obtain the parameter estimator. The proposed EM algorithm greatly reduces the computational burden of maximizing the observed likelihood function, and the MIC naturally avoids selecting the optimal tuning parameter as needed by many other popular penalties, making the proposed algorithm promising and reliable. The proposed method is evaluated through extensive simulation studies and illustrated by an analysis of patient data from the Aerobics Center Longitudinal Study.
ISBN (print): 9781479948604
In this paper, we study techniques for blast wave field reconstruction based on tomography. The overpressure field is reconstructed by inverting the velocity field in the process of shock wave transmission. Since the reconstruction is difficult due to the insufficient number of excitation sources and detectors, we propose an EM algorithm based on prior information. Appropriate models are constructed using the proposed methods, and a simulation example is presented. The results reveal that, compared with traditional methods, this method has higher precision and converges faster. They also show the validity and practicality of the developed algorithm in solving the problem of incomplete data reconstruction.
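For reference, the classical ML-EM (MLEM) iteration, a standard EM approach to tomographic reconstruction with Poisson-distributed measurements, can be sketched in a few lines. This is not the authors' prior-informed algorithm; the toy system matrix below is an assumption chosen only to mimic an under-determined (incomplete-data) setup with fewer rays than pixels.

```python
import numpy as np

def mlem(A, y, n_iter=200):
    """MLEM update for tomographic reconstruction with Poisson counts:
    x <- x / (A^T 1) * A^T (y / (A x)), with A the (rays x pixels) system matrix."""
    x = np.ones(A.shape[1])
    sens = A.T @ np.ones(A.shape[0])                  # sensitivity image A^T 1
    for _ in range(n_iter):
        proj = A @ x
        ratio = np.divide(y, proj, out=np.zeros_like(y), where=proj > 0)
        x *= (A.T @ ratio) / np.maximum(sens, 1e-12)
    return x

# Toy under-determined problem: 15 ray measurements for a 5x5 field (25 unknowns)
rng = np.random.default_rng(4)
A = rng.random((15, 25))
x_true = rng.gamma(2.0, 1.0, size=25)
y = rng.poisson(A @ x_true).astype(float)
x_hat = mlem(A, y, 500)
# With so few rays the field is only loosely recovered, illustrating why extra
# prior information is needed for incomplete-data reconstruction.
print(np.round(x_hat[:5], 2), np.round(x_true[:5], 2))
```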
Objectives: Oral cancer, also called oral squamous cell carcinoma (OSCC), has been one of the serious cancers affecting South Asian countries. A range of diagnostic strategies are available, including biopsy of the affected part. The Wnt/beta-catenin pathway plays important roles in morphogenesis, normal physiological functions, and tumor formation. This study examined the accumulation of beta-catenin in the nuclei and cytoplasm of oral cancer cells. Methods: The accuracy of histopathological results is hampered by considerable inter- and intra-reader variability, even among expert pathologists. In order to obtain both qualitative and quantitative results, we developed a system for the diagnosis of oral cancer using the expectation-maximization (EM) algorithm. Results: The microscopic images of immunohistochemical staining of beta-catenin expression were segmented using an iterative EM algorithm to extract the cellular and extracellular components of an image. The segmentation process uses unitone conversion to obtain the single-channel image with the highest contrast via Principal Component Analysis (PCA). Finally, the unitone image is normalized to the [0, 1] range. Conclusion: Based on the segmentation process, we conclude that analyzing beta-catenin expression with the EM algorithm is an efficient technique to help the pathologist evaluate the histological changes in microscopic images of oral cancer.
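A hedged sketch of the described pipeline, using scikit-learn as a stand-in implementation: PCA reduces the RGB pixels to a single "unitone" channel with the highest variance, the channel is normalized to [0, 1], and a Gaussian mixture fitted by EM segments the pixels into classes. The synthetic image and the number of classes are illustrative assumptions.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.mixture import GaussianMixture

def segment_em(rgb_image, n_classes=3):
    """PCA 'unitone' conversion to one channel, [0, 1] normalization, then
    EM-based (Gaussian mixture) clustering of the pixel intensities."""
    h, w, _ = rgb_image.shape
    pixels = rgb_image.reshape(-1, 3).astype(float)
    unitone = PCA(n_components=1).fit_transform(pixels)        # highest-variance channel
    unitone = (unitone - unitone.min()) / (unitone.max() - unitone.min() + 1e-12)
    gmm = GaussianMixture(n_components=n_classes, random_state=0).fit(unitone)
    return gmm.predict(unitone).reshape(h, w)

# Synthetic stand-in for a stained microscopy image
rng = np.random.default_rng(5)
img = rng.random((64, 64, 3))
labels = segment_em(img)
print(np.bincount(labels.ravel(), minlength=3))    # pixel counts per segment class
```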
In this article, we discuss the problem of point estimation of the three unknown parameters of a bivariate new extended Weibull distribution under complete and randomly right-censored samples. The expectation-maximization (EM) algorithm is used to estimate the unknown parameters. Simulation experiments are performed to assess the effectiveness of the estimators for complete and censored data. One dataset is considered to illustrate the practical utility of the article.