We propose a heteroscedastic replicated measurement error model based on the class of scale mixtures of skew-normal distributions, which allows the variances of measurement errors to vary across subjects. We develop EM algorithms to calculate maximum likelihood estimates for the model with and without equation error. An empirical Bayes approach is applied to estimate the true covariate and predict the response. Simulation studies show that the proposed models provide reliable results and that the inference is not unduly affected by outliers or distribution misspecification. The method is also applied to real data on plant root decomposition.
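The empirical Bayes step can be illustrated with a minimal normal-theory sketch — a simplification to normal (rather than scale-mixture-of-skew-normal) errors, with illustrative names: with replicates W_ij = x_i + u_ij and subject-specific error variances estimated from the replicate spread, the posterior mean of x_i is a precision-weighted average of the prior mean and the subject mean.

```python
import numpy as np

def empirical_bayes_x(W, mu, tau2):
    """Empirical-Bayes estimate of the true covariate x_i from replicated
    measurements W[i, :], assuming x_i ~ N(mu, tau2) and W_ij = x_i + u_ij
    with heteroscedastic, subject-specific error variance sigma2_i
    (estimated here from the replicate spread)."""
    n, k = W.shape
    wbar = W.mean(axis=1)                    # subject-level mean
    sigma2 = W.var(axis=1, ddof=1)           # per-subject error variance
    # posterior mean: precision-weighted average of mu and wbar
    post_prec = 1.0 / tau2 + k / sigma2
    return (mu / tau2 + k * wbar / sigma2) / post_prec

rng = np.random.default_rng(0)
x = rng.normal(0.0, 1.0, size=200)                    # true covariate
sig = rng.uniform(0.3, 1.5, size=200)                 # per-subject error sd
W = x[:, None] + rng.normal(0.0, 1.0, (200, 4)) * sig[:, None]
xhat = empirical_bayes_x(W, mu=0.0, tau2=1.0)
```

Subjects with noisier replicates are shrunk more strongly toward the prior mean, which is the point of letting the error variance vary across subjects.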
Models for dealing with survival data in the presence of a cured fraction of individuals have attracted the attention of many researchers and practitioners in recent years. In this paper, we propose a cure rate model under the competing risks scenario. For the number of causes that can lead to the event of interest, we assume the polylogarithm distribution. The model is flexible in the sense that it encompasses some well-known models, which can be tested using large-sample test statistics applied to nested models. Maximum likelihood estimation based on the EM algorithm and hypothesis testing are investigated. Results of simulation studies designed to gauge the performance of the estimation method and of two test statistics are reported. The methodology is applied to the analysis of a real data set.
In this paper, we propose a fast image denoising method based on discrete Markov random fields and the fast Fourier transform. The purpose of image denoising is to infer the original noiseless image from a noise-corrupted image. We consider the case where several noisy images are available for inferring the original image, and the Bayesian approach is adopted to obtain the posterior probability distribution of the denoised image. In the proposed method, the estimate of the denoised image is computed using belief propagation and an expectation-maximization (EM) algorithm. We numerically verify the performance of the proposed method using several standard images.
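A hedged sketch of the multi-image Bayesian setting: substituting a Gaussian MRF (smoothness) prior for the paper's discrete MRF makes the posterior mean available in closed form in the Fourier domain, so no belief propagation is needed — this shows where the FFT enters, not the paper's actual estimator.

```python
import numpy as np

def gmrf_denoise(stack, lam, sigma2):
    """Posterior-mean estimate of one image from K noisy copies under a
    Gaussian MRF prior; minimises sum_k ||y_k - x||^2 / sigma2 + lam ||grad x||^2,
    solved exactly in the Fourier domain."""
    K = stack.shape[0]
    ybar = stack.mean(axis=0)
    H, Wd = ybar.shape
    fy = np.fft.fftfreq(H)[:, None]
    fx = np.fft.fftfreq(Wd)[None, :]
    # eigenvalues of the periodic 2-D Laplacian (the MRF precision)
    lap = 4 - 2 * np.cos(2 * np.pi * fy) - 2 * np.cos(2 * np.pi * fx)
    Xhat = K * np.fft.fft2(ybar) / (K + lam * sigma2 * lap)
    return np.fft.ifft2(Xhat).real

rng = np.random.default_rng(1)
true = np.add.outer(np.sin(np.linspace(0, 3, 64)), np.cos(np.linspace(0, 3, 64)))
stack = true[None] + rng.normal(0.0, 0.5, (8, 64, 64))   # 8 noisy copies
den = gmrf_denoise(stack, lam=2.0, sigma2=0.25)
```

Averaging the K copies already reduces the noise by a factor of K; the prior term then smooths what remains.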
Background: Modeling thousands of markers simultaneously has been of great interest for testing association between genetic biomarkers and disease or disease-related quantitative traits. Recently, an expectation-maximization (EM) approach to Bayesian variable selection (EMVS), facilitating the Bayesian computation, was developed for continuous or binary outcomes using a fast EM algorithm. However, it is not suitable for analyses of time-to-event outcomes in many public databases such as The Cancer Genome Atlas (TCGA).
Results: We extended EMVS to a high-dimensional parametric survival regression framework (SurvEMVS). A variant of the cyclic coordinate descent (CCD) algorithm was used for efficient iteration in the M-step, and the extended Bayesian information criterion (EBIC) was employed for hyperparameter tuning. We evaluated the performance of SurvEMVS using numerical simulations and illustrated its effectiveness on two real datasets. The results of the numerical simulations and the two real data analyses show that SurvEMVS performs well in terms of accuracy and computation. Some potential markers associated with survival of lung or stomach cancer were identified. These results suggest that our model is effective and can cope with high-dimensional omics data.
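The EBIC used here for hyperparameter tuning has a simple closed form, EBIC_γ = -2ℓ + k·log(n) + 2γ·k·log(p), where k is the number of selected markers and p the size of the candidate set. A minimal sketch (the function name and the γ = 0.5 default are illustrative assumptions):

```python
import math

def ebic(loglik, k, n, p, gamma=0.5):
    """Extended Bayesian information criterion: standard BIC plus an
    extra 2*gamma*k*log(p) penalty for searching a size-p candidate set;
    smaller is better."""
    return -2.0 * loglik + k * math.log(n) + 2.0 * gamma * k * math.log(p)
```

With k = 0 the criterion reduces to -2ℓ, and for fixed k the penalty grows with p, which is what guards against spurious selections in very high dimensions.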
The maximum tolerated dose (MTD) is commonly used for dose selection in oncology, higher doses being taken as the most effective treatment. Conventionally, doses for phase 2 are selected from an earlier-phase trial. However, quality of life may be compromised by the high doses of chemotherapeutic regimens in the early phase of the trial. An alternative mode of chemotherapy administration is Metronomic Chemotherapy (MC), in which very low doses are administered to avoid high toxicity. This work addresses the handling of missing data on circulating endothelial cells (CEC) and supports the optimal biological dose (OBD) for MC. It is performed with mimicked data, following a data simulation strategy implemented in R, which helps to identify a suitable technique for handling the missing data. The results indicate that MC is efficacious, improving Progression-Free Survival (PFS) and Overall Survival (OS) through controlled toxicity. The illustrated example can be extended to explore the impact of CEC on OS. This is a preliminary attempt to address some critical issues of MC.
The Conway-Maxwell-Poisson (COM-Poisson) distribution is useful for accounting for a cure proportion in survival data. For this model, two computational approaches for calculating maximum likelihood estimates have been developed in the literature: one based on the method in the gamlss R package, which employs the first-order derivatives of the log-likelihood, and the other based on the EM algorithm, which employs the complete-data likelihood. In this paper, we propose a robust version of the Newton-Raphson (NR) algorithm, where the robustness is introduced by random perturbations of the initial values and by log-transformations of the positive parameters. We provide the expressions for the derivatives of the log-likelihood under the Bernoulli cure model and computer codes for implementation. Since the NR algorithm employs the first- and second-order derivatives of the log-likelihood, it converges more quickly than the method of the gamlss R package. We also review the EM algorithms and compare the computational performance of the NR and EM algorithms via simulations. We also fit a novel dataset to the COM-Poisson cure model and discuss the consequences of applying the two algorithms.
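The two robustness devices — a log-transformation of a positive parameter and randomly perturbed starting values — can be sketched on a toy problem where the answer is known in closed form (a Poisson rate MLE, not the COM-Poisson cure model itself; the step-clipping guard is an added assumption for numerical safety):

```python
import math
import random

def robust_nr_poisson(y, n_starts=5, tol=1e-10, seed=0):
    """Newton-Raphson MLE of a Poisson rate lambda, working on
    theta = log(lambda) so every iterate stays in the positive domain,
    with several randomly perturbed starts; the run with the best
    log-likelihood is kept."""
    rng = random.Random(seed)
    n, S = len(y), sum(y)
    best_theta, best_ll = None, -math.inf
    for _ in range(n_starts):
        theta = math.log(max(S / n, 1e-8)) + rng.gauss(0.0, 1.0)  # perturbed start
        for _ in range(100):
            grad = S - n * math.exp(theta)      # d loglik / d theta
            hess = -n * math.exp(theta)         # second derivative (< 0)
            step = grad / hess
            step = max(-5.0, min(5.0, step))    # clip huge early steps
            theta -= step
            if abs(step) < tol:
                break
        ll = S * theta - n * math.exp(theta)    # loglik up to a constant
        if ll > best_ll:
            best_theta, best_ll = theta, ll
    return math.exp(best_theta)                 # back-transform to lambda

lam_hat = robust_nr_poisson([2, 3, 1, 4, 2, 3])
```

For this toy target the MLE is the sample mean (2.5), so every perturbed start should converge to the same optimum — which is exactly the behaviour the perturbations are meant to check.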
In this article we propose a multiple-inflation Poisson regression to model count response data containing excessive frequencies at more than one non-negative integer value. To handle multiple excessive count responses, we generalize the zero-inflated Poisson regression by replacing its binary regression with a multinomial regression, whereas Su et al. [Statist. Sinica 23 (2013) 1071-1090] proposed a multiple-inflation Poisson model for consecutive count responses with excessive frequencies. We derive several properties of the proposed model and carry out statistical inference under a fully Bayesian framework. We perform simulation studies and, using our methodology, analyze data on the number of infections collected in five major hospitals in Turkey.
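The mixture structure can be sketched as follows — a hypothetical parameterisation for illustration, in which each inflated count carries an extra point mass and the remaining probability follows a Poisson (the paper models these masses through a multinomial regression on covariates):

```python
import math

def mip_pmf(y, lam, inflate):
    """pmf of a multiple-inflation Poisson: `inflate` maps each inflated
    count v_j to its extra mass p_j, and the remaining mass
    1 - sum(p_j) follows a Poisson(lam)."""
    base = math.exp(-lam) * lam ** y / math.factorial(y)
    extra = inflate.get(y, 0.0)
    return extra + (1.0 - sum(inflate.values())) * base

# inflation at both 0 and 1, as in "more than one inflated value"
p0 = mip_pmf(0, lam=2.0, inflate={0: 0.2, 1: 0.1})
```

Setting `inflate={0: p}` recovers the ordinary zero-inflated Poisson, which is the special case the multinomial generalisation extends.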
Judgment post-stratification is used to supplement observations taken from finite mixture models with additional, easy-to-obtain rank information and to incorporate it in the estimation of model parameters. To do this, sampled units are post-stratified on ranks by randomly selecting comparison sets for each unit from the underlying population and assigning ranks to them using available auxiliary information or judgment ranking. This results in a set of independent order statistics from the underlying model, where the number of units in each rank class is random. We consider cases where one or more rankers with different ranking abilities are used to provide judgment ranks. The judgment ranks are then combined to produce a strength-of-agreement measure for each observation. This strength measure is incorporated into the maximum likelihood estimation of the model parameters via a suitable expectation-maximization algorithm. Simulation studies are conducted to evaluate the performance of the estimators with and without the extra rank information. The results are applied to bone mineral density data from the Third National Health and Nutrition Examination Survey to estimate the prevalence of osteoporosis in adult women aged 50 and over.
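The "without extra rank information" baseline in the comparison above is a plain EM fit of a finite mixture; a minimal two-component normal sketch (illustrative only — the rank-augmented E-step of the paper is not shown):

```python
import numpy as np

def em_mixture(x, iters=200):
    """Plain EM for a two-component normal mixture; the mixing weight pi
    plays the role of a prevalence estimate (as for osteoporosis in the
    application)."""
    x = np.asarray(x, float)
    pi, mu1, mu2 = 0.5, x.min(), x.max()
    s1 = s2 = x.std()
    for _ in range(iters):
        # E-step: responsibilities (normal constant cancels in the ratio)
        d1 = pi * np.exp(-0.5 * ((x - mu1) / s1) ** 2) / s1
        d2 = (1 - pi) * np.exp(-0.5 * ((x - mu2) / s2) ** 2) / s2
        r = d1 / (d1 + d2)
        # M-step: weighted updates
        pi = r.mean()
        mu1 = (r * x).sum() / r.sum()
        mu2 = ((1 - r) * x).sum() / (1 - r).sum()
        s1 = np.sqrt((r * (x - mu1) ** 2).sum() / r.sum())
        s2 = np.sqrt(((1 - r) * (x - mu2) ** 2).sum() / (1 - r).sum())
    return pi, (mu1, s1), (mu2, s2)

rng = np.random.default_rng(2)
x = np.concatenate([rng.normal(0.0, 1.0, 300), rng.normal(4.0, 1.0, 700)])
pi, c1, c2 = em_mixture(x)
```

The rank information enters this same EM through reweighted responsibilities, which is where the strength-of-agreement measure is used.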
In the framework of cluster analysis based on Gaussian mixture models, it is usually assumed that all the variables provide information about the clustering of the sample units. Several variable selection procedures are available for detecting the structure of interest for the clustering when this structure is contained in a sub-vector of the variables. Currently, in these procedures a variable is assumed to play one of (up to) three roles: (1) informative; (2) uninformative and correlated with some informative variables; (3) uninformative and uncorrelated with any informative variable. A more general approach to modelling the role of a variable is proposed, which takes into account the possibility that the variable vector provides information about more than one structure of interest for the clustering. This approach is developed by assuming that such information is given by non-overlapping and possibly correlated sub-vectors of variables; it is also assumed that the model for the variable vector equals a product of conditionally independent Gaussian mixture models (one for each variable sub-vector). Details about model identifiability, parameter estimation and model selection are provided. The usefulness and effectiveness of the described methodology are illustrated using simulated and real datasets.
Bond rating Transition Probability Matrices (TPMs) are built over a one-year time frame, but for many practical purposes, such as the assessment of risk in portfolios or the computation of banking Capital Requirements (e.g. under the new IFRS 9 regulation), one needs to compute the TPM and probabilities of default over a shorter time interval. In the context of continuous-time Markov chains (CTMCs), several deterministic and statistical algorithms have been proposed to estimate the generator matrix. For this estimation we focus on the Expectation-Maximization (EM) algorithm of Bladt and Sørensen [J. R. Stat. Soc. Ser. B (Stat. Methodol.), 2005, 67, 395-410] for a CTMC with an absorbing state. This work's contribution is threefold. Firstly, we provide directly computable closed-form expressions for quantities appearing in the EM algorithm and the associated information matrix, allowing easy approximation of confidence intervals; previously, these quantities had to be estimated numerically, and the closed forms yield considerable computational speedups. Secondly, we prove convergence to a single set of parameters under very weak conditions (for the TPM problem). Finally, we benchmark our results numerically against other known algorithms, in particular on several problems related to credit risk. The EM algorithm we propose, equipped with the new formulas (and error criteria), outperforms the other algorithms in several metrics, in particular with much less overestimation of probabilities of default in the higher ratings than other statistical algorithms.
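The sub-annual step itself is straightforward once a generator estimate is available: the TPM over horizon t is exp(tQ). A minimal sketch with a hypothetical 3-state generator (two ratings plus an absorbing default state; the matrix exponential is a plain scaling-and-squaring routine, not the paper's EM estimation of Q):

```python
import numpy as np

def expm(A, squarings=20, terms=20):
    """Matrix exponential via scaling and squaring with a truncated
    Taylor series (adequate for small generator matrices)."""
    B = A / 2.0 ** squarings
    E, term = np.eye(len(A)), np.eye(len(A))
    for k in range(1, terms):
        term = term @ B / k
        E = E + term
    for _ in range(squarings):
        E = E @ E
    return E

# hypothetical generator: rows sum to 0, last state (default) absorbing
Q = np.array([[-0.11,  0.10, 0.01],
              [ 0.05, -0.25, 0.20],
              [ 0.00,  0.00, 0.00]])
P_year = expm(Q)           # one-year TPM
P_month = expm(Q / 12.0)   # one-month TPM: the sub-annual matrix
```

Consistency holds by construction: compounding the one-month matrix twelve times recovers the one-year TPM, since exp(Q/12)^12 = exp(Q).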