检索结果-内蒙古大学图书馆

Sparse inverse covariance estimation for high-throughput microRNA sequencing data in the Poisson log-normal graphical model

引用

JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION 2019年第16期89卷 3105-3117页

作者： Sinclair, David Hooker, Giles Cornell Univ Dept Stat Sci Ithaca NY 14850 USA

We introduce a one-step em algorithm to estimate the graphical structure in a Poisson-Log-Normal graphical model. This procedure is equivalent to a normality transformation that makes the problem of identifying relationships in high-throughput microRNA (miRNA) sequence data feasible. The Poisson-log-normal model moreover allows us to directly account for known overdispersion relationships present in this data set. We show that our em algorithm provides a provable increase in performance in determining the network structure. The model is shown to provide an increase in performance in simulation settings over a range of network structures. The model is applied to high-throughput miRNA sequencing data from patients with breast cancer from The Cancer Genome Atlas (TCGA). By selecting the most highly connected miRNA molecules in the fitted network we find that nearly all of them are known to be involved in the regulation of breast cancer.

关键词： Poisson network graphical LASSO em algorithm miRNA

来源：评论

学校读者我要写书评

暂无评论

Marginal maximum likelihood estimation of conditional autoregressive models with missing data

引用

STAT 2019年第1期8卷 1-10页

作者： Suesse, Thomas Zammit-Mangion, Andrew Univ Wollongong Sch Math & Appl Stat Natl Inst Appl Stat Res Australia Wollongong NSW 2522 Australia

Maximum likelihood (ML) estimation of spatial autocorrelation models is well established for the case where each node in the graph is directly observed. When one or more nodes are not observed, the user has a variety of computational tools at her or his disposal ranging from the expectation-maximization algorithm, which has become a standard for missing-data problems, to marginal likelihood estimation methods and to fully Bayesian approaches. In this article, we give a comprehensive overview of likelihood-based computational frameworks for parameter estimation of the conditional autoregressive model, and we establish connections with several algorithms in the literature that are iterative and often computationally suboptimal. We show that a vanilla marginal ML approach, which we provide computational details for, is still generally orders of magnitude faster than the iterative approaches, even on large data sets and especially so when the number of unobserved units is relatively large.

关键词： CAR model em algorithm incomplete data spatial statistics

来源：评论

学校读者我要写书评

暂无评论

The Rayleigh-Lindley model: properties and applications

引用

JOURNAL OF APPLIED STATISTICS 2019年第1期46卷 141-163页

作者： Gomez, Yolanda M. Gallardo, Diego I. Iriarte, Yuri Bolfarine, Heleno Univ Atacama Fac Ingn Dept Matemat Copiapo Chile Univ Antofagasta Fac Ciencias Basicas Dept Matemat Antofagasta Chile Univ Sao Paulo Inst Matemat & Estat Sao Paulo Brazil

In this paper, the Rayleigh-Lindley (RL) distribution is introduced, obtained by compounding the Rayleigh and Lindley discrete distributions, where the compounding procedure follows an approach similar to the one previously studied by Adamidis and Loukas in some other contexts. The resulting distribution is a two-parameter model, which is competitive with other parsimonious models such as the gamma and Weibull distributions. We study some properties of this new model such as the moments and the mean residual life. The estimation was approached via em algorithm. The behavior of these estimators was studied in finite samples through a simulation study. Finally, we report two real data illustrations in order to show the performance of the proposed model versus other common two-parameter models in the literature. The main conclusion is that the model proposed can be a valid alternative to other competing models well established in the literature.

关键词： Rayleigh distribution Lindley discrete model compound distribution em algorithm survival analysis

来源：评论

学校读者我要写书评

暂无评论

New Parametric Estimation Methods based on Ranked Set Sampling

引用

GAZI UNIVERSITY JOURNAL OF SCIENCE 2019年第4期32卷 1356-1368页

作者： Ashour, Samir K. Abdallah, Mohamed S. Cairo Univ Inst Stat Studies & Res Cairo Egypt Aswan Univ Fac Commerce Aswan Egypt

The problem of parameters estimation plays a significant role in various areas of academic researches. In this article, we propose three different methods of estimation for the parameters of location-scale family under ranked set sampling in the view of missing data mechanism. Through a series of Monte Carlo simulations, it is well investigated that the proposed methods are relatively robust from violating the perfect ranking condition and provide better performance over their competitors using bias and MSE (mean square error) criteria. An empirical data set is also used for illustrative purposes.

关键词： Cramer-von-Mises em algorithm Estimation methods Missing Data Approach Ranked set sampling

来源：评论

学校读者我要写书评

暂无评论

AGGREGATE CLAIM ESTIMATION USING BIVARIATE HIDDEN MARKOV MODEL

引用

ASTIN BULLETIN 2019年第1期49卷 189-215页

作者： Oflaz, Zarina Nukeshtayeva Yozgatligil, Ceylan Selcuk-Kestel, A. Sevtap Middle East Tech Univ Dept Stat Ankara Turkey KTO Karatay Univ Dept Insurance & Social Secur Konya Turkey Middle East Tech Univ Inst Appl Math Actuarial Sci Ankara Turkey

In this paper, we propose an approach for modeling claim dependence, with the assumption that the claim numbers and the aggregate claim amounts are mutually and serially dependent through an underlying hidden state and can be characterized by a hidden finite state Markov chain using bivariate Hidden Markov Model (BHMM). We construct three different BHMMs, namely Poisson-Normal HMM, Poisson-Gamma HMM, and Negative Binomial- Gamma HMM, stemming from the most commonly used distributions in insurance studies. Expectation Maximization algorithm is implemented and for the maximization of the state-dependent part of log-likelihood of BHMMs, the estimates are derived analytically. To illustrate the proposed model, motor third-party liability claims in Istanbul, Turkey, are employed in the frame of Poisson-Normal HMM under a different number of states. In addition, we derive the forecast distribution, calculate state predictions, and determine the most likely sequence of states. The results indicate that the dependence under indirect factors can be captured in terms of different states, namely low, medium, and high states.

关键词： Claim estimation bivariate Hidden Markov model em algorithm Viterbi algorithm MTPL

来源：评论

学校读者我要写书评

暂无评论

Meta-Analysis of Clinical Trials With Sparse Binary Outcomes Using Zero-Inflated Binomial (ZIB) Models

引用

STATISTICS IN BIOPHARMACEUTICAL RESEARCH 2019年第3期11卷 228-238页

作者： Dong, Cheng Zhao, Yueqin Tiwari, Ram Univ Missouri Dept Stat Columbia MO 65211 USA US FDA Div Biometr 7 Off Biostat OTSCDER Silver Spring MD 20993 USA US FDA Div Biostat OSB CDRH Silver Spring MD USA

In meta-analysis of clinical trials, standard statistical methods run into problems when the proportions of safety events are small. Motivated by the dataset used in a published analysis of cardiovascular safety in Rosiglitazone trials, this article proposes using a zero-inflated binomial model to handle the zero-event trials. The maximum likelihood estimates of the model parameters are obtained using the expectation and maximization algorithm. Via simulation studies, it is shown that the proposed methods provide estimates of odds ratios with less bias and variation, compared with both the Mantel-Hanszel method with continuity correction and Peto's method. The proposed methods are applied to the Rosiglitazone trials. for this article are available online.

关键词： em algorithm Fixed-effects model Rare events

来源：评论

学校读者我要写书评

暂无评论

Joint estimation of conditional quantiles in multivariate linear regression models with an application to financial distress

引用

JOURNAL OF MULTIVARIATE ANALYSIS 2019年 173卷 70-84页

作者： Petrella, Lea Raponi, Valentina Sapienza Univ Rome MEMOTEF Rome Italy Imperial Coll London Imperial Coll Business Sch London England

This paper proposes a maximum likelihood approach to jointly estimate marginal conditional quantiles of multivariate response variables in a linear regression framework. We consider a slight reparameterization of the multivariate asymmetric Laplace distribution proposed by Kotz et al. (2001) and exploit its location-scale mixture representation to implement a new em algorithm for estimating model parameters. The idea is to extend the link between the asymmetric Laplace distribution and the well-known univariate quantile regression model to a multivariate context, i.e., when a multivariate dependent variable is concerned. The approach accounts for association among multiple responses and studies how the relationship between responses and explanatory variables can vary across different quantiles of the marginal conditional distribution of the responses. A penalized version of the em algorithm is also presented to tackle the problem of variable selection. The validity of our approach is analyzed in a simulation study, where we also provide evidence on the efficiency gain of the proposed method compared to estimation obtained by separate univariate quantile regressions. A real data application examines the main determinants of financial distress in a sample of Italian firms. (C) 2019 Elsevier Inc. All rights reserved.

关键词： em algorithm Maximum likelihood Multivariate asymmetric Laplace distribution Multiple quantiles Multivariate response variables Quantile regression

来源：评论

学校读者我要写书评

暂无评论

The endo-exo problem in high frequency financial price fluctuations and rejecting criticality

引用

QUANTITATIVE FINANCE 2019年第7期19卷 1165-1178页

作者： Wheatley, Spencer Wehrli, Alexander Sornette, Didier Swiss Fed Inst Technol Dept Management Technol & Econ Zurich Switzerland Univ Geneva Swiss Finance Inst 40 Blvd Pont dArve CH-1211 Geneva 4 Switzerland

The endo-exo problem lies at the heart of statistical identification in many fields of science, and is often plagued by spurious strong-and-long memory due to improper treatment of trends, shocks and shifts in the data. A class of models that has shown to be useful in discerning exogenous and endogenous activity is the Hawkes process. This class of point processes has enjoyed great recent popularity and rapid development within the quantitative finance literature, with particular focus on the study of market microstructure and high frequency price fluctuations. We show that there are important lessons from older fields like time series and econometrics that should also be applied in financial point process modelling. In particular, we emphasize the importance of appropriately treating trends and shocks for the identification of the strength and length of memory in the system. We exploit the powerful Expectation Maximization algorithm and objective statistical criteria (BIC) to select the flexibility of the deterministic background intensity. With these methods, we strongly reject the hypothesis that the considered financial markets are critical at univariate and bivariate microstructural levels.

关键词： Hawkes process Econometrics High frequency financial data Spurious inference Non-stationarity em algorithm

来源：评论

学校读者我要写书评

暂无评论

A joint quantile regression model for multiple longitudinal outcomes

引用

ASTA-ADVANCES IN STATISTICAL ANALYSIS 2019年第4期103卷 453-473页

作者： Kulkarni, Hemant Biswas, Jayabrata Das, Kiranmoy Indian Stat Inst Human Genet Unit Kolkata India Indian Stat Inst Interdisciplinary Stat Res Unit Kolkata India

Complexity of longitudinal data lies in the inherent dependence among measurements from same subject over different time points. For multiple longitudinal responses, the problem is challenging due to inter-trait and intra-trait dependence. While linear mixed models are popularly used for analysing such data, appropriate inference on the shape of the population cannot be drawn for non-normal data sets. We propose a linear mixed model for joint quantile regression of multiple longitudinal responses. We consider an asymmetric Laplace distribution for quantile regression and estimate model parameters by Monte Carlo em algorithm. Nonparametric bootstrap resampling method is used for estimating confidence intervals of parameter estimates. Through extensive simulation studies, we investigate the operating characteristics of our proposed model and compare the performance to other traditional quantile regression models. We apply proposed model for analysing data from nutrition education programme on hypercholesterolemic children of the USA.

关键词： Asymmetric Laplace Distribution em algorithm Longitudinal data MCMC Quantile regression

来源：评论

学校读者我要写书评

暂无评论

Variable selection in semiparametric nonmixture cure model with interval-censored failure time data: An application to the prostate cancer screening study

引用

STATISTICS IN MEDICINE 2019年第16期38卷 3026-3039页

作者： Sun, Liuquan Li, Shuwei Wang, Lianming Song, Xinyuan Guangzhou Univ Sch Econ & Stat Guangzhou 510006 Guangdong Peoples R China Univ South Carolina Dept Stat Columbia SC 29208 USA Chinese Univ Hong Kong Dept Stat Shatin Hong Kong Peoples R China

Censored failure time data with a cured subgroup is frequently encountered in many scientific areas including the cancer screening research, tumorigenicity studies, and sociological surveys. Meanwhile, one may also encounter an extraordinary large number of risk factors in practice, such as patient's demographic characteristics, clinical measurements, and medical history, which makes variable selection an emerging need in the data analysis. Motivated by a medical study on prostate cancer screening, we develop a variable selection method in the semiparametric nonmixture or promotion time cure model when interval-censored data with a cured subgroup are present. Specifically, we propose a penalized likelihood approach with the use of the least absolute shrinkage and selection operator, adaptive least absolute shrinkage and selection operator, or smoothly clipped absolute deviation penalties, which can be easily accomplished via a novel penalized expectation-maximization algorithm. We assess the finite-sample performance of the proposed methodology through extensive simulations and analyze the prostate cancer screening data for illustration.

关键词： em algorithm interval censoring nonmixture cure model penalized likelihood variable selection

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：