检索结果-内蒙古大学图书馆

Baum-Welch algorithm on directed acyclic graph for mixtures with latent Bayesian networks

STAT 2017年第1期6卷 303-314页

作者： Li, Jia Lin, Lin Penn State Univ Dept Stat University Pk PA 16802 USA

We consider a mixture model with latent Bayesian network (MLBN) for a set of random vectors X-(t), X-(t) is an element of R-dt, t = 1, ..., T. Each X-(t) is associated with a latent state s(t), given which X-(t) is conditionally independent from other variables. The joint distribution of the states is governed by a Bayes net. Although specific types of MLBN have been used in diverse areas such as biomedical research and image analysis, the exact expectation-maximization (em) algorithm for estimating the models can involve visiting all the combinations of states, yielding exponential complexity in the network size. A prominent exception is the Baum-Welch algorithm for the hidden Markov model, where the underlying graph topology is a chain. We hereby develop a new Baum-Welch algorithm on directed acyclic graph (BW-DAG) for the general MLBN and prove that it is an exact em algorithm. BW-DAG provides insight on the achievable complexity of em. For a tree graph, the complexity of BW-DAG is much lower than that of the brute-force em. Copyright (c) 2017 John Wiley & Sons, Ltd.

关键词： Baum-Welch algorithm Bayesian network directed acyclic graph em algorithm hidden Markov model maximum likelihood estimation

来源：评论

学校读者我要写书评

暂无评论

Bivariate discrete generalized exponential distribution

引用

STATISTICS 2017年第5期51卷 1143-1158页

作者： Nekoukhou, Vahid Kundu, Debasis Khansar Fac Math & Comp Sci Dept Stat Khansar Iran Indian Inst Technol Kanpur Dept Math & Stat Kanpur India

In this paper, we develop a bivariate discrete generalized exponential distribution, whose marginals are discrete generalized exponential distribution as proposed by Nekoukhou, Alamatsaz and Bidram [Discrete generalized exponential distribution of a second type. Statistics. 2013;47:876-887]. It is observed that the proposed bivariate distribution is a very flexible distribution and the bivariate geometric distribution can be obtained as a special case of this distribution. The proposed distribution can be seen as a natural discrete analogue of the bivariate generalized exponential distribution proposed by Kundu and Gupta [Bivariate generalized exponential distribution. J Multivariate Anal. 2009;100:581-593]. We study different properties of this distribution and explore its dependence structures. We propose a new em algorithm to compute the maximum-likelihood estimators of the unknown parameters which can be implemented very efficiently, and discuss some inferential issues also. The analysis of one data set has been performed to show the effectiveness of the proposed model. Finally, we propose some open problems and conclude the paper.

关键词： Discrete bivariate model generalized exponential distribution maximum-likelihood estimators positive dependence joint probability mass function em algorithm Primary: 62F10 Secondary: 62H10

来源：评论

学校读者我要写书评

暂无评论

Robust quantile regression using a generalized class of skewed distributions

引用

STAT 2017年第1期6卷 113-130页

作者： Morales, Christian Galarza Davila, Victor Lachos Cabral, Celso Barbosa Cepero, Luis Castro Escuela Super Politecn Litoral Dept Matemat ESPOL Guayaquil 090902 Ecuador Univ Estadual Campinas Dept Estat BR-13083859 Campinas SP Brazil Univ Fed Amazonas Dept Estat BR-69080000 Manaus Amazonas Brazil Univ Concepcion Dept Estat Concepcion 4070386 Chile Univ Concepcion CI2MA Concepcion 4070386 Chile

It is well known that the widely popular mean regression model could be inadequate if the probability distribution of the observed responses do not follow a symmetric distribution. To deal with this situation, the quantile regression turns to be a more robust alternative for accommodating outliers and the misspecification of the error distribution because it characterizes the entire conditional distribution of the outcome variable. This paper presents a likelihood-based approach for the estimation of the regression quantiles based on a new family of skewed distributions. This family includes the skewed version of normal, Student-t, Laplace, contaminated normal and slash distribution, all with the zero quantile property for the error term and with a convenient and novel stochastic representation that facilitates the implementation of the expectation-maximization algorithm for maximum likelihood estimation of the pth quantile regression parameters. We evaluate the performance of the proposed expectation-maximization algorithm and the asymptotic properties of the maximum likelihood estimates through empirical experiments and application to a real-life dataset. The algorithm is implemented in the R package lqr, providing full estimation and inference for the parameters as well as simulation envelope plots useful for assessing the goodness of fit. Copyright (C) 2017 John Wiley & Sons, Ltd.

关键词： em algorithm quantile regression model scale mixtures of normal distributions

来源：评论

学校读者我要写书评

暂无评论

Bayesian and Maximum Likelihood Estimation for Gaussian Processes on an Incomplete Lattice

引用

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS 2017年第1期26卷 108-120页

作者： Stroud, Jonathan R. Stein, Michael L. Lysen, Shaun Georgetown Univ McDonough Sch Business Washington DC 20057 USA Univ Chicago Dept Stat Chicago IL 60637 USA Google Inc Quantitat Mkt Boulder CO USA

This article proposes a new approach for Bayesian and maximum likelihood parameter estimation for stationary Gaussian processes observed on a large lattice with missing values. We propose a Markov chain Monte Carlo approach for Bayesian inference, and a Monte Carlo expectation-maximization algorithm for maximum likelihood inference. Our approach uses data augmentation and circulant embedding of the covariance matrix, and provides likelihood-based inference for the parameters and the missing data. Using simulated data and an application to satellite sea surface temperatures in the Pacific Ocean, we show that our method provides accurate inference on lattices of sizes up to 512 x 512, and is competitive with two popular methods: composite likelihood and spectral approximations.

关键词： Circulant embedding Data augmentation em algorithm Markov chain Monte Carlo Spatial statistics

来源：评论

学校读者我要写书评

暂无评论

Model-based clustering for spatiotemporal data on air quality monitoring

引用

ENVIRONMETRICS 2017年第3期28卷 1-N.PAG页

作者： Cheam, A. S. M. Marbac, M. McNicholas, P. D. McMaster Univ Dept Math & Stat Hamilton ON Canada

Data extracted from air quality monitoring can require spatiotemporal clustering techniques. Of late, many clustering techniques are based on mixture models;however, there is a shortage of model-based approaches for spatiotemporal data. A new mixture to cluster spatiotemporal data, named STM, is introduced, and generic identifiability is proved. The resulting model defines each mixture component as a mixture of autoregressive polynomial regressions in which the weights consider the spatial and temporal information with logistic links. Under the maximum likelihood framework, parameter estimation is carried out via an expectation-maximization algorithm while classical information criteria can be used for model selection. The proposed model is applied to air quality monitoring data from the periphery of Paris considering one of the critical pollutants, nitrogen dioxide, at different times during the day. The STM model is implemented in the R package SpaTimeClust.

关键词： air quality clustering em algorithm functional data mixture model polynomial regression spatiotemporal clustering

来源：评论

学校读者我要写书评

暂无评论

A clustering cure rate model with application to a sealantstudy

引用

JOURNAL OF APPLIED STATISTICS 2017年第16期44卷 2949-2962页

作者： Gallardo, Diego I. Bolfarine, Heleno Pedroso-de-Lima, Atonio Carlos Univ Antofagasta Fac Ciencias Basicas Dept Matemat Antofagasta Chile Univ Sao Paulo Inst Matemat & Estat Sao Paulo Brazil

In this paper, the destructive negative binomial (DNB) cure rate model with a latent activation scheme [V. Cancho, D. Bandyopadhyay, F. Louzada, and B. Yiqi, The DNB cure rate model with a latent activation scheme, Statistical Methodology 13 (2013b), pp. 48-68] is extended to the case where the observations are grouped into clusters. Parameter estimation is performed based on the restricted maximum likelihood approach and on a Bayesian approach based on Dirichlet process priors. An application to a real data set related to a sealant study in a dentistry experiment is considered to illustrate the performance of the proposed model.

关键词： Bivariate random effects competing risks Dirichlet processes em algorithm latent activation scheme restricted maximum likelihood

来源：评论

学校读者我要写书评

暂无评论

Estimation and prediction for the generalized inverted exponential distribution based on progressively first-failure-censored data with application

引用

JOURNAL OF APPLIED STATISTICS 2017年第9期44卷 1576-1608页

作者： Ahmed, Essam A. Taibah Univ Fac Business Adm Khyber 41941 Saudi Arabia Sohag Univ Math Dept Sohag 82524 Egypt

In this paper, the estimation of parameters for a generalized inverted exponential distribution based on the progressively first-failure type-II right-censored sample is studied. An expectation-maximization (em) algorithm is developed to obtain maximum likelihood estimates of unknown parameters as well as reliability and hazard functions. Using the missing value principle, the Fisher information matrix has been obtained for constructing asymptotic confidence intervals. An exact interval and an exact confidence region for the parameters are also constructed. Bayesian procedures based on Markov Chain Monte Carlo methods have been developed to approximate the posterior distribution of the parameters of interest and in addition to deduce the corresponding credible intervals. The performances of the maximum likelihood and Bayes estimators are compared in terms of their mean-squared errors through the simulation study. Furthermore, Bayes two-sample point and interval predictors are obtained when the future sample is ordinary order statistics. The squared error, linear-exponential and general entropy loss functions have been considered for obtaining the Bayes estimators and predictors. To illustrate the discussed procedures, a set of real data is analyzed.

关键词： Generalized inverted exponential progressive first-failure censored em algorithm confidence intervals and region Bayesian estimation and prediction symmetric andasymmetric loss functions MCMC

来源：评论

学校读者我要写书评

暂无评论

A CONTINUOUS-TIME STOCHASTIC BLOCK MODEL FOR BASKETBALL NETWORKS

引用

ANNALS OF APPLIED STATISTICS 2017年第2期11卷 553-597页

作者： Xin, Lu Zhu, Mu Chipman, Hugh Royal Bank Canada 88 Queens Quay West Toronto ON M5J 0B8 Canada Univ Waterloo Dept Stat & Actuarial Sci 200 Univ Ave West Waterloo ON N2L 3G1 Canada Acadia Univ Dept Math & Stat Wolfville NS B4P 2R6 Canada

For professional basketball, finding valuable and suitable players is the key to building a winning team. To deal with such challenges, basketball managers, scouts and coaches are increasingly turning to analytics. Objective evaluation of players and teams has always been the top goal of basketball analytics. Typical statistical analytics mainly focuses on the box score and has developed various metrics. In spite of the more and more advanced methods, metrics built upon box score statistics provide limited information about how players interact with each other. Two players with similar box scores may deliver distinct team plays. Thus professional basketball scouts have to watch real games to evaluate players. Live scouting is effective, but suffers from inefficiency and subjectivity. In this paper, we go beyond the static box score and model basketball games as dynamic networks. The proposed continuous-time stochastic block model clusters the players according to their playing style and performance. The model provides cluster-specific estimates of the effectiveness of players at scoring, rebounding, stealing, etc., and also captures player interaction patterns within and between clusters. By clustering similar players together, the model can help basketball scouts to narrow down the search space. Moreover, the model is able to reveal the subtle differences in the offensive strategies of different teams. An application to NBA basketball games illustrates the performance of the model.

关键词： Clustering transactional network Markov chain em algorithm Gibbs sampling basketball analytics social network

来源：评论

学校读者我要写书评

暂无评论

Application of hidden semi-Markov models for the seismic hazard assessment of the North and South Aegean Sea, Greece

引用

JOURNAL OF APPLIED STATISTICS 2017年第6期44卷 1064-1085页

作者： Pertsinidou, C. E. Tsaklidis, G. Papadimitriou, E. Limnios, N. Aristotle Univ Thessaloniki Dept Math Thessaloniki Greece Aristotle Univ Thessaloniki Dept Geophys Thessaloniki Greece Univ Technol Compiegne CS Sorbonne Univ LMAC Lab Math Appl Compiegne EA2222 Compiegne France

The real stress field in an area associated with earthquake generation cannot be directly observed. For that purpose we apply hidden semi-Markov models (HSMMs) for strong earthquake occurrence in the areas of North and South Aegean Sea considering that the stress field constitutes the hidden process. The advantage of HSMMs compared to hidden Markov models (HMMs) is that they allow any arbitrary distribution for the sojourn times. Poisson, Logarithmic and Negative Binomial distributions as well as different model dimensions are tested. The parameter estimation is achieved via the em algorithm. For the decoding procedure, a new Viterbi algorithm with a simple form is applied detecting precursory phases (hidden stress variations) and warning for anticipated earthquake occurrences. The optimal HSMM provides an alarm period for 70 out of 88 events. HMMs are also studied presenting poor results compared to these obtained via HSMMs. Bootstrap standard errors and confidence intervals for the parameters are evaluated and the forecasting ability of the Poisson models is examined.

关键词： Hidden semi-Markov model Viterbi-decoding algorithm em algorithm stress field Aegean sea 62M20 62M05 65C60 86A15

来源：评论

学校读者我要写书评

暂无评论

Estimating Treatment Effect in a Proportional Hazards Model in Randomized Clinical Trials with All-or-Nothing Compliance

引用

BIOMETRICS 2016年第3期72卷 742-750页

作者： Li, Shuli Gray, Robert J. Dana Farber Canc Inst Dept Biostat & Computat Biol Boston MA 02115 USA

We consider methods for estimating the treatment effect and/or the covariate by treatment interaction effect in a randomized clinical trial under noncompliance with time-to-event outcome. As in Cuzick et al. (2007), assuming that the patient population consists of three (possibly latent) subgroups based on treatment preference: the ambivalent group, the insisters, and the refusers, we estimate the effects among the ambivalent group. The parameters have causal interpretations under standard assumptions. The article contains two main contributions. First, we propose a weighted per-protocol (Wtd PP) estimator through incorporating time-varying weights in a proportional hazards model. In the second part of the article, under the model considered in Cuzick et al. (2007), we propose an em algorithm to maximize a full likelihood (FL) as well as the pseudo likelihood (PL) considered in Cuzick et al. (2007). The E step of the algorithm involves computing the conditional expectation of a linear function of the latent membership, and the main advantage of the em algorithm is that the risk parameters can be updated by fitting a weighted Cox model using standard software and the baseline hazard can be updated using closed-form solutions. Simulations show that the em algorithm is computationally much more efficient than directly maximizing the observed likelihood. The main advantage of the Wtd PP approach is that it is more robust to model misspecifications among the insisters and refusers since the outcome model does not impose distributional assumptions among these two groups.

关键词： All-or-nothing compliance em algorithm Proportional hazards model Randomized clinical trial Weighted partial likelihood

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：