The tick structure of the financial markets entails discreteness of stock price changes. Based on this empirical evidence, we develop a multivariate model for discrete price changes featuring a mechanism to account fo...
详细信息
The tick structure of the financial markets entails discreteness of stock price changes. Based on this empirical evidence, we develop a multivariate model for discrete price changes featuring a mechanism to account for the large share of zero returns at high frequency. We assume that the observed price changes are independent conditional on the realization of two hidden Markov chains determining the dynamics and the distribution of the multivariate time series at hand. We study the properties of the model, which is a dynamic mixture of zero-inflated Skellam distributions. We develop an expectation-maximization algorithm with closed-form M-step that allows us to estimate the model by maximum likelihood. In the empirical application, we study the joint distribution of the price changes of a number of assets traded on NYSE. Particular focus is dedicated to the assessment of the quality of univariate and multivariate density forecasts, and of the precision of the predictions of moments like volatility and correlations. Finally, we look at the predictability of price staleness and its determinants in relation to the trading activity on the financial markets.
This paper considers the latent Gaussian graphical model, which extends the Gaussian graphical model to handle discrete data as well as mixed data with both continuous and discrete variables by assuming that discrete ...
详细信息
This paper considers the latent Gaussian graphical model, which extends the Gaussian graphical model to handle discrete data as well as mixed data with both continuous and discrete variables by assuming that discrete variables are generated by discretizing latent Gaussian variables. We propose a modified expectation-maximization (em) algorithm to estimate parameters in the latent Gaussian model for binary data. We also extend the proposed modified em algorithm to the latent Gaussian model for mixed data. The conditional dependence structure can be consequently constructed by exploring the sparsity pattern of the precision matrix of the latent variables. We illustrate the performance of our proposed estimator through comprehensive numerical studies and an application to voting data of the United Nations General Assembly.
Graphical models have received an increasing amount of attention in network psychometrics as a promising probabilistic approach to study the conditional relations among variables using graph theory. Despite recent adv...
详细信息
Graphical models have received an increasing amount of attention in network psychometrics as a promising probabilistic approach to study the conditional relations among variables using graph theory. Despite recent advances, existing methods on graphical models usually assume a homogeneous population and focus on binary or continuous variables. However, ordinal variables are very popular in many areas of psychological science, and the population often consists of several different groups based on the heterogeneity in ordinal data. Driven by these needs, we introduce the finite mixture of ordinal graphical models to effectively study the heterogeneous conditional dependence relationships of ordinal data. We develop a penalized likelihood approach for model estimation, and design a generalized expectation-maximization (em) algorithm to solve the significant computational challenges. We examine the performance of the proposed method and algorithm in simulation studies. Moreover, we demonstrate the potential usefulness of the proposed method in psychological science through a real application concerning the interests and attitudes related to fan avidity for students in a large public university in the United States.
The asymmetric exponential power (AEP) distribution has received much attention in economics and finance. Simulation study shows that iterative methods developed for finding the maximum likelihood (ML) estimates of th...
详细信息
The asymmetric exponential power (AEP) distribution has received much attention in economics and finance. Simulation study shows that iterative methods developed for finding the maximum likelihood (ML) estimates of the AEP distribution sometimes fail to converge. In this paper, the expectation-maximization (em) algorithm is proposed to find the ML estimates of the AEP distribution which always converges. Performance of the em algorithm is demonstrated by simulations and a real data illustration. As an application, the proposed em algorithm is applied to find the ML estimates for the regression coefficients when the error term in a linear regression model follows the AEP distribution. Performance of the AEP distribution in robust simple regression modelling is established through a real data illustration.
Finite Gamma mixture models are often used to describe randomness in income data, insurance data, and data in applications where the response values are intrinsically positive. The popular likelihood approach for mode...
详细信息
Finite Gamma mixture models are often used to describe randomness in income data, insurance data, and data in applications where the response values are intrinsically positive. The popular likelihood approach for model fitting, however, does not work for this model because its likelihood function is unbounded. Because of this, the maximum likelihood estimator is not well-defined. Other approaches have been developed to achieve consistent estimation of the mixing distribution, such as placing an upper bound on the shape parameter or adding a penalty to the log-likelihood function. In this paper, we show that if the shape parameter in the finite Gamma mixture model is structural, then the direct maximum likelihood estimator of the mixing distribution is well-defined and strongly consistent. We also present simulation results demonstrating the consistency of the estimator. We illustrate the application of the model with a structural shape parameter to household income data. The fitted mixture distribution leads to several possible subpopulation structures with regard to the level of disposable income.
Random effect models have been popularly used as a mainstream statistical technique over several decades;and the same can be said for response transformation models such as the Box-Cox transformation. The latter aims ...
详细信息
Random effect models have been popularly used as a mainstream statistical technique over several decades;and the same can be said for response transformation models such as the Box-Cox transformation. The latter aims at ensuring that the assumptions of normality and of homoscedasticity of the response distribution are fulfilled, which are essential conditions for inference based on a linear model or a linear mixed model. However, methodology for response transformation and simultaneous inclusion of random effects has been developed and implemented only scarcely, and is so far restricted to Gaussian random effects. We develop such methodology, thereby not requiring parametric assumptions on the distribution of the random effects. This is achieved by extending the 'Nonparametric Maximum Likelihood' towards a 'Nonparametric profile maximum likelihood' technique, allowing to deal with overdispersion as well as two-level data scenarios.
The stochastic blockmodel (SBM) models the connectivity within and between disjoint subsets of nodes in networks. Prior work demonstrated that the rows of an SBM's adjacency spectral embedding (ASE) and Laplacian ...
详细信息
The stochastic blockmodel (SBM) models the connectivity within and between disjoint subsets of nodes in networks. Prior work demonstrated that the rows of an SBM's adjacency spectral embedding (ASE) and Laplacian spectral embedding (LSE) both converge in law to Gaussian mixtures where the components are curved exponential families. Maximum likelihood estimation via the Expectation-Maximization (em) algorithm for a full Gaussian mixture model (GMM) can then perform the task of clustering graph nodes, albeit without appealing to the components' curvature. Noting that em is a special case of the Expectation-Solution (ES) algorithm, we propose two ES algorithms that allow us to take full advantage of these curved structures. After presenting the ES algorithm for the general curved-Gaussian mixture, we develop those corresponding to the ASE and LSE limiting distributions. Simulating from artificial SBMs and a brain connectome SBM reveals that clustering graph nodes via our ES algorithms can improve upon that of em for a full GMM for a wide range of settings.
Change point estimation in standard process observed over time is an important problem in literature with applications in various fields. We study this problem in a heterogeneous population. A model-based clustering p...
详细信息
Change point estimation in standard process observed over time is an important problem in literature with applications in various fields. We study this problem in a heterogeneous population. A model-based clustering procedure relying on skewed matrix-variate mixture is proposed. It is capable of capturing the heterogeneity pattern and estimating change points from all data groups simultaneously. The appeal of such approach also lies in its flexibility to model the skewness and dependence in data with good interpretability. Two novel algorithms called matrix power mixture with abrupt change model and matrix power mixture with gradual change model are developed. The approaches are illustrated by simulation studies across a variety of settings. The models are then tested on the US crime data with promising results.
At the specific power of electron cyclotron resonance (ECR) heating of 3.2 MW m(-3) (plasma density of 2 x 10(19) m(-3), electron temperature of 0.6 keV), an increase in the plasma energy lifetime by not less than 30%...
详细信息
At the specific power of electron cyclotron resonance (ECR) heating of 3.2 MW m(-3) (plasma density of 2 x 10(19) m(-3), electron temperature of 0.6 keV), an increase in the plasma energy lifetime by not less than 30% is accompanied by a two-time-decrease in the level of short-wave turbulent density fluctuations. In such a shot, before the beginning of the quasi-stationary confinement stage, the turbulent state of density fluctuations is characterized by the stronger deviation from zero of the coefficient of excess of fluctuation increments than it is in shots without transport transitions. This indicates the stronger deviation of the probability distribution function of density fluctuation increments from the normal law in shots with transport transitions. Based on the analysis of increments of short-wave fluctuations using the special method for separating the continuous components in stochastic processes, a qualitative difference was established between the behaviors of the structural components forming the plasma turbulence in shots with and without transport transitions. In addition, for shots with transport transitions, a change in the shape of the approximating finite mixture of normal distributions and parameters of its component densities is demonstrated.
Large-scale multiple testing is a fundamental problem in high dimensional statistical inference. It is increasingly common that various types of auxiliary information, reflecting the structural relationship among the ...
详细信息
Large-scale multiple testing is a fundamental problem in high dimensional statistical inference. It is increasingly common that various types of auxiliary information, reflecting the structural relationship among the hypotheses, are available. Exploiting such auxiliary information can boost statistical power. To this end, we propose a framework based on a two-group mixture model with varying probabilities of being null for different hypotheses a priori, where a shape-constrained relationship is imposed between the auxiliary information and the prior probabilities of being null. An optimal rejection rule is designed to maximize the expected number of true positives when average false discovery rate is controlled. Focusing on the ordered structure, we develop a robust em algorithm to estimate the prior probabilities of being null and the distribution of p-values under the alternative hypothesis simultaneously. We show that the proposed method has better power than state-of-the-art competitors while controlling the false discovery rate, both empirically and theoretically. Extensive simulations demonstrate the advantage of the proposed method. Datasets from genome-wide association studies are used to illustrate the new methodology.
暂无评论