Current status data arise commonly in applications when there is only one feasible observation time to check if the failure time has occurred, but the exact failure time remains unknown. To accommodate the covariate e...
详细信息
Current status data arise commonly in applications when there is only one feasible observation time to check if the failure time has occurred, but the exact failure time remains unknown. To accommodate the covariate effect on failure time, the accelerated failure time (AFT) model has been widely used to analyze current status data with the distribution of the failure time assumed to be specified or unspecified. In this paper, we consider a logistic regression with a misclassfied covariate from the current status observation scheme. A semiparametric AFT model was built to model current status data to eliminate the bias caused by this misclassification. This model is also robust to the misspecification of the failure time compared to the parametric AFT model, as we assume an unknown distribution of the failure time in the proposed model. Furthermore, incorporating the covariate effect on the failure time increases the flexibility of the model. Finally, we adapt the Expectation-Maximization algorithm for estimation, which guarantees the convergence of the estimate. Both theory and empirical studies show the consistency of the estimator.
Multivariate Hawkes Processes (MHPs) are a class of point processes that can account for complex temporal dynamics among event sequences. In this work, we study the accuracy and computational efficiency of three class...
详细信息
Multivariate Hawkes Processes (MHPs) are a class of point processes that can account for complex temporal dynamics among event sequences. In this work, we study the accuracy and computational efficiency of three classes of algorithms which, while widely used in the context of Bayesian inference, have rarely been applied in the context of MHPs: stochastic gradient expectation-maximization, stochastic gradient variational inference and stochastic gradient Langevin Monte Carlo. An important contribution of this paper is a novel approximation to the likelihood function that allows us to retain the computational advantages associated with conjugate settings while reducing approximation errors associated with the boundary effects. The comparisons are based on various simulated scenarios as well as an application to the study of risk dynamics in the Standard & Poor's 500 intraday index prices among its 11 sectors.
The work aims to find a novel technique to remove noise from low light or low luminous level images to improve the visibility of the image and the performance of many image processing systems. A denoising technique us...
详细信息
The work aims to find a novel technique to remove noise from low light or low luminous level images to improve the visibility of the image and the performance of many image processing systems. A denoising technique using patch priors in wavelet domain for images with low luminous levels, with the help of the Gaussian Mixture Model, is presented here. The main idea is to perform denoising in a sparse domain. Initially, the image is decomposed into approximate and detailed components with the help of wavelet transform, and then the patch based Gaussian mixture model denoising process is applied on both approximate and detailed components. Expectation maximization algorithm is used for estimating the Gaussian mixture model parameters from the image patches. After denoising each component, inverse wavelet transform is applied to obtain the denoised output image. This denoising method was applied to a set of natural low luminous level images, and it resulted in clean images with good Peak Signal to Noise Ratio and Structural Similarity Index, compared to other conventional methods. This work is a novel method combining wavelet transform and Gaussian mixture model for the denoising of low light images.
The class of generalized hyperbolic (GH) distributions is generated by a mean-variance mixture of a multivariate Gaussian with a generalized inverse Gaussian (GIG) distribution. This rich family of GH distributions in...
详细信息
The class of generalized hyperbolic (GH) distributions is generated by a mean-variance mixture of a multivariate Gaussian with a generalized inverse Gaussian (GIG) distribution. This rich family of GH distributions includes some well-known heavy-tailed and symmetric multivariate distributions, including the Normal Inverse Gaussian and some members of the family of scale-mixture of skew-normal distributions. The class of GH distributions has received considerable attention in finance and signal processing applications. In this paper, we propose the likelihood ratio (LR) test to test hypotheses about the skewness parameter of a GH distribution. Due to the complexity of the likelihood function, the em algorithm is used to find the maximum likelihood estimates both in the complete model and the reduced model. For comparative purposes and due to its simplicity, we also consider the Gradient (G) test. A simulation study shows that the LR and G tests are usually able to achieve the desired significance levels and the testing power increases as the asymmetry increases. The methodology developed in the paper is applied to two real datasets.
We develop a novel extension of the mover-stayer model to allow for time-dependent variables such as macroeconomic factors and apply it to the repayment process for car loans. The MS model postulates a simple form of ...
详细信息
We develop a novel extension of the mover-stayer model to allow for time-dependent variables such as macroeconomic factors and apply it to the repayment process for car loans. The MS model postulates a simple form of population heterogeneity, which is particularly well suited to describing the repayment process: a proportion of borrowers always repay on time (stayers), and a complementary proportion evolves according to a discrete-time Markov chain (movers), with an absorbing default state. In contrast to the literatures focus on the determinants of defaults, our extension examines the determinants of creditworthy borrowers (stayers). We model the probability of borrowers being stayers as a logistic function of their time-fixed covariates as well as of macroeconomic variables. The car-loans data set, obtained from a Polish bank, contains a large number of characteristics for each borrower and their repayment histories. The MS models' estimation from these data indicates that annual GDP growth is the only macroeconomic variable exerting a substantial effect on the stayers' probability: as GDP increases, so does the proportion of stayers. Because stayers are the most desirable borrowers, the proposed model should be useful to institutional lenders.
In this article, doubly truncated expectation (DTE) and variance (DTV) of univariate generalized skew-elliptical (GSE) distributions are investigated. In addition, we present an alternative form of DTE and DTV for thi...
详细信息
In this article, doubly truncated expectation (DTE) and variance (DTV) of univariate generalized skew-elliptical (GSE) distributions are investigated. In addition, we present an alternative form of DTE and DTV for this class of distributions in terms of the hazard function. This class of distributions includes many skewing distributions, for instance, generalized skew-normal, skew-Student-t, skew-logistic, skew-Laplace, and skew-Pearson type VII distributions. Also, we define truncated generalized skew-elliptical distributions and give the relations between moments of truncated distributions and truncated moments of distributions. Specially, we use the em algorithm to give maximum likelihood estimation of parameters for generalized skew-elliptical distributions. Further, we apply our results to present tail conditional expectation (TCE) and tail variance (TV) for GSE distributions. We also structure an optimal portfolio selection involving DTE and DTV, and give its optimal solution. As an illustrative example, DTE and DTV of a skew-normal random variable are estimated by the Monte Carlo method. Finally, we use real data to fit and select the best distributions, and analysis TCE and TV for the logarithm of adjusted price of three companies (stocks) from S & P (Standard & Poor's) sectors.
This work introduces a refinement of the Parsimonious Model for fitting a Gaussian Mixture. The improvement is based on the consideration of clusters of the involved covariance matrices according to a criterion, such ...
详细信息
This work introduces a refinement of the Parsimonious Model for fitting a Gaussian Mixture. The improvement is based on the consideration of clusters of the involved covariance matrices according to a criterion, such as sharing Principal Directions. This and other similarity criteria that arise from the spectral decomposition of a matrix are the bases of the Parsimonious Model. We show that such groupings of covariance matrices can be achieved through simple modifications of the Cem (Classification Expectation Maximization) algorithm. Our approach leads to propose Gaussian Mixture Models for model-based clustering and discriminant analysis, in which covariance matrices are clustered according to a parsimonious criterion, creating intermediate steps between the fourteen widely known parsimonious models. The added versatility not only allows us to obtain models with fewer parameters for fitting the data, but also provides greater interpretability. We show its usefulness for model-based clustering and discriminant analysis, providing algorithms to find approximate solutions verifying suitable size, shape and orientation constraints, and applying them to both simulation and real data examples.
We introduce a new discrete distribution, called the centered reduced discrete q-Gaussian N-q(0,1). This distribution connects classical Gaussian, discrete Uniform, and quantum q-Gaussian distributions. In this paper,...
详细信息
We introduce a new discrete distribution, called the centered reduced discrete q-Gaussian N-q(0,1). This distribution connects classical Gaussian, discrete Uniform, and quantum q-Gaussian distributions. In this paper, we extend N-q(0,1) to N-q(mu,sigma(2)), overcoming a limitation of some q-distributions like Diaz and Pariguan's q-Gaussian. Notably, N-q(0,1) has distinct shapes and parameters from the classical counterpart, providing additional flexible modeling approach. Results show the suggested discrete q-Gaussian as a useful alternative to the classical Gaussian for modeling data with hollow values or heavy-tailed tails. We explore properties of N-q(mu,sigma(2)) and apply moments and maximum likelihood methods to estimate its parameters. Our analysis yields a key result on the concavity of the likelihood function, enabling efficient optimization algorithms for parameters estimation. Furthermore, we investigate a finite mixture of discrete q-Gaussians and apply the em algorithm for parameters estimation. Finally, we conduct a simulation study to evaluate the model and estimation methods.
Failure time data subject to various types of censoring commonly arise in epidemiological and biomedical studies. Motivated by an AIDS clinical trial, we consider regression analysis of failure time data that include ...
详细信息
Failure time data subject to various types of censoring commonly arise in epidemiological and biomedical studies. Motivated by an AIDS clinical trial, we consider regression analysis of failure time data that include exact and left-, interval-, and/or right-censored observations, which are often referred to as partly interval-censored failure time data. We study the effects of potentially time-dependent covariates on partly interval-censored failure time via a class of semiparametric transformation models that includes the widely used proportional hazards model and the proportional odds model as special cases. We propose an em algorithm for the nonparametric maximum likelihood estimation and show that it unifies some existing approaches developed for traditional right-censored data or purely interval-censored data. In particular, the proposed method reduces to the partial likelihood approach in the case of right-censored data under the proportional hazards model. We establish that the resulting estimator is consistent and asymptotically normal. In addition, we investigate the proposed method via simulation studies and apply it to the motivating AIDS clinical trial.
The multivariate contaminated normal (MCN) distribution represents a simple heavy-tailed generalization of the multivariate normal (MN) distribution to model elliptical contoured scatters in the presence of mild outli...
详细信息
The multivariate contaminated normal (MCN) distribution represents a simple heavy-tailed generalization of the multivariate normal (MN) distribution to model elliptical contoured scatters in the presence of mild outliers (also referred to as 'bad' points herein) and automatically detect bad points. The price of these advantages is two additional parameters: proportion of good observations and degree of contamination. However, in a multivariate setting, only one proportion of good observations and only one degree of contamination may be limiting. To overcome this limitation, we propose a multiple scaled contaminated normal (MSCN) distribution. Among its parameters, we have an orthogonal matrix Gamma. In the space spanned by the vectors (principal components) of Gamma, there is a proportion of good observations and a degree of contamination for each component. Moreover, each observation has a posterior probability of being good with respect to each principal component. Thanks to this probability, the method provides directional robust estimates of the parameters of the nested MN and automatic directional detection of bad points. The term 'directional' is added to specify that the method works separately for each principal component. Mixtures of MSCN distributions are also proposed, and an expectation-maximization algorithm is used for parameter estimation. Real and simulated data are considered to show the usefulness of our mixture with respect to well-established mixtures of symmetric distributions with heavy tails.
暂无评论