The PARELLA model is a probabilistic parallelogram model that can be used for the measurement of latent attitudes or latent preferences. The data analyzed are the dichotomous responses of persons to items, with a one ...
详细信息
The PARELLA model is a probabilistic parallelogram model that can be used for the measurement of latent attitudes or latent preferences. The data analyzed are the dichotomous responses of persons to items, with a one (zero) indicating agreement (disagreement) with the content of the item. The model provides a unidimensional representation of persons and items. The response probabilities are a function of the distance between person and item: the smaller the distance, the larger the probability that a person will agree with the content of the item. This paper discusses how the approach to differential item functioning presented by Thissen, Steinberg, and Wainer can be implemented for the PARELLA model.
In the paper the online fuzzy clustering recurrent procedure has been introduced that allows the forming of hyperellipsoidal clusters with an arbitrary orientation of the axes is proposed. Such clustering system is th...
详细信息
ISBN:
(纸本)9781538628744
In the paper the online fuzzy clustering recurrent procedure has been introduced that allows the forming of hyperellipsoidal clusters with an arbitrary orientation of the axes is proposed. Such clustering system is the generalization of a number of known algorithms, it is intended to solve tasks within the general problems of Data Stream Mining (DSM) and Dynamic Data Mining (DDM), when information is sequentially fed to processing in online mode.
In this paper, we present a statistical model of the relationship between injection and production wells on an oil field. The model is based on analysis of fluid flows occurring in the oil field;it is a modification o...
详细信息
Methods for the separation of a mixture of three-parameter lognormal distributions are investigated theoretically and empirically in the context of modeling message transmission delays in a computer cluster communicat...
详细信息
In information theory, the channel capacity, which indicates how efficient a given channel is, plays an important role. The best-used algorithm for evaluating the channel capacity is Arimoto algorithm [3]. This paper ...
详细信息
The specific nature of credit loan data requires the use of mixture cure models within the class of survival analysis tools. The constructed models allow for competing risks such as early repayment and default, and fo...
详细信息
The specific nature of credit loan data requires the use of mixture cure models within the class of survival analysis tools. The constructed models allow for competing risks such as early repayment and default, and for incorporating maturity, expressed as an unsusceptible part of the population. A novel further extension of such models incorporates unobserved heterogeneity within the risk groups. A hierarchical expectation-maximization algorithm is derived to fit the models and standard errors are obtained. Simulations and a data analysis illustrate the applicability and benefits of these models, and in particular an improved event time estimation. (c) 2021 EcoSta Econometrics and Statistics. Published by Elsevier B.V. All rights reserved.
Popular clustering methods, such as the k-means and the Gaussian mixture model (GMM), are composed of an initialization stage and an iteration stage. Although this is well known, the role of the two stages has not bee...
详细信息
ISBN:
(纸本)9798350310900
Popular clustering methods, such as the k-means and the Gaussian mixture model (GMM), are composed of an initialization stage and an iteration stage. Although this is well known, the role of the two stages has not been well studied for existing clustering methods yet. To understand this issue, the current research investigates well-known existing k-means and the GMM methods. The finding is that the initialization stage is more critical than the iteration stage. Based on this issue, the research develops improved k-means and GMM methods by using a nice initialization, which significantly enhances the performance of the corresponding methods. Experiments based on Monte Carlo simulations show that the proposed improved k-means and GMM methods can completely identify the true clusters if they are well separated from each, but this cannot be achieved by the existing k-means and GMM methods. Experiments based on a real-world dataset show that the proposed methods can be efficiently combined with dimension-reduction techniques for clustering high-dimensional massive data.
Modern stage of development of information and communication networks requires solving of crucial tasks of network traffic statistical analysis and traffic simulation modeling. The predominance of non-Poisson traffic ...
详细信息
ISBN:
(纸本)9781728106069
Modern stage of development of information and communication networks requires solving of crucial tasks of network traffic statistical analysis and traffic simulation modeling. The predominance of non-Poisson traffic leads to the impossibility of analyzing multichannel communication systems by the methods of queuing theory that used to describe telephone networks. The last decade has paid much attention to research on traffic that has signs of self-similarity. The main purpose of this work is a statistical analysis of non-Poissonian traffic, represented by multimodal non-standard Pascal (negative binomial) and Rice distributions. As a result, a study of the self-similarity degree has been performed by the R/S analysis and the aggregation method. In addition, we propose to use em-algorithm with an algorithm for determining an optimal number of clusters for an approximation of non-typical multimodal distributions.
A parametric mixture model of three different distributions is proposed to analyse heterogeneous survival data. The maximum likelihood estimators of the postulated parametric mixture model are estimated by applying an...
详细信息
ISBN:
(纸本)9780735412415
A parametric mixture model of three different distributions is proposed to analyse heterogeneous survival data. The maximum likelihood estimators of the postulated parametric mixture model are estimated by applying an Expectation Maximization algorithm (em) scheme. The simulations are performed by generating data, sampled from a population of three component parametric mixture of three different distributions. The parameters estimated by the proposed emalgorithm scheme are close to the parameters of the postulated model. To investigate the consistency and stability of the em scheme, the simulations are repeated several times. The repetitions of the simulation gave parameters closer to the values of postulated models, with relatively small standard errors. Log likelihood, AIC and BIC are computed to compare the proposed mixture model with parametric mixture models of one distribution. The calculated values of Log likelihood, AIC and BIC are all in favour of the proposed parametric mixture model of different distributions.
Mixture generalized gamma distribution is a combination of two distributions -- Generalized gamma distribution and length biased generalized gamma distribution. This distribution is presented by Suksaengrakcharoen and...
详细信息
ISBN:
(纸本)9781450363396
Mixture generalized gamma distribution is a combination of two distributions -- Generalized gamma distribution and length biased generalized gamma distribution. This distribution is presented by Suksaengrakcharoen and Bodhisuwan in 2014. The fmdings showed that probability density function (pdf) had fairly complexities, so it made problems in estimating parameters. The problem occurred in parameter estimation was that we were unable to calculate estimators in the form of critical expression. Thus, we will use numerical estimation to find the estimators. In this study, we presented a new method of the parameter estimation by using the expectation maximization algorithm (em), the conjugate gradient method, and the quasi -Newton method. The data was generated by acceptance -rejection method which is used for estimating alpha, beta, lambda and p . lambda is the scale parameter, p is the weight parameter, alpha and beta are the shape parameters. We will use Monte Carlo technique to fmd the estimator's performance. Determining the size of sample equals 30, 100 and the simulation were repeated 20 times in each case. We evaluated the effectiveness of the estimators which was introduced by considering values of the mean squared errors and the bias. The findings revealed that the em -algorithm had proximity to the actual values determined. Also, the maximum likelihood estimators via the conjugate gradient and the quasi Newton method are less precision than the maximum likelihood estimators via the em -algorithm.
暂无评论