in this paper, we address the problem of image categorization with a fast novel method based on the unsupervised clustering of graphs in the context of both region-based segmentation and the constellation approach to ...
详细信息
in this paper, we address the problem of image categorization with a fast novel method based on the unsupervised clustering of graphs in the context of both region-based segmentation and the constellation approach to object recognition. Such method is an em central clustering algorithm which builds prototypical graphs on the basis of either Softassign or fast matching with graph transformations. We present two realistic applications and their experimental results: categorization of image segmentations and visual localization. We compare our graph prototypes with the set median graphs. Our results reveal that, on the one hand, structure extracted from images improves appearance-based visual localization accuracy. On the other hand, we show that the cost of our central graph clustering algorithm is the cost of a pairwise algorithm. We also discuss how the method scales with an increasing amount of images. In addition, we address the scientific question of what are the bounds of structural learning for categorization. Our in-depth experiments both for region-based and feature-based image categorization, will show that such bounds depend hardly on structural variability. (C) 2008 Elsevier B.V. All rights reserved.
Understanding the variability of speed patterns and congestion characteristics of interstate freeway systems caused by holiday traffic is beneficial because appropriate countermeasures for safety improvement and conge...
详细信息
Understanding the variability of speed patterns and congestion characteristics of interstate freeway systems caused by holiday traffic is beneficial because appropriate countermeasures for safety improvement and congestion mitigation can be prepared and drivers can avoid traffic congestion and change their holiday travel schedules. This study evaluated the traffic congestion patterns during the Thanksgiving holiday period in 2006 using a Gaussian mixture speed distribution estimated by the Expectation-Maximization (em) algorithm. This mathematical approach showed the potential of improving freeway operational performance evaluation schemes for holiday periods (even non-holiday periods). This study suggested that a Gaussian mixture model using the em algorithm could be used to properly characterize the severity and the variability of congestion on certain interstate roadway systems. However, this study also pointed out that the fundamental limitations of the mixture model and the statistical significance test about the mixture components should be well understood and need to be further investigated. In addition, because this study investigated the changing patterns of speed distributions with only one interstate freeway system, 1-95 northbound, other freeway systems with both directions need to be evaluated so that a more broad and confident analysis on holiday traffic can be achieved. Published by Elsevier Ltd.
作者:
Ghahramani, ZUCL
Gatsby Computat Neurosci Unit London WC1N 3AR England
We provide a tutorial on learning and inference in hidden Markov models in the context of the recent literature on Bayesian networks. This perspective makes it possible to consider novel generalizations of hidden Mark...
详细信息
We provide a tutorial on learning and inference in hidden Markov models in the context of the recent literature on Bayesian networks. This perspective makes it possible to consider novel generalizations of hidden Markov models with multiple hidden state variables, multiscale representations, and mixed discrete and continuous variables. Although exact inference in these generalizations is usually intractable, one can use approximate inference algorithms such as Markov chain sampling and variational methods. We describe how such methods are applied to these generalized hidden Markov models. We conclude this review with a discussion of Bayesian methods for model selection in generalized HMMs.
We extend the standard mixture of linear regressions model by allowing the mixing proportions to be modeled nonparametrically as a function of the predictors. This framework allows for more flexibility in the modeling...
详细信息
We extend the standard mixture of linear regressions model by allowing the mixing proportions to be modeled nonparametrically as a function of the predictors. This framework allows for more flexibility in the modeling of the mixing proportions than the fully parametric mixture of experts model, which we also discuss. We present an em-like algorithm for estimation of the new model. We also provide simulations demonstrating that our nonparametric approach can provide a better fit than the parametric approach in some instances and can serve to validate and thus reinforce the parametric approach ill others. We also analyze and interpret two real data sets using the new method. (C) 2010 Elsevier B.V. All rights reserved.
We address the procurement of new components for recyclable products in the context of Kodak's single-use camera. The objective is to find an ordering policy that minimizes the total expected procurement, inventor...
详细信息
We address the procurement of new components for recyclable products in the context of Kodak's single-use camera. The objective is to find an ordering policy that minimizes the total expected procurement, inventory holding, and lost sales cost. Distinguishing characteristics of the system are the uncertainty and unobservability associated with return flows of used cameras. We model the system as a closed queueing network, develop a heuristic procedure for adaptive estimation and control, and illustrate our methods with disguised data from Kodak. Using this framework, we investigate the effects of various system characteristics such as informational structure, procurement delay, demand rate, and length of the product's life cycle.
The minimum-variance smoother solution for input estimation is described and it is shown that the resulting estimates are unbiased. The smoothed input and state estimates are used to iteratively identify unknown proce...
详细信息
The minimum-variance smoother solution for input estimation is described and it is shown that the resulting estimates are unbiased. The smoothed input and state estimates are used to iteratively identify unknown process noise variances. The use of smoothed estimates, as opposed to filtered estimates, leads to improved approximate Cramer-Rao lower bounds for the unknown parameters. It is also shown that the sequence of iterates are monotonic and asymptotically approach the actual values under prescribed conditions. A nonlinear mining navigation application is described in which unknown parameters are estimated.
A class of random graph models is considered, combining features of exponential-family models and latent structure models, with the goal of retaining the strengths of both of them while reducing the weaknesses of each...
详细信息
A class of random graph models is considered, combining features of exponential-family models and latent structure models, with the goal of retaining the strengths of both of them while reducing the weaknesses of each of them. An open problem is how to estimate such models from large networks. A novel approach to large-scale estimation is proposed, taking advantage of the local structure of such models for the purpose of local computing. The main idea is that random graphs with local dependence can be decomposed into subgraphs, which enables parallel computing on subgraphs and suggests a two-step estimation approach. The first step estimates the local structure underlying random graphs. The second step estimates parameters given the estimated local structure of random graphs. Both steps can be implemented in parallel, which enables large-scale estimation. The advantages of the two-step estimation approach are demonstrated by simulation studies with up to 10,000 nodes and an application to a large Amazon product recommendation network with more than 10,000 products. (C) 2020 Elsevier B.V. All rights reserved.
Count regression models maintain a steadfast presence in modern applied statistics as highlighted by their usage in diverse areas like biometry, ecology, and insurance. However, a common practical problem with observe...
详细信息
Count regression models maintain a steadfast presence in modern applied statistics as highlighted by their usage in diverse areas like biometry, ecology, and insurance. However, a common practical problem with observed count data is the presence of excess zeros relative to the assumed count distribution. The seminal work of Lambert (1992) was one of the first articles to thoroughly treat the problem of zero-inflated count data in the presence of covariates. Since then, a vast literature has emerged regarding zero-inflated count regression models. In this first of two review articles, we survey some of the classic and contemporary literature on parametric zero-inflated count regression models, with emphasis on the utility of different univariate discrete distributions. We highlight some of the primary computational tools available for estimating and assessing the adequacy of these models. We concurrently emphasize the diverse data problems to which these models have been applied. This article is categorized under: Statistical Models > Generalized Linear Models Software for Computational Statistics > Software/Statistical Software algorithms and Computational Methods > Maximum Likelihood Methods
Detecting the foreground region of interest (ROI) for video sequences is an important issue both for video codecs and monitoring systems. In this paper, we propose a flow-process-based method to detect foreground ROI ...
详细信息
Detecting the foreground region of interest (ROI) for video sequences is an important issue both for video codecs and monitoring systems. In this paper, we propose a flow-process-based method to detect foreground ROI using four steps: global motion compensation, motion block extraction, multi-layer segmentation, and model updating. The former two procedures extract the foreground motion blocks and form a motion mask, and the latter two procedures remove the pixels belonging to the background inside the motion mask and update the color distributions of the background model. In addition, a block-based to pixel-based detection scheme is proposed to allow detection flexibility. Another benefit of the proposed method is that it can be embedded in video codecs for real-time ROI detection and encoding. Experimental results demonstrate that our method achieves improved performance in terms of both detection accuracy and time consumption.
Unsupervised machine learning approaches involving several clustering algorithms working together to tackle difficult data sets are a recent area of research with a large number of applications such as clustering of d...
详细信息
Unsupervised machine learning approaches involving several clustering algorithms working together to tackle difficult data sets are a recent area of research with a large number of applications such as clustering of distributed data, multi-expert clustering, multi-scale clustering analysis or multi-view clustering. Most of these frameworks can be regrouped under the umbrella of collaborative clustering, the aim of which is to reveal the common underlying structures found by the different algorithms while analyzing the data. Within this context, the purpose of this article is to propose a collaborative framework lifting the limitations of many of the previously proposed methods: Our proposed collaborative learning method makes possible for a wide range of clustering algorithms from different families to work together based solely on their clustering solutions, thus lifting previous limitation requiring identical prototypes between the different collaborators. Our proposed framework uses a variational em as its theoretical basis for the collaboration process and can be applied to any of the previously mentioned collaborative contexts. In this article, we give the main ideas and theoretical foundations of our method, and we demonstrate its effectiveness in a series of experiments on real data sets as well as data sets from the literature. (C) 2017 Elsevier Ltd. All rights reserved.
暂无评论