Chinese named entity recognition (NER) has recently been viewed as a classification or sequence labeling problem, and many approaches have been proposed. However, they tend to address this problem without considering ...
详细信息
Recently, tensor networks have been proposed as a data structure for weighted model counting. Computing a weighted model count is thus reduced to contracting a factorized tensor expression. Inference queries on graphi...
详细信息
Recently, tensor networks have been proposed as a data structure for weighted model counting. Computing a weighted model count is thus reduced to contracting a factorized tensor expression. Inference queries on graphicalmodels, especially PoE (probability of evidence) queries, can be expressed directly as weighted model counting problems. Maximization problems can also be addressed on the same data structure, only the standard sum-product semiring has to be replaced by either the tropical (max-sum) or the Viterbi (max-product) semiring in the computations, that is, the tensor contractions. However, tensor contractions only provide maximal values, but MPE (most probable explanation) queries on graphicalmodels do not ask for the maximal value, but for a state, or even the states, at which the maximal value is attained. In the special case of tropical tensor networks for ground states of spin glasses, it has been observed that the ground state can be obtained by computing a derivative of the tensor network over the tropical semiring. Here, we generalize this observation, provide a generic algorithm for computing the derivatives, and prove its correctness.
graphicalprobabilisticmodels play an important role in modern machine learning and pattern recognition. In this half-day tutorial, we introduce the fundamentals of graphicalprobabilistic modeling, including Bayesia...
详细信息
Protein secondary structure prediction is an important step to understanding protein tertiary structure. Recent studies indicate that the correlation between neighboring secondary structures are beneficial to improve ...
详细信息
ISBN:
(纸本)9780769536415
Protein secondary structure prediction is an important step to understanding protein tertiary structure. Recent studies indicate that the correlation between neighboring secondary structures are beneficial to improve prediction performance. In this paper, we propose a new large margin approach for protein secondary structure prediction, which consider the problem as a sequence labeling problem like probabilisticgraphicalmodels. It doesn't only make full use of the correlation between neighboring secondary structures like graphical chain models, but also shares the key advantages of other SVM-based methods, i.e. learming non-linear discriminant via kernel functions. The experimental results on datasets: CB513 and RS126 show that our algorithm outperforms other state-of-the-art methods.
For large scale automatic semantic video characterization, it is necessary to learn and model a large number of semantic concepts. These semantic concepts do not exist in isolation to each other and exploiting this re...
详细信息
ISBN:
(纸本)9781424403660
For large scale automatic semantic video characterization, it is necessary to learn and model a large number of semantic concepts. These semantic concepts do not exist in isolation to each other and exploiting this relationship between multiple video concepts could be a useful source to improve the concept detection accuracy. In this paper, we describe various multi-concept relational learning approaches via a unified probabilisticgraphical model representation and propose using numerous graphicalmodels to mine the relationship between video concepts that have not been applied before. Their performances in video semantic concept detection are evaluated and compared on two TRECVID'05 video collections.
probabilisticgraphical model representations of relational data provide a number of desired features, such as inference of missing values, detection of errors, visualization of data, and probabilistic answers to rela...
详细信息
ISBN:
(纸本)9781450322638
probabilisticgraphical model representations of relational data provide a number of desired features, such as inference of missing values, detection of errors, visualization of data, and probabilistic answers to relational queries. However, adoption has been slow due to the high level of expertise expected both in probability and in the domain from the user. Instead of requiring a domain expert to specify the probabilistic dependencies of the data, we present an approach that uses the relational DB schema to automatically construct a Bayesian graphical model for a database. This resulting model contains customized distributions for the attributes, latent variables that cluster the records, and factors that reflect and represent the foreign key links, whilst allowing efficient inference. Experiments demonstrate the accuracy of the model and scalability of inference on synthetic and real-world data.
M-Modes is the problem of finding the top M locally optimal solutions of a graphical model, called modes. These modes provide geometric characterization of the energy landscape of a graphical model and lead to high qu...
详细信息
M-Modes is the problem of finding the top M locally optimal solutions of a graphical model, called modes. These modes provide geometric characterization of the energy landscape of a graphical model and lead to high quality solutions in structured prediction. It has been shown that any mode must be a local MAP within every subgraph of certain size. The state-of-the-art method is a search algorithm that explores subgraphs in a fixed ordering, uses each subgraph as a layer and searches for a consistent concatenation of local MAPs. We observe that for the M-Modes problem, different search orderings can lead to search spaces with dramatically different sizes, resulting in huge differences in performance. We formalize a metric measuring the quality of different orderings. We then formulate finding an optimized ordering as a shortest path problem, and introduce pruning criteria to speed up the search. Our empirical results show that using optimized orderings improves the efficiency of M-Modes search by up to orders of magnitude.
Color coding is an important research topic in spatial encoded structured light sensing (SLS). In this study, we propose a novel graphical model based approach for the color pattern decoding task. For efficient color ...
详细信息
ISBN:
(纸本)9781479948086
Color coding is an important research topic in spatial encoded structured light sensing (SLS). In this study, we propose a novel graphical model based approach for the color pattern decoding task. For efficient color labeling, the color pattern is firstly decomposed into separate binary pattern images. With the labeled pattern elements, a unified probabilisticgraphical framework is constructed to represent the pseudorandom pattern as a clique tree structure. The model contains two parts: the Conditional Random Field (CRF) is used to represent the dependences between these local decisions, and the Bayesian network (BN) is applied for the representation of background colors effect. A colorful target is experimented to demonstrate its feasibility. And the 3D reconstructed models based on the decoding results are also provided to show its robustness.
Statistical inference is a powerful technique in various applications. Although many statistical inference tools are available, answering inference queries involving complex quantification structures remains challengi...
详细信息
ISBN:
(纸本)9781956792003
Statistical inference is a powerful technique in various applications. Although many statistical inference tools are available, answering inference queries involving complex quantification structures remains challenging. Recently, solvers for Stochastic Boolean Satisfiability (SSAT), a powerful formalism allowing concise encodings of PSPACE decision problems under uncertainty, are under active development and applied in more and more applications. In this work, we exploit SSAT solvers for the inference of probabilisticgraphicalmodels (PGMs), an essential representation for probabilistic reasoning. Specifically, we develop encoding methods to systematically convert PGM inference problems into SSAT formulas for effective solving. Experimental results demonstrate that, by using our encoding, SSAT-based solving can complement existing PGM tools, especially in answering complex queries.
We propose a probabilisticgraphical model (PGM) for prognosis and diagnosis of breast cancer. PGMs are suitable for building predictive models in medical applications, as they are powerful tools for making decisions ...
详细信息
ISBN:
(纸本)9781509002870
We propose a probabilisticgraphical model (PGM) for prognosis and diagnosis of breast cancer. PGMs are suitable for building predictive models in medical applications, as they are powerful tools for making decisions under uncertainty from big data with missing attributes and noisy evidence. Previous work relied mostly on clinical data to create a predictive model. Moreover, practical knowledge of an expert was needed to build the structure of a model, which may not be accurate. In our opinion, since cancer is basically a genetic disease, the integration of microarray and clinical data can improve the accuracy of a predictive model. However, since microarray data is high-dimensional, including genomic variables may lead to poor results for structure and parameter learning due to the curse of dimensionality and small sample size problems. We address these problems by applying manifold learning and a deep belief network (DBN) to microarray data. First, we construct a PGM and a DBN using clinical and microarray data, and extract the structure of the clinical model automatically by applying a structure learning algorithm to the clinical data. Then, we integrate these two models using softmax nodes. Extensive experiments using real-world databases, such as METABRIC and NKI, show promising results in comparison to Support Vector Machines (SVMs) and k-Nearest Neighbors (k-NN) classifiers, for classifying tumors and predicting events like recurrence and metastasis.
暂无评论