Medical data is particularly interesting as a subject for relational data mining due to the complex interactions which exist between different entities. Furthermore, the ambiguity of medical imaging causes interpretat...
详细信息
ISBN:
(数字)9783319780900
ISBN:
(纸本)9783319780900;9783319780894
Medical data is particularly interesting as a subject for relational data mining due to the complex interactions which exist between different entities. Furthermore, the ambiguity of medical imaging causes interpretation to be complex and error-prone, and thus particularly amenable to improvement through automated decision support. Probabilistic inductivelogicprogramming (Pilp) is a particularly well-suited tool for this task, since it makes it possible to combine the relational nature of this field withthe ambiguity inherent in human interpretation of medical imaging. this work presents a Pilp setting for breast cancer data, where several clinical and demographic variables were collected retrospectively, and new probabilistic variables and rules reflecting domain knowledge were introduced. A Pilp predictive model was built automatically from this data and experiments show that it can not only match the predictions of a team of experts in the area, but also consistently reduce the error rate of malignancy prediction, when compared to other non-relational techniques.
We consider the problem of learning Boltzmann machine classifiers from relational data. Our goal is to extend the deep belief framework of RBMs to statistical relational models. this allows one to exploit the feature ...
详细信息
ISBN:
(数字)9783319780900
ISBN:
(纸本)9783319780900;9783319780894
We consider the problem of learning Boltzmann machine classifiers from relational data. Our goal is to extend the deep belief framework of RBMs to statistical relational models. this allows one to exploit the feature hierarchies and the non-linearity inherent in RBMs over the rich representations used in statistical relational learning (SRL). Specifically, we use lifted random walks to generate features for predicates that are then used to construct the observed features in the RBM in a manner similar to Markov logic Networks. We show empirically that this method of constructing an RBM is comparable or better than the state-of-the-art probabilistic relational learning algorithms on six relational domains.
inductive learning has been employed successfully in various domains, however the inductivelogicprogramming (ilp) systems focused on non-incremental learning tasks where independent sets of data are provided incoher...
详细信息
ISBN:
(纸本)9781538616390
inductive learning has been employed successfully in various domains, however the inductivelogicprogramming (ilp) systems focused on non-incremental learning tasks where independent sets of data are provided incoherently. In this paper, we propose a new genetic algorithm-based ilp system, called GAilp, for incremental learning. GAilp is a covering algorithm which extracts hypotheses/rules from a collection of examples in a reliable way. It employs a genetic algorithm technique to discover various aspects of the potential combinations. GAilp induces every possible rule for the given combination and selects the most generic ones among them. It also eliminates rules which might become obsolete by the existence of more generic rules. Unlike other ilp systems, GAilp batches all given examples and background knowledge, then it groups the examples and prioritizes the induction process. this prioritization needs to be done to preserve dependency and to revise theory. the paper introduces GAilp's fundamentals mechanisms and demonstrates its algorithms with a running example.
Motivated by an analogy with matrix factorization, we introduce the problem of factorizing relational data. In matrix factorization, one is given a matrix and has to factorize it as a product of other matrices. In rel...
详细信息
Motivated by an analogy with matrix factorization, we introduce the problem of factorizing relational data. In matrix factorization, one is given a matrix and has to factorize it as a product of other matrices. In relational data factorization, the task is to factorize a given relation as a conjunctive query over other relations, i.e., as a combination of natural join operations. Given a conjunctive query and the input relation, the problem is to compute the extensions of the relations used in the query. thus, relational data factorization is a relational analog of matrix factorization;it is also a form of inverse querying as one has to compute the relations in the query from the result of the query. the result of relational data factorization is neither necessarily unique nor required to be a lossless decomposition of the original relation. therefore, constraints can be imposed on the desired factorization and a scoring function is used to determine its quality (often similarity to the original data). Relational data factorization is thus a constraint satisfaction and optimization problem. We show how answer set programming can be used for solving relational data factorization problems.
this paper describing a method of specifying common terms of genes from microarray data in 3 steps. First, we use random forest for extracting disease-related genes and it give each gene variable importance. the highe...
详细信息
ISBN:
(纸本)9781450353502
this paper describing a method of specifying common terms of genes from microarray data in 3 steps. First, we use random forest for extracting disease-related genes and it give each gene variable importance. the higher the variable importance, the more effective feature for classification. We extract genes whose variable importance more than 0 and set them positive samples and the rest set negative samples for ilp. Next, we annotate extracted genes by using Gene Ontology (GO) and use the term as predicate for ilp. Annotation is the process of assigning GO terms to gene products. Finally, we obtain rules about common terms in positive samples by using ilp. ilp is a subfield of machine learning which uses logicprogramming as a uniform representation technique for examples, background knowledge and hypotheses. ilp learns based on background knowledge. Background knowledge is represented in first-order logic. In the result, we extracted 1051 mRNA as positive samples for ilp from random forest and its F-measure score was 65.1%. We obtained about 4000 terms at each dataset and use them as predicates for ilp. We got eventually some rules about positive samples.
Relation Extraction ( RE) is the task of detecting semantic relations between entities in text. Most of the state-of-the-art RE systems rely on statistical machine learning techniques which usually employ an attribute...
详细信息
ISBN:
(纸本)9781509001637
Relation Extraction ( RE) is the task of detecting semantic relations between entities in text. Most of the state-of-the-art RE systems rely on statistical machine learning techniques which usually employ an attribute-value representation of features. Contrarily to this trend, we focus on an alternative approach to RE based on the automatic induction of symbolic extraction rules. We present OntoilpER, an RE system based on inductivelogicprogramming which uses a domain ontology in its extraction process. Several experiments are discussed in this paper over the reACE 2004/2005 reference corpora. the results are encouraging and seem to demonstrate the effectiveness of the proposed solution.
the integration of abduction and induction has lead to a variety of non-monotonic ilp systems. XHAIL is one of these systems, in which abduction is used to compute hypotheses that subsume Kernel Sets. On the other han...
详细信息
the integration of abduction and induction has lead to a variety of non-monotonic ilp systems. XHAIL is one of these systems, in which abduction is used to compute hypotheses that subsume Kernel Sets. On the other hand, Peircebayes is a recently proposed logic-based probabilistic programming approach that combines abduction with parameter learning to learn distributions of most likely explanations. In this paper, we propose an approach for integrating probabilistic inference withilp. the basic idea is to redefine the inductive task of XHAIL as a statistical abduction, and to use Peircebayes to learn probability distribution of hypotheses. An initial evaluation of the proposed algorithm is given using synthetic data.
the proceedings contain 27 papers. the special focus in this conference is on inductivelogicprogramming. the topics include: Probabilistic relational models;inductive databases (abstract);some elements of machine le...
ISBN:
(纸本)3540661093
the proceedings contain 27 papers. the special focus in this conference is on inductivelogicprogramming. the topics include: Probabilistic relational models;inductive databases (abstract);some elements of machine learning (extended abstract);refinement operators can be (weakly) perfect;combining divide-and-conquer and separate-and-conquer for efficient and;effective rule induction;refining complete hypotheses in ilp;acquiring graphic design knowledge with nonmonotonic inductive learning;morphosyntactic tagging of Slovene using progol;experiments in predicting biodegradability;a first-order Bayesian classifier;building background knowledge into a refinement operator for inductivelogicprogramming;a strong complete schema for inductive functional logicprogramming;application of different learning methods to Hungarian part-of-speech tagging;combining lapis and wordnet for the learning of LR parsers with optimal semantic constraints;learning word segmentation rules for tag prediction;approximate ilp rules by back propagation neural network;rule evaluation measures: a unifying view;improving part-of-speech disambiguation rules by adding linguistic knowledge;on sufficient conditions for learnability of logic programs from positive data;a bounded search space of clausal theories;discovering new knowledge from graph data using inductivelogicprogramming;analogical prediction;generalizing refinement operators to learn prenex conjunctive normal forms;theory recovery;instance based function learning;some properties of inverse resolution in normal logic programs;an assessment of ilp-assisted models for toxicology and the PTE-3 experiment.
this book constitutes the refereed proceedings of the 8thinternationalconference on inductivelogicprogramming, ilp-98, held in Madison, Wisconsin, USA, in July 1998.;the 27 revised full papers presented together w...
详细信息
ISBN:
(数字)9783540690597
ISBN:
(纸本)9783540647386
this book constitutes the refereed proceedings of the 8thinternationalconference on inductivelogicprogramming, ilp-98, held in Madison, Wisconsin, USA, in July 1998.;the 27 revised full papers presented together withthe abstracts of three invited talks were carefully reviewed and selected for inclusion in the book. All relevant aspects of inductivelogicprogramming are covered ranging from theory to implementations and applications.
Understanding the inuences between components of dynamical systems such as biological networks, cellular automata or social networks provides insights to their dynamics. Inuences of such dynamical sys- tems can be rep...
详细信息
暂无评论