One challenge faced by many inductivelogicprogramming (ilp) systems is poor scalability to problems with large search spaces and many examples. Randomized search methods such as stochastic clause selection (SCS) and...
详细信息
ISBN:
(纸本)3540229418
One challenge faced by many inductivelogicprogramming (ilp) systems is poor scalability to problems with large search spaces and many examples. Randomized search methods such as stochastic clause selection (SCS) and rapid random restarts (RRR) have proven somewhat successful at addressing this weakness. However, on datasets where hypothesis evaluation is computationally expensive, even these algorithms may take unreasonably long to discover a good solution. We attempt to improve the performance of these algorithms on datasets by learning an approximation to ilp hypothesis evaluation. We generate a small set of hypotheses, uniformly sampled from the space of candidate hypotheses, and evaluate this set on actual data. these hypotheses and their corresponding evaluation scores serve as training data for learning an approximate hypothesis evaluator. We outline three techniques that make use of the trained evaluation-function approximator in order to reduce the computation required during an ilp hypothesis search. We test our approximate clause evaluation algorithm using the popular ilp system Aleph. Empirical results are provided on several benchmark datasets. We show that the clause evaluation function can be accurately approximated.
the generation and testing of hypotheses is widely considered to be the primary method by which Science progresses. So much so, that it is still common to find a scientific proposal or an intellectual argument damned ...
详细信息
ISBN:
(纸本)3540229418
the generation and testing of hypotheses is widely considered to be the primary method by which Science progresses. So much so, that it is still common to find a scientific proposal or an intellectual argument damned on the grounds that it has no hypothesis being tested, it is merely a fishing expedition, and so on. Extreme versions run if there is no hypothesis, it is not Science, the clear implication being that hypothesis-driven programmes (as opposed to data-driven studies) are the only contributor to the scientific endeavour. this misrepresents how knowledge and understanding are actually generated from the study of natural phenomena and laboratory experiments. Hypothesis-driven and inductive modes of reasoning are not competitive, but complementary, and both are required in post-genomic biology.
the LOGAN-H system is a bottom up ilp system for learning multi-clause and multi-predicate function free Horn expressions in the framework of learning from interpretations. the paper introduces a new implementation of...
详细信息
ISBN:
(纸本)3540229418
the LOGAN-H system is a bottom up ilp system for learning multi-clause and multi-predicate function free Horn expressions in the framework of learning from interpretations. the paper introduces a new implementation of the same base algorithm which gives several orders of magnitude speedup as well as extending the capabilities of the system. New tools include several fast engines for subsumption tests, handling real valued features, and pruning. We also discuss using data from the standard ilp setting in our framework, which in some cases allows for further speedup. the efficacy of the system is demonstrated on several ilp datasets.
Recently there has been growing interest both to extend ilp to description logics and to apply it to knowledge discovery in databases. In this paper we present a novel approach to association rule mining which deals w...
详细信息
Recently there has been growing interest both to extend ilp to description logics and to apply it to knowledge discovery in databases. In this paper we present a novel approach to association rule mining which deals with multiple levels of description granularity. It relies on the hybrid language AL-log which allows a unified treatment of boththe relational and structural features of data. A generality order and a downward refinement operator for AL-log pattern spaces is defined on the basis of query subsumption. this framework has been implemented in SPADA, an ilp system for mining multi-level association rules from spatial data. As an illustrative example, we report experimental results obtained by running the new version of SPADA on geo-referenced census data of Manchester Stockport.
In inductivelogicprogramming (ilp), algorithms which are purely of the bottom-up or top-down type encounter several problems in practice. Since a majority of them axe greedy ones, these algorithms find clauses in lo...
详细信息
ISBN:
(纸本)3540229418
In inductivelogicprogramming (ilp), algorithms which are purely of the bottom-up or top-down type encounter several problems in practice. Since a majority of them axe greedy ones, these algorithms find clauses in local optima, according to the "quality" measure used for evaluating the results. Moreover, when learning clauses one by one, induced clauses become less interesting to cover few remaining examples. In this paper, we propose a simulated annealing framework to overcome these problems. Using a refinement operator, we define neighborhood relations on clauses and on hypotheses (i.e. sets of clauses). Withthese relations and appropriate quality measures, we show how to induce clauses (in a coverage approach), or to induce hypotheses directly by using simulated annealing algorithms. We discuss the necessary conditions on the refinement operators and the evaluation measures in order to increase the algorithm's effectivity. Implementations are described and experimentation results are presented.
logic Programs with Annotated Disjunctions (LPADs) provide a simple and elegant framework for integrating probabilistic reasoning and logicprogramming. In this paper we propose an algorithm for learning LPADs. the le...
详细信息
ISBN:
(纸本)3540229418
logic Programs with Annotated Disjunctions (LPADs) provide a simple and elegant framework for integrating probabilistic reasoning and logicprogramming. In this paper we propose an algorithm for learning LPADs. the learning problem we consider consists in starting from a sets of interpretations annotated withtheir probability and finding one (or more) LPAD that assign to each interpretation the associated probability. the learning algorithm first finds all the disjunctive clauses that are true in all interpretations, then it assigns to each disjunct in the head a probability and finally decides how to combine the clauses to form an LPAD by solving a constraint satisfaction problem. We show that the learning algorithm is correct and complete.
the integration of inductivelogicprogramming (ilp) and Bottlenose Dolphin Optimization (BDO) in this research addresses a pressing issue in today's information-saturated landscape: the proliferation of fake news...
详细信息
ISBN:
(纸本)9798350348798;9798350348804
the integration of inductivelogicprogramming (ilp) and Bottlenose Dolphin Optimization (BDO) in this research addresses a pressing issue in today's information-saturated landscape: the proliferation of fake news. In an era where misleading information can spread rapidly, traditional methods often fall short in effectively identifying deceptive *** combat this challenge, our approach harnesses the synergies of ilp and BDO. ilp plays a crucial role in constructing logical rules that capture intricate relationships within news data. By doing so, it delves deep into the content, seeking out patterns and inconsistencies that may not be obvious at first glance. BDO, on the other hand, takes inspiration from the social behavior of bottlenose dolphins to optimize the process of generating these rules. Just as dolphins collaborate and communicate to solve complex problems, BDO helps refine the logical rules for better accuracy. Ultimately, this research underscores the potential of bio-inspired optimization, such as BDO, combined withthe precision of logicprogramming (ilp) to strengthen the integrity of information dissemination platforms. In an age where the veracity of information is paramount, this innovative approach offers a promising solution to combat the spread of fake news and promote the dissemination of authentic, reliable information.
Probabilistic inductivelogicprogramming (Pilp) is a relatively unexplored area of Statistical Relational Learning which extends classic inductivelogicprogramming (ilp). Within this scope, we introduce SkILL, a Sto...
详细信息
ISBN:
(纸本)9781509002870
Probabilistic inductivelogicprogramming (Pilp) is a relatively unexplored area of Statistical Relational Learning which extends classic inductivelogicprogramming (ilp). Within this scope, we introduce SkILL, a Stochastic inductivelogic Learner, which takes probabilistic annotated data and produces First Order logic (FOL) theories. Data in several domains such as medicine and bioinformatics have an inherent degree of uncertainty, and because SkILL can handle this type of data, the models produced for these areas are closer to reality. SkILL can then use probabilistic data to extract non-trivial knowledge from databases, and also address efficiency issues by introducing an efficient search strategy for finding hypotheses in Pilp environments. SkILL's capabilities are demonstrated using a real world medical dataset in the breast cancer domain.
In this paper, we study how a logical form of scientific modelling that integrates together abduction and induction can be used to understand the functional class of unknown enzymes or inhibitors. We show how we can m...
详细信息
ISBN:
(纸本)3540229418
In this paper, we study how a logical form of scientific modelling that integrates together abduction and induction can be used to understand the functional class of unknown enzymes or inhibitors. We show how we can model, within Abductive logicprogramming (ALP), inhibition in metabolic pathways and use abduction to generate facts about inhibition of enzymes by a particular toxin (e.g. Hydrazine) given the underlying metabolic pathway and observations about the concentration of metabolites. these ground facts, together with biochemical background information, can then be generalised by ilp to generate rules about the inhibition by Hydrazine thus enriching further our model. In particular, using Progol 5.0 where the processes of abduction and inductive generalization are integrated enables us to learn such general rules. Experimental results on modelling in this way the effect of Hydrazine in a real metabolic pathway are presented.
there are two types of formalization for induction in logic. In descriptive induction, induced hypotheses describe rules with respect to observations with all predicates minimized. In explanatory induction, on the oth...
详细信息
ISBN:
(纸本)3540229418
there are two types of formalization for induction in logic. In descriptive induction, induced hypotheses describe rules with respect to observations with all predicates minimized. In explanatory induction, on the other hand, hypotheses abductively account for observations without any minimization principle. Bothinductive methods have strength and weakness, which are complementary to each other. In this work, we unify these two logical approaches. In the proposed framework, not all predicates are minimized but minimality conditions can be flexibly determined as a circumscription policy. Constructing appropriate policies, we can intentionally minimize models of an augmented axiom set. As a result, induced hypotheses can have both conservativeness and explainability, which have been considered incompatible with each other in the literature. We also give two procedures to compute inductive hypotheses in the proposed framework.
暂无评论