Search Results - Inner Mongolia University Library

Visually Guiding and Controlling the Search While Mining Chemical Structures

10th International Work-Conference on Artificial Neural Networks (IWANN 2009)

Authors: Pereira, Max; Costa, Vitor Santos; Camacho, Rui; Fonseca, Nuno A. Affiliations: Univ Porto LIAAD INESC Porto LA & FEUP Rua Dr Roberto Frias S-N P-4200465 Oporto Portugal; Univ Porto CRACS INSEC Porto LA Porto Portugal; Univ Porto Inst Biol Mol Celuar Porto Portugal

ISBN: (print) 9783642024801

In this paper we present the work in progress on LogCHEM, an ILP-based tool for discriminative interactive mining of chemical fragments. In particular, we describe the integration with molecule visualisation software that allows the chemist to graphically control the search for interesting patterns in chemical fragments. Furthermore, we show how structured information, such as rings and functional groups like carboxyl, amine, methyl, ester, etc., is integrated and exploited in LogCHEM.

Keywords: inductive logic programming; drug design

Induction on Failure: Learning Connected Horn Theories

10th International Conference on Logic Programming and Nonmonotonic Reasoning

Authors: Kimber, Tim; Broda, Krysia; Russo, Alessandra. Affiliation: Univ London Imperial Coll Sci Technol & Med London SW7 2AZ England

ISBN: (print) 9783642042379

Several learning systems based on Inverse Entailment (IE) have been proposed, some that compute single clause hypotheses, exemplified by Progol, and others that produce multiple clauses in response to a single seed example. A common denominator of these systems is a restricted hypothesis search space, within which each clause must individually explain some example E, or some member of an abductive explanation for E. This paper proposes a new IE approach, called Induction on Failure (IoF), that generalises existing Horn clause learning systems by allowing the computation of hypotheses within a larger search space, namely that of Connected Theories. A proof procedure for IoF is proposed that generalises existing IE systems and also resolves Yamamoto's example. A prototype implementation is also described. Finally, a semantics called Connected Theory Generalisation is presented, which is proved to extend Kernel Set Subsumption and to include hypotheses constructed within this new IoF approach.

Keywords: inductive logic programming; Inverse Entailment; Abduction

Learning ontological rules to extract multiple relations of genic interactions from text

INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2009, vol. 78, no. 12, pp. E31-E38

Authors: Manine, Alain-Pierre; Alphonse, Erick; Bessieres, Philippe. Affiliations: Univ Paris 13 Inst Galilee Lab Informat Paris Nord CNRS LIPN UMR 7030 F-93430 Villetaneuse France; INRA Unite Math Informat & Genome MIG UR1077 F-78352 Jouy En Josas France

Introduction: Information extraction (IE) systems have been proposed in recent years to extract genic interactions from bibliographical resources. They are limited to single interaction relations, and have to face a trade-off between recall and precision, by focusing either on specific interactions (for precision), or general and unspecified interactions of biological entities (for recall). Yet, biologists need to process more complex data from literature, in order to study biological pathways. An ontology is an adequate formal representation to model this sophisticated knowledge. However, the tight integration of IE systems and ontologies is still a current research issue, a fortiori with complex ones that go beyond hierarchies. Method: We propose a rich modeling of genic interactions with an ontology, and show how it can be used within an IE system. The ontology is seen as a language specifying a normalized representation of text. First, IE is performed by extracting instances from natural language processing (NLP) modules. Then, deductive inferences on the ontology language are completed, and new instances are derived from previously extracted ones. Inference rules are learnt with an inductive logic programming (ILP) algorithm, using the ontology as the hypothesis language, and its instantiation on an annotated corpus as the example language. Learning is set in a multi-class setting to deal with the multiple ontological relations. Results: We validated our approach on an annotated corpus of gene transcription regulations in the Bacillus subtilis bacterium. We reach a global recall of 89.3% and a precision of 89.6%, with high scores for the ten semantic relations defined in the ontology. (C) 2009 Elsevier B.V. All rights reserved.

Keywords: Information Extraction; Ontology; Machine Learning; Genic Interactions; inductive logic programming

ALLPAD: approximate learning of logic programs with annotated disjunctions

MACHINE LEARNING, 2008, vol. 70, no. 2-3, pp. 207-223

Author: Riguzzi, Fabrizio. Affiliation: Univ Ferrara Dipartimento Ingn I-44100 Ferrara Italy

Logic Programs with Annotated Disjunctions (LPADs) provide a simple and elegant framework for representing probabilistic knowledge in logic programming. In this paper we consider the problem of learning ground LPADs starting from a set of interpretations annotated with their probability. We present the system ALLPAD for solving this problem. ALLPAD modifies the previous system LLPAD in order to tackle real-world learning problems more effectively. This is achieved by looking for an approximate solution rather than a perfect one. A number of experiments have been performed on real and artificial data to evaluate ALLPAD, showing the feasibility of the approach.

Keywords: inductive logic programming; probabilistic logic programming; statistical relational learning; logic programs with annotated disjunctions

Learning and generalising semantic knowledge from object scenes

ROBOTICS AND AUTONOMOUS SYSTEMS, 2008, vol. 56, no. 11, pp. 891-900

Authors: D'Este, Claire; Sammut, Claude. Affiliation: Univ New S Wales ARC Ctr Excellence Autonomous Syst Sch Comp Sci & Engn Sydney NSW 2052 Australia

The robot described in this paper learns words that relate to objects and their attributes, and also learns concepts, which may be recursive, that involve relationships between several objects. Once the system is explicitly taught some words by a human teacher, it finds new objects that might help to refine its concepts. Once it has found a new object, it tries to generalise its concepts to include the new object and asks the teacher for feedback. The robot learns further properties of objects by interacting with them, by touching them or walking around them to gain a new perspective. The system learns semantic knowledge from spoken interactions, using speech recognition and generation, motion segmentation, feature extraction from images using Ripple Down Rules and generalisation using inductive logic programming. (C) 2008 Elsevier B.V. All rights reserved.

Keywords: Human-Robot Communication; inductive logic programming; Ripple Down Rules; Word learning; Concept learning

Structured machine learning: the next ten years

MACHINE LEARNING, 2008, vol. 73, no. 1, pp. 3-23

Authors: Dietterich, Thomas G.; Domingos, Pedro; Getoor, Lise; Muggleton, Stephen; Tadepalli, Prasad. Affiliations: Oregon State Univ Corvallis OR 97331 USA; Univ Washington Seattle WA 98195 USA; Univ Maryland College Pk MD 20742 USA; Univ London Imperial Coll Sci Technol & Med London England

The field of inductive logic programming (ILP) has made steady progress, since the first ILP workshop in 1991, based on a balance of developments in theory, implementations and applications. More recently there has been an increased emphasis on Probabilistic ILP and the related fields of Statistical Relational Learning (SRL) and Structured Prediction. The goal of the current paper is to consider these emerging trends and chart out the strategic directions and open problems for the broader area of structured machine learning for the next 10 years.

Keywords: inductive logic programming; relational learning; statistical relational learning; structured machine learning

Learning directed probabilistic logical models: ordering-search versus structure-search

ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2008, vol. 54, no. 1-3, pp. 99-133

Authors: Fierens, Daan; Ramon, Jan; Bruynooghe, Maurice; Blockeel, Hendrik. Affiliation: Katholieke Univ Leuven Dept Comp Sci B-3001 Louvain Belgium

We discuss how to learn non-recursive directed probabilistic logical models from relational data. This problem has been tackled before by upgrading the structure-search algorithm initially proposed for Bayesian networks. In this paper we show how to upgrade another algorithm for learning Bayesian networks, namely ordering-search. For Bayesian networks, ordering-search was found to work better than structure-search. It is non-obvious that these results carry over to the relational case, however, since there ordering-search needs to be implemented quite differently. Hence, we perform an experimental comparison of these upgraded algorithms on four relational domains. We conclude that also in the relational case ordering-search is competitive with structure-search in terms of quality of the learned models, while ordering-search is significantly faster.

Keywords: Statistical relational learning; Probabilistic logical models; inductive logic programming; Bayesian networks; Probability trees; Structure learning

Sequential data mining: A comparative case study in development of atherosclerosis risk factors

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, vol. 38, no. 1, pp. 3-15

Authors: Klema, Jiri; Novakova, Lenka; Karel, Filip; Stepankova, Olga; Zelezny, Filip. Affiliations: Czech Tech Univ Dept Cybernet Fac Elect Engn Prague 16627 Czech Republic; Czech Tech Univ Intelligent Data Anal Res Grp Gerstner Lab Prague 16627 Czech Republic

Sequential data represent an important source of potentially new medical knowledge. However, this type of data is rarely provided in a format suitable for immediate application of conventional mining algorithms. This paper summarizes and compares three different sequential mining approaches based, respectively, on windowing, episode rules, and inductive logic programming. Windowing is one of the essential methods of data preprocessing. Episode rules represent general sequential mining, while inductive logic programming extracts first-order features whose structure is determined by background knowledge. The three approaches are demonstrated and evaluated in terms of a case study STULONG. It is a longitudinal preventive study of atherosclerosis where the data consist of a series of long-term observations recording the development of risk factors and associated conditions. The intention is to identify frequent sequential/temporal patterns. Possible relations between the patterns and an onset of any of the observed cardiovascular diseases are also studied.

Keywords: anachronism; episode rules; inductive logic programming; temporal pattern; trend analysis; windowing

Multi-Dimensional Relational Sequence Mining

FUNDAMENTA INFORMATICAE, 2008, vol. 89, no. 1, pp. 23-43

Authors: Esposito, Floriana; Di Mauro, Nicola; Basile, Teresa M. A.; Ferilli, Stefano. Affiliation: Univ Bari Dipartimento Informat I-70125 Bari Italy

This paper addresses the discovery of frequent multi-dimensional patterns from relational sequences. The great variety of applications of sequential pattern mining, such as user profiling, medicine, local weather forecasting and bioinformatics, makes this problem one of the central topics in data mining. Nevertheless, sequential information may concern data on multiple dimensions and, hence, the mining of sequential patterns from multi-dimensional information becomes very important. In a multi-dimensional sequence each event depends on more than one dimension, such as in spatio-temporal sequences where an event may be spatially or temporally related to other events. In the literature, the multi-relational data mining approach has been successfully applied to knowledge discovery from complex data. However, no existing contribution handles the general case of multi-dimensional data in which, for example, spatial and temporal information may co-exist. This work takes into account the possibility of mining complex patterns, expressed in a first-order language, in which events may occur along different dimensions. Specifically, multi-dimensional patterns are defined as a set of atomic first-order formulae in which events are explicitly represented by a variable and the relations between events are represented by a set of dimensional predicates. A complete framework and an inductive logic programming algorithm to tackle this problem are presented, along with some experiments on artificial and real multi-dimensional sequences proving its effectiveness.

Keywords: Multi-relational sequence mining; inductive logic programming; Sequence analysis

Learning relational descriptions of differentially expressed gene groups

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, vol. 38, no. 1, pp. 16-25

Authors: Trajkovski, Igor; Zelezny, Filip; Lavrac, Nada; Tolar, Jakub. Affiliations: Jozef Stefan Inst Dept Knowledge Technol Ljubljana 1000 Slovenia; Czech Tech Univ Gerstner Lab Intelligent Data Anal Res Grp CR-16635 Prague 6 Czech Republic; Univ Nova Gorica SL-5000 Nova Gorica Slovenia; Univ Minnesota Div Hematol Oncol & Blood & Marrow Transplantat Minneapolis MN 55455 USA

This paper presents a method that uses gene ontologies (GOs), together with the paradigm of relational subgroup discovery, to find compactly described groups of genes differentially expressed in specific cancers. The groups are described by means of relational logic features, extracted from publicly available GO information, and are straightforwardly interpretable by medical experts. We applied the proposed method to three gene expression data sets with the following respective sets of sample classes: 1) acute lymphoblastic leukemia (ALL) versus acute myeloid leukemia (AML); 2) seven subtypes of ALL; and 3) 14 different types of cancers. A significant number of the discovered groups of genes had a description that highlighted the underlying biological process responsible for distinguishing one class from the other classes. The quality of the discovered descriptions was also verified by cross validation. We believe that the presented approach will significantly contribute to the application of relational machine learning to gene expression analysis, given the expected increase in both the quality and quantity of gene/protein annotations in the near future.

Keywords: inductive logic programming; learning from structured data; learning in bioinformatics; microarray data analysis; relational learning; scientific discovery
