ISBN: 9781586037154 (print)
The two-hemisphere model driven (2HMD) approach assumes the modeling and use of procedural and conceptual knowledge on an equal and related basis, in accordance with the principles of Model Driven Architecture (MDA), which separates different aspects of system modeling. This differentiates the 2HMD approach from purely procedural, purely conceptual, and object-oriented approaches. The approach may be applied to modeling a particular business domain as well as to modeling the knowledge about that domain. Therefore, the principles of MDA, via the 2HMD approach, may be applied not only to software development but also to study course and program development. Knowledge modeling with the 2HMD approach makes it possible to transparently analyze and compare the knowledge to be provided with the knowledge actually provided by the courses of a particular study program, and thus to identify and fill gaps between the desired and the actual knowledge content of the study program.
ISBN: 3540292446 (print)
The proceedings contain 75 papers. The topics discussed include: statistical relational learning: an inductive logic programming perspective; recent advances in mining time series data; data streams and data synopses for massive data sets; agglomerative hierarchical clustering with constraints: theoretical and empirical results; a correspondence between maximal complete bipartite subgraphs and closed patterns; mining model trees from spatial data; knowledge discovery from user preferences in conversational recommendation; non-stationary environment compensation using sequential EM algorithm for robust speech recognition; and a kernel-based method for discovering market segments in beef meat.
ISBN: 3540231080 (print)
The proceedings contain 64 papers. The special focus in this conference is on knowledge discovery in databases. The topics include: real-world learning with Markov logic networks; mining positive and negative association rules; an experiment on knowledge discovery in chemical databases; shape and size regularization in expectation maximization and fuzzy clustering; reducing data stream sliding windows by cyclic tree-like histograms; a framework for data mining pattern management; spatial associative classification at different levels of granularity; parameter-free graph partitioning and outlier detection; a tree-based approach to clustering XML documents by structure; discovery of regulatory connections in microarray data; comparison of classifiers given little training; document classification through interactive supervision of document and term labels; finding interesting pass patterns from soccer game records; summarization of dynamic content in web collections; incremental nonlinear PCA for classification; constraint-based mining of episode rules and optimal window sizes; using a hash-based method for apriori-based graph mining; classification in geographical information systems; digging into acceptor splice site prediction; asynchronous and anticipatory filter-stream based parallel algorithm for frequent itemset mining; density-based spatial clustering in the presence of obstacles and facilitators; dealing with predictive-but-unpredictable attributes in noisy data sources; a hierarchical clustering engine for web-page snippets; a tolerance rough set approach to clustering web search results; mining history of changes to web access patterns; and visual mining of spatial time series data.
ISBN: 3540292446 (print)
With the fast expansion of computer networks, studying data mining on heterogeneous databases has become inevitable. In this paper we propose MDBM, an accurate and efficient approach for classification on multiple heterogeneous databases. Inter-database links serve as bridges for information transfer, but because they are detected automatically they may or may not be useful, or even valid; we therefore propose a regression-based method for predicting their usefulness. Because of the high cost of inter-database communication, MDBM employs a new strategy for cross-database classification, which finds and performs actions with high benefit-to-cost ratios. The experiments show that MDBM achieves high accuracy in cross-database classification, with much higher efficiency than previous approaches.
We present a case study on the discovery of clinically relevant domain knowledge in the field of HIV drug resistance. Novel mutations in the HIV genome associated with treatment failure were identified by mining a rel...
ISBN: 3540231080 (print)
In this paper, we present an experiment on knowledge discovery in chemical reaction databases. Chemical reactions are the main elements on which synthesis in organic chemistry relies, which is why chemical reaction databases are of primary importance. From a problem-solving point of view, synthesis in organic chemistry must be considered at several levels of abstraction: mainly a strategic level, where general synthesis methods are involved, and a tactical level, where actual chemical reactions are applied. The research work presented in this paper aims at discovering general synthesis methods from chemical reaction databases in order to design generic and reusable synthesis plans. The knowledge discovery process relies on levelwise frequent itemset search and association rule extraction, but also on chemical knowledge involved in every step of the process. Moreover, the overall process is supervised by a domain expert. The principles of this original experiment on mining chemical reaction databases and its results are detailed and discussed.
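The levelwise frequent itemset search and association rule extraction mentioned in the abstract can be illustrated with a minimal generic sketch. It is a textbook Apriori-style procedure, not the authors' implementation: the function names, the absolute support threshold, and the placeholder item labels are assumptions, and no chemical knowledge or expert supervision is modeled.

```python
from itertools import combinations

def frequent_itemsets(transactions, min_support):
    """Levelwise search: frequent k-itemsets generate candidate (k+1)-itemsets.

    min_support is an absolute transaction count (an assumption of this sketch).
    Returns a dict mapping each frequent itemset (frozenset) to its support.
    """
    transactions = [frozenset(t) for t in transactions]

    def support(itemset):
        return sum(1 for t in transactions if itemset <= t)

    # Level 1: frequent single items.
    items = {i for t in transactions for i in t}
    level = {frozenset([i]) for i in items}
    level = {s for s in level if support(s) >= min_support}

    frequent = {}
    while level:
        for s in level:
            frequent[s] = support(s)
        k = len(next(iter(level))) + 1
        # Join step: unions of two frequent itemsets that have exactly k items.
        candidates = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: every (k-1)-subset must already be frequent.
        level = {
            c for c in candidates
            if all(frozenset(sub) in frequent for sub in combinations(c, k - 1))
            and support(c) >= min_support
        }
    return frequent

def association_rules(frequent, min_confidence):
    """Extract rules lhs -> rhs with confidence = supp(lhs | rhs) / supp(lhs)."""
    rules = []
    for itemset, supp in frequent.items():
        for r in range(1, len(itemset)):
            for lhs in combinations(itemset, r):
                lhs = frozenset(lhs)
                # lhs is frequent by downward closure, so the lookup succeeds.
                conf = supp / frequent[lhs]
                if conf >= min_confidence:
                    rules.append((lhs, itemset - lhs, conf))
    return rules

# Placeholder items "a", "b", "c"; these are not chemical data.
transactions = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}]
frequent = frequent_itemsets(transactions, min_support=3)
rules = association_rules(frequent, min_confidence=0.75)
```

In this toy data every single item and every pair is frequent, while the triple is not, so the search stops after the second level.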
ISBN: 3540231080 (print)
A new operator called skyline [3] has recently attracted interest; it returns the objects that are not dominated by any other object with regard to certain measures in a multi-dimensional space. Recent work on the skyline operator [3, 15, 8, 13, 2] focuses on efficient computation of skylines in large databases. However, such work gives users only thin skylines, i.e., single objects, which may not be desirable in some real applications. In this paper, we propose a novel concept, called thick skyline, which recommends not only skyline objects but also their nearby neighbors within epsilon-distance. Efficient computation methods are developed, including (1) two efficient algorithms, Sampling-and-Pruning and Indexing-and-Estimating, to find such a thick skyline with the help of statistics or indexes in large databases, and (2) a highly efficient Microcluster-based algorithm for mining the thick skyline. The Microcluster-based method not only leads to substantial savings in computation but also provides a concise representation of the thick skyline in the case of high cardinalities. Our experimental performance study shows that the proposed methods are both efficient and effective.
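The definitions of skyline and thick skyline above can be made concrete with a naive quadratic sketch. It is not one of the paper's algorithms (Sampling-and-Pruning, Indexing-and-Estimating, or the Microcluster-based method); it assumes that smaller values are better in every dimension and that epsilon-distance is Euclidean, and the function names and sample points are hypothetical.

```python
from math import dist  # Euclidean distance, Python 3.8+

def dominates(a, b):
    """a dominates b if a is no worse in every dimension and strictly better in one.

    Assumption of this sketch: smaller values are better.
    """
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def skyline(points):
    """Return the points that no other point dominates (naive O(n^2) scan)."""
    return [p for p in points if not any(dominates(q, p) for q in points)]

def thick_skyline(points, eps):
    """Map each skyline point to its neighbors within distance eps."""
    return {
        s: [p for p in points if p != s and dist(p, s) <= eps]
        for s in skyline(points)
    }

# Hypothetical 2-D objects, e.g. (price, distance); not data from the paper.
points = [(1, 5), (2, 2), (5, 1), (2.5, 2.5), (6, 6)]
sky = skyline(points)
thick = thick_skyline(points, eps=1.0)
```

Here `(2.5, 2.5)` is dominated by `(2, 2)` and so is not a skyline object, but it lies within distance 1 of `(2, 2)` and is therefore recommended as part of the thick skyline.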
A continuity is a kind of inter-transaction association which describes the relationships among different transactions. Since it breaks the boundaries of transactions, the number of potential itemsets and the number o...