Along with a massive amount of information being placed online, it is a challenge to exploit the internal and external information of documents when assessing similarity between them. A variety of approaches have been...
In this paper, we present a method to trace evolution of trend over multiple data streams and detect the abnormal ones. First of all, a definition of trend for single data stream is provided, the advantage of our defi...
ISBN: 9783540724834 (print)
In this paper, a probabilistic graphical modeling approach for web services is proposed, and the web services Bayesian network (WSBN) is constructed by mining the historical invocations among them. Further, semantic guidance for web service composition is generated based on the Markov blanket and causality reasoning in the WSBN. Preliminary experiments and performance analysis show that our approach is effective and feasible.
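The abstract above relies on the Markov blanket of a node in a Bayesian network, which by the standard definition is the node's parents, its children, and the other parents of those children. The following is a minimal sketch of that definition only, not the authors' WSBN construction or composition method. The service names and edges are hypothetical, and the graph is assumed to be given as a child-to-parents map.

```python
from typing import Dict, Set


def markov_blanket(parents: Dict[str, Set[str]], node: str) -> Set[str]:
    """Return the Markov blanket of `node` in a DAG given as a child -> parents map."""
    # Start with the node's own parents.
    blanket = set(parents.get(node, set()))
    for child, child_parents in parents.items():
        if node in child_parents:
            blanket.add(child)        # each child of the node
            blanket |= child_parents  # the child's parents (co-parents, plus the node itself)
    # The node itself was added via child_parents; it is not part of its own blanket.
    blanket.discard(node)
    return blanket


# Hypothetical service graph (illustrative only): an edge A -> B means that
# invocations of service A influence invocations of service B.
wsbn_parents = {
    "Search": set(),
    "Login": set(),
    "Cart": {"Search", "Login"},
    "Payment": {"Cart"},
    "Shipping": {"Payment"},
}

print(sorted(markov_blanket(wsbn_parents, "Cart")))
print(sorted(markov_blanket(wsbn_parents, "Search")))
```

In this toy graph, the blanket of "Search" includes "Login" because both are parents of "Cart". Given its blanket, a node is conditionally independent of every other node in the network, which is why the blanket is a natural candidate set when looking for related services.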
An improved range data distribution method is proposed, which is suitable for both the homogeneous and heterogeneous database cluster with consideration of full use of different computing resources of nodes. In order ...
In this paper we introduce a probabilistic-reasoning approach to detect web robots (crawlers) from human visitors of web sites. Our approach employs a Naive Bayes network to classify the HTTP sessions of a web-server ...
Query rewriting using views is a technique for answering a query that exploits a set of views instead of accessing the database relations directly. There are two categories of rewritings, i.e., equivalent rewritings u...
What-if analysis is an important type of DSS analysis processing procedure. It analyzes hypothetical scenarios based on historical data. The data cube view must be updated when the what-if condition is changed. Since ...
ISBN: 9783540724834 (print)
As the web grows, more and more data has become available from webpages, such as product items from back-end databases. To provide efficient access to the data objects contained in these pages, data extraction plays an important role. However, identifying suitable webpages to feed the data extraction is a prerequisite and a non-trivial task. As a result, there is an increasing need for methods that can automatically identify the target pages from unknown websites. In this paper, we solve the problem by exploiting the structured-token features of the webpage content and applying a decision-tree-based classification algorithm to induce the structure information. Furthermore, a preliminary recognition of data objects is obtained to efficiently initiate the subsequent data extraction. We evaluate our approach on real-world data and achieve promising results.
ISBN: 9783642006722 (digital)
ISBN: 9783642006715 (print)
This book constitutes the proceedings of the joint international conference APWeb/WAIM 2009, which was held in Suzhou, China, during April 1-4, 2009. The 42 full papers presented together with 26 short papers and the abstracts of 2 keynote speeches were carefully reviewed and selected for inclusion in the book. The topics covered are query processing, topic-based techniques, web data processing, multidimensional data analysis, stream data processing, data mining and its applications, and data management support to advanced applications.
ISBN: 9783642039959 (print)
In this paper, we studied the efficiency and break-even point of storing video objects in a DBMS and proved that storing "small" video objects in the database is a suitable solution. For video objects stored in the database as the BLOB data type, we devised a database-based, time-oriented approach to speed up video content access. Our experiments showed that, because we extracted some system-aware metadata and stored it in the database transparently, the read performance became practical.