We propose a three-step technique to achieve this purpose. First, we utilize a collection of XML namespaces organized into hierarchical structure as a medium for expressing data semantics. Second, we define the format...
详细信息
We propose a three-step technique to achieve this purpose. First, we utilize a collection of XML namespaces organized into hierarchical structure as a medium for expressing data semantics. Second, we define the format of resource descriptor for the information source discovery scheme so that we can dynamically register and/or deregister the web data sources on the fly. Third, we employ an inverted-index mechanism to identify the subset of information sources that are relevant to a particular user query. We describe the design, architecture, and implementation of our approach—IWDS, and illustrate its use through case examples.
Key words integration - heterogeneity - web data source - XML namespace
CLC number TP 311.13
Foundation item: Supported by the National Key Technologies R&D Program of China(2002BA103A04)
Biography: WU Wei (1975-), male, Ph.D candidate, research direction: information integration, distribute computing
There are a lot of valuable data on the web that users may need to improve their decision making process. The answer of user query often exists in multiple web data sources. The extraction and combination of answers o...
详细信息
ISBN:
(纸本)9781424426232
There are a lot of valuable data on the web that users may need to improve their decision making process. The answer of user query often exists in multiple web data sources. The extraction and combination of answers of user queries from different web data sources often fails because of syntax and semantic heterogeneity between user queries and web data sources. The retrieval and extraction of these answers from the different web data sources impose a need for queries to be syntactically and semantically mapped to websources query languages. In this paper, we focus on semantic heterogeneity among user query terms and web data sources terms. We propose an approach for the semantic mapping of user query terms to web data sources terms.
We propose a three-step technique to achieve this purpose. First, we utilize a collection of XML namespaces organized into hierarchical structure as a medium for expressing data semantics. Second, we define the format...
详细信息
We propose a three-step technique to achieve this purpose. First, we utilize a collection of XML namespaces organized into hierarchical structure as a medium for expressing data semantics. Second, we define the format of resource descriptor for the information source discovery scheme so that we can dynamically register and/or deregister the web data sources on the fly. Third. we employ an inverted-index mechanism to identify the subset of information sources that are relevant to a particular user query. We describe the design. architecture, and implementation of our approach IWDS, and illustrate its use through case examples.
We propose a three-step technique to achieve this purpose. First, we utilize a collection of XML namespaces organized into hierarchical structure as a medium for expressing data semantics. Second, we define the format...
详细信息
We propose a three-step technique to achieve this purpose. First, we utilize a collection of XML namespaces organized into hierarchical structure as a medium for expressing data semantics. Second, we define the format of resource descriptor for the information source discovery scheme so that we can dynamically register and/or deregister the web data sources on the fly. Third, we employ an inverted-index mechanism to identify the subset of information sources that are relevant to a particular user query. We describe the design, architecture, and implementation of our approach--IWDS. and illustrate its use through case examples.
There are hundreds or thousands of web data sources providing data of relevance to a particular domain on the web, so how to find a suitable set of sources quickly to integrate from a number of sources is becoming mor...
详细信息
暂无评论