检索结果-内蒙古大学图书馆

An ontology-based approach for resolving semantic schema conflicts in the extraction and integration of query-based information from heterogeneous web data sources 07

An ontology-based approach for resolving semantic schema con...

引用

Proceedings of the Third Australasian Workshop on Advances in Ontologies - Volume 85

作者： Abdolreza Hajmoosaei Sameem Abdul-Kareem University of Malaya Kuala lumpur Malaysia

ISBN: (纸本)9781920682668

There are many external resources and heterogeneous data on the internet that an organization or user may need to improve the decision making process. It is therefore, very important and critical that this information are complete, precise and can be acquired on time. Most web sources provide data in semi-structured form on the internet. The combination of semi-structured data from different sources on the internet often fails because of syntactic and semantic differences. The access, retrieval and utilization of information from the different web data sources impose a need for the data to be integrated. integration of web data is a complex process because of the heterogeneity nature of web data and thus needs some kind of a web data integration system. There are many types of heterogeneity and differences among web sources that makes data integration a difficult process (e.g., different data model, different syntax and semantics in schema and data instance level among web sources). Semantic schema heterogeneity, which refers to the misinterpretation of data at the schema level, is one major obstacle that needs to be overcome in web data integration process. Semantic schema heterogeneity has been identified as one of the most important problems when dealing with interoperability and cooperation among multiple data sources on the internet. In this paper, we recommend a system architecture for web data integration focusing on resolving the problems of semantic schema heterogeneity between web data sources. We propose an ontology-based approach as a solution for the reconciliation of semantic conflicts between web data at the schema level.

关键词： semantic schema heterogeneity ontology web data integration

来源：评论

学校读者我要写书评

暂无评论

XHMG: Content-based web hypermedia modeling and retrieval system

XHMG: Content-based Web hypermedia modeling and retrieval sy...

引用

IEEE International Conference on Granular Computing

作者： Radev, Ivan S. N Carolina State Univ Dept Math & Comp Sci Orangeburg SC 29117 USA

ISBN: (纸本)142440133X

Many studies concentrate on developing attractive web applications, but very few discuss the fundamental problems of modeling, integration and retrieval of web hypermedia data from heterogeneous data sources based on its content and semantics. The main focus of this paper is the modeling facilities in the XHMG system for content-based representation, integration and retrieval of heterogeneous web data. The paper shows the basic XHMG structural instruments for web and web page content representation. The most important application of this approach is handling and integrating the hypermedia information in the web based on its content and meaning. The research in this paper will have a potentially large impact on the technologies used in information sources for e-business, e-advertising, e-commerce, e-government, e-learning, portals, digital libraries, web search engines, online catalogs.

关键词： web content-based modeling web content-based search and retrieval web multimedia web data integration

来源：评论

学校读者我要写书评

暂无评论

Integrating Multi-Source web Records into Relational database

引用

Wuhan University Journal of Natural Sciences 2006年第5期11卷 1177-1181页

作者： HUANG Jianbin JI Hongbing SUN Heli School of Electronic Engineering Xidian UniversitylXi＇an 710071 Shaanxi China School of Computer Science Xidian UniversityXi＇an 710071 Shaanxi China Department of Computer Science and Technology Xi＇an Jiaotong University Xi＇an 710049 Shaanxi China

How to integrate heterogeneous semi-structured web records into relational database is an important and challengeable research topic. An improved model of conditional random fields was presented to combine the learning of labeled samples and unlabeled database records in order to reduce the dependence on tediously hand-labeled training data. The pro- posed model was used to solve the problem of schema matching between data source schema and database schema. Experimental results using a large number of web pages from diverse domains show the novel approach＇s effectiveness.

关键词： web data integration schema matching conditional random fields

来源：评论

学校读者我要写书评

暂无评论

Information extraction for the semantic web

引用

1st International Summer School on Reasoning web

作者： Baumgartner, R Eiter, T Gottlob, G Herzog, M Koch, C Vienna Univ Technol Inst Informat Syst Database & Artificial Intelligence Grp A-1040 Vienna Austria Vienna Univ Technol Inst Informat Syst Knowledge Based Syst Grp A-1040 Vienna Austria

ISBN: (纸本)3540278281

The World Wide web represents a universe of knowledge and information. Unfortunately, it is not straightforward to query and access the desired information. Languages and tools for accessing, extracting, transforming, and syndicating the desired information are required. The web should be useful not merely for human consumption but additionally for machine communication. Therefore, powerful and user-friendly tools based on expressive languages for extracting and integrating information from various different web sources, or in general, various heterogeneous sources are needed. The tutorial gives an introduction to web technologies required in this context, and presents various approaches and techniques used in information extraction and integration. Moreover, sample applications in various domains motivate the discussed topics and providing data instances for the Semantic web is illustrated(1).

关键词： web data extraction semi-structured data wrapper languages and systems web data integration Semantic web

来源：评论

学校读者我要写书评

暂无评论

一种基于XML的web数据集成系统查询分解和优化策略

一种基于XML的Web数据集成系统查询分解和优化策略

引用

第二十二届中国数据库学术会议

作者：张硕李建中熊蜀光王春宇哈尔滨工业大学计算机科学与技术学院

1引言XML(extensible markup language)是一套定义语义标记的规则,它已经逐渐成为一种新的网上数据交换标准。随着WWW上站点提供的信息越来越多,web数据管理有待解决很多新的问题。web数据集成以如何正确有效地集成形式多样的

关键词： web data integration Query decomposition Query optimization XML

来源：评论

学校读者我要写书评

暂无评论

web data retrieval and extraction

引用

data & KNOWLEDGE ENGINEERING 2003年第3期44卷 347-367页

作者： Lacroix, Z Arizona State Univ Dept Comp Sci Tempe AZ 85287 USA

We present the Object-web Mediator to querying integrated web data sources composed of a retrieval component based on an intermediate object view mechanism and search views, and an XML engine. Search views map the source capabilities to attributes defined at object classes, and parsers that process retrieved documents and cache them in XML format. The XML engine queries cached documents, extracts data, and returns extracted data for evaluation. The originality of this approach consists of a generic view mechanism to access data sources with limited data access and complex capabilities, and an XML engine to support data extraction and reorganization. This approach has been developed and demonstrated as part of the multi-database system supporting queries via uniform Object Protocol Model interfaces against public web data sources of interest to the biologists. (C) 2002 Elsevier Science B.V. All rights reserved.

关键词： web data integration retrieval extraction XML mediation web source capability biological data integration

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：