版权所有:内蒙古大学图书馆 技术提供:维普资讯• 智图
内蒙古自治区呼和浩特市赛罕区大学西街235号 邮编: 010021
作者机构:School of Electronic Engineering Xidian UniversitylXi'an 710071 Shaanxi China School of Computer Science Xidian UniversityXi'an 710071 Shaanxi China Department of Computer Science and TechnologyXi'an Jiaotong University Xi'an 710049 Shaanxi China
出 版 物:《Wuhan University Journal of Natural Sciences》 (武汉大学学报(自然科学英文版))
年 卷 期:2006年第11卷第5期
页 面:1177-1181页
学科分类:08[工学] 0835[工学-软件工程] 081202[工学-计算机软件与理论] 0812[工学-计算机科学与技术(可授工学、理学学位)]
基 金:Supported by the National Defense Pre-ResearchFoundation of China(4110105018)
主 题:Web data integration schema matching conditional random fields
摘 要:How to integrate heterogeneous semi-structured Web records into relational database is an important and challengeable research topic. An improved model of conditional random fields was presented to combine the learning of labeled samples and unlabeled database records in order to reduce the dependence on tediously hand-labeled training data. The pro- posed model was used to solve the problem of schema matching between data source schema and database schema. Experimental results using a large number of Web pages from diverse domains show the novel approach's effectiveness.