Search query: "Subject term = Web Data Extraction"
97 records; showing 31-40
Unlocking Social Media and User Generated Content as a Data Source for Knowledge Management
INTERNATIONAL JOURNAL OF KNOWLEDGE MANAGEMENT, 2020, vol. 16, no. 1, pp. 101-122
Authors: Meneghello, James; Thompson, Nik; Lee, Kevin; Wong, Kok Wai; Abu-Salih, Bilal
Affiliations: Optika Solut Perth Australia; Curtin Univ Perth WA Australia; Deakin Univ Software Engn & Internet Things IoT Sch Informat Technol Geelong Vic Australia; Murdoch Univ Sch Engn & Informat Technol Murdoch WA Australia; Univ Jordan Amman Jordan
The pervasiveness of social media and user-generated content has triggered an exponential increase in global data. However, due to collection and extraction challenges, data in embedded comments, reviews and testimoni...

Smart algorithmic based web crawling and scraping with template autoupdate capabilities
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, vol. 33, no. 22, pp. e6042-e6042
Authors: Khan, Fazal Qudus; Tsaramirsis, Georgios; Ullah, Naimat; Nazmudeen, Mohamed; Jan, Sadeeq; Ahmad, Awais
Affiliations: King Abdulaziz Univ Fac Comp & IT Dept IT Jeddah Saudi Arabia; Univ Buner Khyber Pakhtunkhwa Pakistan; Univ Technol Brunei Bandar Seri Begawan Brunei; Univ Engn & Technol Peshawar Pakistan; Air Univ Dept Comp Sci Islamabad 44000 Pakistan
Web scraping is the process of extracting data from web pages and it is an essential part for the generation of datasets. Currently the field is dominated by capable commercial applications, however, there is always a...

Development of Browser Extension for HTML Web Page Content Extraction
2nd International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA)
Authors: Karabulut, Murat; Mayda, Islam
Affiliations: Istanbul Esenyurt Univ Bilgisayar Muhendisligi Bolumu Istanbul Turkey; Yildiz Tekn Univ Bilgisayar Muhendisligi Bolumu Istanbul Turkey
As the amount of content on the websites increases, automatic content extraction from web pages becomes more important. Although many studies have been done in the literature on this subject, a method that fully solve...

Articulating the Construction of a Web Scraper for Massive Data Extraction
2nd IEEE International Conference on Electrical, Computer and Communication Technologies (IEEE ICECCT)
Authors: Upadhyay, Shreya; Pant, Vishal; Bhasin, Shivansh; Pattanshetti, Mahantesh K.
Affiliations: Graph Era Deemed Univ Dept Comp Sci & Engn Dehra Dun India; Graph Era Hill Univ Dept Comp Sci & Engn Dehra Dun India
Massive volumes of data are generated by various users, entities, applications and disseminated online. This copious volume of big data is distributed across millions of websites and is available for various applicati...

Web Data Extraction for Developing a Mashup
International MultiConference of Engineers and Computer Scientists (IMECS 2012)
Authors: Chaudhari, Poonam. A.; Paikrao, Rahul. L.
Affiliations: Gokhale Educ Soc COE Nasik India; Amrutvahini COE Sangamner India
Web is a huge reservoir of information. Data available is extremely diversified and abundant. Various types of data can be easily extracted from the Internet, although not all of the data is relevant to the users. Mos...

VB-PTC: Visual Block Multi-Record Text Extraction Based on Sensor Network Page Type Conversion
IEEE ACCESS, 2020, vol. 8, pp. 167900-167913
Authors: Gong, Jibing; Zhang, Hekai; Du, Weixia; Li, Huanhuan; Wen, Hongnian
Affiliations: Yanshan Univ Sch Informat Sci & Engn Qinhuangdao 066004 Hebei Peoples R China; Yanshan Univ Key Lab Comp Virtual Technol & Syst Integrat Hebe Qinhuangdao 066004 Hebei Peoples R China; Yanshan Univ Key Lab Software Engn Hebei Prov Qinhuangdao 066004 Hebei Peoples R China; Shijiazhuang Inst Railway Technol Sch Informat Sci & Engn Shijiazhuang 050041 Peoples R China
Usually, in addition to the main content, web pages contain additional information in the form of noise, such as navigation elements, sidebars and advertisements. This kind of noise has nothing to do with the main con...

Web Scraping: State-of-the-Art and Areas of Application
IEEE International Conference on Big Data (Big Data)
Authors: Diouf, Rabiyatou; Sarr, Edouard Ngor; Sall, Ousmane; Birregah, Babiga; Bousso, Mamadou; Mbaye, Seny Ndiaye
Affiliations: Univ Thies Thies Senegal; UCAO St Michel Dakar Senegal; Univ Technol Troyes Troyes France
Main objective of Web Scraping is to extract information from one or many websites and process it into simple structures such as spreadsheets, database or CSV file. However, in addition to be a very complicated task, ...

RED: Redundancy-Driven Data Extraction from Result Pages
World Wide Web Conference (WWW)
Authors: Guo, Jinsong; Crescenzi, Valter; Furche, Tim; Grasso, Giovanni; Gottlob, George
Affiliations: Univ Oxford Oxford England; Univ Roma Tre Rome Italy; Meltwater San Francisco CA USA; Univ Calabria Calabria Italy; TU Wien Vienna Austria
Data-driven websites are mostly accessed through search interfaces. Such sites follow a common publishing pattern that, surprisingly, has not been fully exploited for unsupervised data extraction yet: the result of a ...

Performance Analysis for Mining Images of Deep Web
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, vol. 11, no. 10, pp. 1-7
Authors: Sabri, Ily Amalina Ahmad; Man, Mustafa
Affiliations: Univ Malaysia Terengganu Fac Ocean Engn Technol & Informat Terengganu Malaysia
In this paper, advancing web scale knowledge extraction and alignment by integrating few sources has been considered by exploring different methods of aggregation and attention in order to focus on image information. ...

Web data extraction research based on wrapper and XPath technology
International Conference on Advanced Materials and Information Technology Processing (AMITP 2011)
Authors: Liu, Hong; Ma, YinXiao
Affiliations: Zhejiang Gongshang Univ Coll Comp & Informat Engn Hangzhou Zhejiang Peoples R China
For satisfy people's various need, some websites consist of pages that are dynamically generated using a common template populated with data from www, such as product description pages on e-commerce sites. In this...