咨询与建议

限定检索结果

文献类型

  • 6 篇 期刊文献
  • 3 篇 会议
  • 1 册 图书

馆藏范围

  • 10 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 8 篇 工学
    • 8 篇 计算机科学与技术...
    • 4 篇 电气工程
    • 3 篇 软件工程
    • 2 篇 信息与通信工程
    • 1 篇 机械工程
  • 1 篇 法学
    • 1 篇 法学

主题

  • 10 篇 html documents
  • 3 篇 dom trees
  • 2 篇 natural language...
  • 2 篇 information retr...
  • 1 篇 reliability
  • 1 篇 noise reduction
  • 1 篇 world wide web
  • 1 篇 text summarizati...
  • 1 篇 ontology learnin...
  • 1 篇 systems laborato...
  • 1 篇 chapter
  • 1 篇 structured html
  • 1 篇 file
  • 1 篇 textual content
  • 1 篇 noise eliminatio...
  • 1 篇 html file
  • 1 篇 content extracti...
  • 1 篇 speech rendering
  • 1 篇 documents
  • 1 篇 web tables

机构

  • 1 篇 univ seville ets...
  • 1 篇 bahauddin zakari...
  • 1 篇 yanshan univ sch...
  • 1 篇 ahlia univ dept ...
  • 1 篇 women univ inst ...
  • 1 篇 columbia univ de...
  • 1 篇 univ surrey sch ...
  • 1 篇 allama iqbal ope...
  • 1 篇 univ malaya fac ...
  • 1 篇 columbia univ de...
  • 1 篇 yanshan univ key...
  • 1 篇 yanshan univ key...
  • 1 篇 columbia univ de...
  • 1 篇 hewlett packard ...
  • 1 篇 inst southern pu...
  • 1 篇 columbia univ de...
  • 1 篇 german res ctr a...
  • 1 篇 shijiazhuang ins...
  • 1 篇 e-cim center cor...
  • 1 篇 german univ cair...

作者

  • 1 篇 roldan juan c.
  • 1 篇 li huanhuan
  • 1 篇 bakry menna
  • 1 篇 zhang hekai
  • 1 篇 chiang mf
  • 1 篇 maus heiko
  • 1 篇 liu sam
  • 1 篇 corchuelo rafael
  • 1 篇 jabeen taiba
  • 1 篇 honkasalo pessi
  • 1 篇 al-dallal ammar
  • 1 篇 dengel andreas
  • 1 篇 joshi parag mule...
  • 1 篇 raza binish
  • 1 篇 gupta s
  • 1 篇 gong jibing
  • 1 篇 starren j
  • 1 篇 lim jg
  • 1 篇 kaiser ge
  • 1 篇 jimenez patricia

语言

  • 9 篇 英文
  • 1 篇 其他
检索条件"主题词=HTML documents"
10 条 记 录,以下是1-10 订阅
排序:
Using Combined List Hierarchy and Headings of html documents for Learning Domain-Specific Ontology
收藏 引用
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS 2020年 第4期11卷 233-239页
作者: Raza, Muhammad Ahsan Raza, Binish Jabeen, Taiba Raza, Sehrish Abbas, Munnawar Bahauddin Zakariya Univ Dept Informat Technol Multan Pakistan Univ Malaya Fac Comp Sci & Informat Technol Kuala Lumpur Malaysia Allama Iqbal Open Univ Fac Educ Multan Pakistan Women Univ Inst Comp Sci & Informat Technol Multan Pakistan Inst Southern Punjab Dept Comp Sci Multan Pakistan
html pages contain unstructured and diverse information. However, these documents lack semantics and are not machine understandable. Semantic webs aim to add formal semantics to web data, whereas ontology provides for... 详细信息
来源: 评论
On extracting data from tables that are encoded using html
收藏 引用
KNOWLEDGE-BASED SYSTEMS 2020年 190卷 105157-105157页
作者: Roldan, Juan C. Jimenez, Patricia Corchuelo, Rafael Univ Seville ETSI Informat Avda Reina Mercedes S-N E-41012 Seville Spain
Tables are a common means to display data in human-friendly formats. Many authors have worked on proposals to extract those data back since this has many interesting applications. In this article, we summarise and com... 详细信息
来源: 评论
VB-PTC: Visual Block Multi-Record Text Extraction Based on Sensor Network Page Type Conversion
收藏 引用
IEEE ACCESS 2020年 8卷 167900-167913页
作者: Gong, Jibing Zhang, Hekai Du, Weixia Li, Huanhuan Wen, Hongnian Yanshan Univ Sch Informat Sci & Engn Qinhuangdao 066004 Hebei Peoples R China Yanshan Univ Key Lab Comp Virtual Technol & Syst Integrat Hebe Qinhuangdao 066004 Hebei Peoples R China Yanshan Univ Key Lab Software Engn Hebei Prov Qinhuangdao 066004 Hebei Peoples R China Shijiazhuang Inst Railway Technol Sch Informat Sci & Engn Shijiazhaung 050041 Peoples R China
Usually, in addition to the main content, web pages contain additional information in the form of noise, such as navigation elements, sidebars and advertisements. This kind of noise has nothing to do with the main con... 详细信息
来源: 评论
Dec :: Tech Reports :: Nsl-Tn-12
收藏 引用
2016年
[Auto Generated] 1. Scribe vs. html 1 2. Making Scribe produce html 1 3. Making A Structured html Document 2 3.1. Coexistence With Other Device Types 3 3.2. Convert @Section to @MakeSection 3 3.3. Convert @Chapter to ... 详细信息
来源: 评论
Supporting Early Contextualization of Textual Content in Digital documents on the Web  13
Supporting Early Contextualization of Textual Content in Dig...
收藏 引用
13th IAPR International Conference on Document Analysis and Recognition (ICDAR)
作者: Eldesouky, Bahaa Bakry, Menna Maus, Heiko Dengel, Andreas German Res Ctr Artificial Intelligence DFKI Knowledge Management Dept Kaiserslautern Germany German Univ Cairo New Cairo Egypt
The World Wide Web is arguably the most important source of digital documents nowadays. These documents mainly consist of unstructured and semi-structured data comprising a wealth of information at the disposal of the... 详细信息
来源: 评论
The Effect of Hybrid Crossover Technique on Enhancing Recall and Precision in Information Retrieval
The Effect of Hybrid Crossover Technique on Enhancing Recall...
收藏 引用
World Congress on Engineering (WCE 2013)
作者: Al-Dallal, Ammar Ahlia Univ Dept Comp Engn Manama Bahrain
Several techniques are proposed to retrieve the most relevant html documents to user query. Among these techniques is the genetic algorithm which iteratively creates several generations using selection, crossover and ... 详细信息
来源: 评论
Links and copyright law
收藏 引用
COMPUTER LAW & SECURITY REVIEW 2011年 第3期27卷 258-266页
作者: Honkasalo, Pessi Univ Surrey Sch Law Guildford GU2 7XH Surrey England
For at least 15 years, there have been question marks over the legal permissibility of connecting one web resource to another by means of links. The purpose of this paper is to assess where we stand in terms of the le... 详细信息
来源: 评论
Web Document Text and Images Extraction using DOM Analysis and Natural Language Processing  09
Web Document Text and Images Extraction using DOM Analysis a...
收藏 引用
9th ACM Symposium on Document Engineering
作者: Joshi, Parag Mulendra Liu, Sam Hewlett Packard Labs Palo Alto CA 94304 USA
Web has emerged as the most important source of information in the world. This has resulted in need for automated software components to analyze web pages and harvest useful information from them. However, in typical ... 详细信息
来源: 评论
Automating content extraction of html documents
收藏 引用
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS 2005年 第2期8卷 179-224页
作者: Gupta, S Kaiser, GE Grimm, P Chiang, MF Starren, J Columbia Univ Dept Comp Sci New York NY 10027 USA Columbia Univ Dept Elect Engn New York NY 10027 USA Columbia Univ Dept Ophthalmol New York NY 10032 USA Columbia Univ Dept Biomed Informat New York NY 10032 USA
Web pages often contain clutter (such as unnecessary images and extraneous links) around the body of an article that distracts a user from actual content. Extraction of "useful and relevant" content from web... 详细信息
来源: 评论
USING COOLLISTS TO INDEX html documents IN THE WEB
COMPUTER NETWORKS AND ISDN SYSTEMS
收藏 引用
COMPUTER NETWORKS AND ISDN SYSTEMS 1995年 第1-2期28卷 147-154页
作者: LIM, JG E-CIM Center Corparate Technical Operations Samsung Electronics Suwon Korea
This paper suggests a partial solution (limited to html documents) to the Web-indexing problem using Coo[lists. Roughly, a Coollist is equivalent to a Hotlist in Mosaic except that it automatically records all the vis... 详细信息
来源: 评论