Network forensics is an investigation technique looking at the network traffic generated by a system. PyFlag is a general purpose, open source, forensic package which merges disk forensics, memory forensics and networ...
详细信息
Network forensics is an investigation technique looking at the network traffic generated by a system. PyFlag is a general purpose, open source, forensic package which merges disk forensics, memory forensics and network forensics. This paper describes the PyFlag architecture and in particular how that is used in the network forensics context. The novel processing of html pages is described and the PyFlag page rendering is demonstrated. PyFlag's novel processing of complex web applications such as Gmail and other web applications is described. Finally PyFlag's report generation capabilities are demonstrated. (c) 2008 Digital Forensic Research Workshop. Published by Elsevier Ltd. All rights reserved.
The paper outlines a framework for automated categorisation of web pages to protect against inappropriate content. The paper contains the framework overview, analysis of state-of-the-art, description of the developed ...
详细信息
The paper outlines a framework for automated categorisation of web pages to protect against inappropriate content. The paper contains the framework overview, analysis of state-of-the-art, description of the developed prototype and its evaluation based on series of experiments. Several sources are used for the categorisation, namely text, html tags and URL addresses. During the categorisation, this data and other information are analysed using machine learning and data mining methods. Finally, the evaluation of the categorisation quality is performed. The categorisation system developed as a result of this work are planned to be partially implemented in F-Secure Corporation in mass production systems performing analysis of web content.
Network forensics is an investigation technique looking at the network traffic generated by a system. PyFlag is a general purpose, open source, forensic package which merges disk forensics, memory forensics and networ...
详细信息
Network forensics is an investigation technique looking at the network traffic generated by a system. PyFlag is a general purpose, open source, forensic package which merges disk forensics, memory forensics and network forensics. This paper describes the PyFlag architecture and in particular how that is used in the network forensics context. The novel processing of html pages is described and the PyFlag page rendering is demonstrated. PyFlag's novel processing of complex web applications such as Gmail and other web applications is described. Finally PyFlag's report generation capabilities are demonstrated. (c) 2008 Digital Forensic Research Workshop. Published by Elsevier Ltd. All rights reserved.
In this paper we present CUTER, a system that processes html pages in order to extract the useful text from them. The mechanism is focalized on html pages that include news articles from major portals and blogs. As us...
详细信息
ISBN:
(纸本)9781424439997
In this paper we present CUTER, a system that processes html pages in order to extract the useful text from them. The mechanism is focalized on html pages that include news articles from major portals and blogs. As useful text we define the body of the article that contains the news report. In order to extract the body of the article we deconstruct the html page to its DOM model and we apply a set of algorithms in order to clean and correct the html code, locate and characterize each node of the DOM model and finally store the text from the nodes that are characterized as useful text nodes. CUTER is a subsystem of peRSSonal, a web tool that is used to obtain news articles from all over the world, process them and present them back to the end users in a personalized manner. The role of CUTER is to feed peRSSonal with the body of the. In this paper we present the basic algorithms and experimental results on the efficiency of the CUTER text extractor.
Currently, Most back-end web databases cannot be indexed by traditional hyperlink-based search engines due to their requirement of users' interactive queries via page form submission. In order to make hidden-Web i...
详细信息
ISBN:
(纸本)9783319220475;9783319220468
Currently, Most back-end web databases cannot be indexed by traditional hyperlink-based search engines due to their requirement of users' interactive queries via page form submission. In order to make hidden-Web information more easily accessible, this paper proposes a hierarchical classifier to locate domain-specific hidden Web entries at a large scale. The classifier is trained by appropriately selected page form features to get rid of non-relevant domains and non-searchable forms. Experiments conducted on eight different topics demonstrate that the technique can discover deep web interfaces accurately and efficiently.
This paper presents an html analyzer for the study of web usability. The analyzer parses html code in order to extract usability information from web pages. For usability aspects that can be analyzed automatically, th...
详细信息
ISBN:
(纸本)9781424427932
This paper presents an html analyzer for the study of web usability. The analyzer parses html code in order to extract usability information from web pages. For usability aspects that can be analyzed automatically, the analyzer draws conclusions and offers suggestions. For more subjective usability issues, it assists the expert by presenting relevant information in a convenient way. Many similar applications exist, but they mostly focus on well-known usability problems and pay little attention to subtler aspects. More alarmingly, they suffer from basic usability problems. Our results show that our analyzer examines several usability aspects related to ease of navigation, understandability, flexibility, and compatibility that are generally ignored by the other tools.
暂无评论