Traditional web database cache techniques have a major disadvantage, namely poor data freshness, because they employ an asynchronous data refresh strategy. A novel web database cache, DB Façade, is proposed in th...
详细信息
In this paper, a rapid resynchronization method using intent logs is suggested for replication In-memory databases supporting mobile communication applications. Both the identifiers of unsynchronized segments and the ...
详细信息
Top-k queries in uncertain databases are quite popular and useful due to its wide application usage. However, compared to Top-k in traditional databases, queries over uncertain database are more complicated because of...
详细信息
Social bookmarking systems and their emergent information structures, known as folksonomies, are increasingly important data sources for Semantic Web applications. A key question for harvesting semantics from these sy...
详细信息
ISBN:
(纸本)9781605584874
Social bookmarking systems and their emergent information structures, known as folksonomies, are increasingly important data sources for Semantic Web applications. A key question for harvesting semantics from these systems is how to extend and adapt traditional notions of similarity to folksonomies, and which measures are best suited for applications such as navigation support, semantic search, and ontology learning. Here we build an evaluation framework to compare various general folksonomy-based similarity measures derived from established information-theoretic, statistical, and practical measures. Our framework deals generally and symmetrically with users, tags, and resources. For evaluation purposes we focuson similarity among tags and resources, considering different ways to aggregate annotations across users. After comparing how tag similarity measures predict user-created tag relations, we provide an external grounding by user-validated semantic proxies based on WordNet and the Open Directory. We also investigate the issue of scalability. We find that mutual information with distributional micro-aggregation across users yields the highest accuracy, but is not scalable;per-user projection with collaborative aggregation provides the best scalable approach via incremental computations. The results are consistent across resource and tag similarity. Copyright is held by the International World Wide Web Conference Committee (IW3C2).
Electronic records management (ERM) is fundamental to good governance, business efficiency and effectiveness of e-government in networking and digital society. However, government authorities in China has faced many c...
详细信息
Electronic records management (ERM) is fundamental to good governance, business efficiency and effectiveness of e-government in networking and digital society. However, government authorities in China has faced many challenges in managing electronic records. A study of ERM in e-government strategy in other countries would be significant to support development of ERM in Chinese e-government strategy. As little is known from literature on the status of electronic records management in e-government strategy, this paper selects U.S, New Zealand and UK for case study by means of collecting and analyzing publications, laws, regulations and policies relevant to both electronic records management and e-government strategy. The author first analyzes status of ERM in e-government strategy and then discusses models and driving forces of their existences. The result of the study revealed that electronic records are managed as national assets, resources and trusted information in the e-government strategy. Driven by core business of e-government in different country, there are different approaches to ERM in e-government strategy. The implications of the study is that, to position ERM in e-government strategy, it is significant to provide a strategic approach to ERM for shared agenda by both government authorities and records management professionals. The findings intended to be of use to both the effective e-government strategy and good ERM for dynamic, interactive and proactive citizen-centered services across domains and time dimensions.
Ajax is an important approach for improving rich interactivity between Web server and end users during Web 2.0 eras. At the same time, the structured data in AJAX Web pages can not be extracted easily due to its async...
详细信息
Ajax is an important approach for improving rich interactivity between Web server and end users during Web 2.0 eras. At the same time, the structured data in AJAX Web pages can not be extracted easily due to its asynchronous loading. In this paper, we propose a technique for extracting the structured data from the AJAX based Web pages. Firstly, an AjaxFetcher component is created to fetch the dynamic page content by using an embedded browser. Secondly, two different strategies are used to extract the structured data from the obtained page contents. Especially for the page that contains multi-records, an automatic approach to determine each possible record is proposed. Experimental results show that fetching Ajax pages and extracting the structured data from them is feasible.
Facing limitations of existing systems for process integrity measurement, we put forward a method with its prototype system PDIMS to measure process runtime integrity. Based on structure of process and format of execu...
详细信息
Facing limitations of existing systems for process integrity measurement, we put forward a method with its prototype system PDIMS to measure process runtime integrity. Based on structure of process and format of executable file, PDIMS anatomizes the codepage layout of runtime process. Combining OS mechanisms and modern CPU's support for code execution, PDIMS catches and measures code page in the kernel when it executes. PDIMS depends on CPU's non-executable bit to detect code execution and on the binary format of executables of the process as criterion to verify code modifications in kernel. PDIMS provides trustworthy information about whether a running process is modified. PDIMS introduces less than 4% overhead to OS.
Nowadays mainstream RDF Repository Systems are based on RDBMS. The SPARQL query engine translates a SPARQL query into a SQL one, and then the RDBMS executes the SQL query. However the RDBMS optimizers, which usually a...
详细信息
Nowadays mainstream RDF Repository Systems are based on RDBMS. The SPARQL query engine translates a SPARQL query into a SQL one, and then the RDBMS executes the SQL query. However the RDBMS optimizers, which usually assume that columns are statistically independent, often underestimate the selectivity of conjunctive predicates and choose a bad query execution plan. It is important for query optimizers to detect correlations among properties. We propose a way of computing property correlations based on ontology itself in order to improve the execution performance of the SQL translated from SPARQL statement queries.
One of the major limitations of current search engines is that users could not quickly locate what they want if the input query is too general. Some existing techniques try to cluster web search results into groups so...
详细信息
One of the major limitations of current search engines is that users could not quickly locate what they want if the input query is too general. Some existing techniques try to cluster web search results into groups so as to user's quick browsing. In this paper, we present a new approach to categorize the web search results by using YAGO ontology. It utilizes the YAGO ontology to automatically generate categories for the user's specific query and classify the search results into appropriate categories. Our experimental results indicate that our method is feasible and effectiveness.
暂无评论