Search Results - Inner Mongolia University Library

International Asia-Pacific Web Conference (APWEB)

Authors: Yansong Zhang, Shan Wang, Wei Huang; Key Laboratory of Ministry of Education of Data Engineering and Knowledge Engineering, Beijing, China; School of Information, Renmin University of China, Beijing, China

The requirements of OLAP applications are growing rapidly with dramatic increases in data volume, number of users, query volume, and query complexity. The need to shorten the update period of the data warehouse is another crucial factor for a scalable OLAP application. In this paper, we propose a scalable OLAP prototype that supports query processing over increasing data volumes by distributing the fact tuples across multiple servers to construct a set of sibling cubes, which can be merged to obtain the whole cube. We employ a lightweight distribution policy with fully duplicated dimension tables on each sibling server, based on the observation that dimension tables account for a very small proportion of the space cost. An OLAP query with distributive aggregate functions can be transformed into queries that are executed in parallel on the sibling servers. For aggregate functions that are not distributive, such as median, we propose an optimized median computing algorithm that reduces the transmission volume between servers while computing the global median values. We also present a three-level data warehouse framework to meet the requirement of a shorter update period in "operational business intelligence". An asynchronous tunnel model is proposed to reduce update latency by pre-fetching updated tuples to the OLAP processing server. Finally, we build a prototype system, ParaCube, to evaluate performance on shared-nothing (SN) and multi-core platforms.

Keywords: Aggregates; Distributed computing; Concurrent computing; Data warehouses; Merging; Application software; Prototypes; Query processing; Acceleration; Material storage
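
The sibling-cube idea in the abstract above can be illustrated with a minimal sketch. It assumes each server aggregates its own fact tuples into per-group partial sums and counts, which are then added together; the function names and sample data are hypothetical and are not taken from ParaCube. The last lines show why median needs separate treatment: combining local medians does not give the global median. The paper's optimized median algorithm is not reproduced here.

```python
from collections import defaultdict
from statistics import median


def local_cube(fact_rows):
    """Aggregate one server's fact tuples into {group_key: (sum, count)}."""
    cube = defaultdict(lambda: [0.0, 0])
    for key, measure in fact_rows:
        cube[key][0] += measure
        cube[key][1] += 1
    return {k: (s, c) for k, (s, c) in cube.items()}


def merge_cubes(cubes):
    """Merge sibling cubes by adding partial sums and counts per group."""
    merged = defaultdict(lambda: [0.0, 0])
    for cube in cubes:
        for key, (s, c) in cube.items():
            merged[key][0] += s
            merged[key][1] += c
    return {k: (s, c) for k, (s, c) in merged.items()}


# Hypothetical fact tuples (group key, measure) split across two servers.
server_a = [("east", 10.0), ("east", 20.0), ("west", 5.0)]
server_b = [("east", 30.0), ("west", 7.0), ("west", 9.0)]

merged = merge_cubes([local_cube(server_a), local_cube(server_b)])

# AVG is derived from the merged SUM and COUNT, never from local averages.
averages = {k: s / c for k, (s, c) in merged.items()}

# Median does not decompose: the median of local medians is not the global one.
east_a = [10.0, 20.0]
east_b = [30.0]
median_of_medians = median([median(east_a), median(east_b)])  # 22.5
global_median = median(east_a + east_b)  # 20.0
```

SUM and COUNT merge by plain addition, which is why such queries can run independently on each server, while the median example shows the gap that a dedicated algorithm has to close.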

Query-Aware Complex Object Buffer Management in XML Information Retrieval

International Asia-Pacific Web Conference (APWEB)

Authors: Qiushi Li, Qiuyue Wang, Shan Wang; Key Laboratory of Ministry of Education of Data Engineering and Knowledge Engineering, Beijing, China; School of Information, Renmin University of China, Beijing, China

In this paper, we analyse the data access characteristics of a typical XML information retrieval system and propose a new query-aware buffer replacement algorithm based on the prediction of Minimum Reuse Distance (MRD for short). The algorithm predicts an object's next reference distance according to the retrieval system's running status and replaces the objects that have the maximum reuse distances. The factors considered in the replacement algorithm include the access frequency, creation cost, and size of objects, as well as the queries being executed. By taking into account the queries currently running or queuing in the system, the MRD algorithm can predict the reuse distances of index data objects more accurately.

Keywords: Information management; XML; Information retrieval; Tree data structures; Indexing; Prediction algorithms; Algorithm design and analysis; Frequency data analysis; Information analysis
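
The replacement rule described in the abstract above (evict the object with the largest predicted reuse distance) can be sketched as follows. This is a simplified illustration under a strong assumption: the reuse distance of an object is taken to be the position of the first queued query that needs it, and access frequency, creation cost, and object size are ignored. The function and object names are hypothetical, not the paper's.

```python
import math


def predicted_reuse_distance(obj_id, pending_queries):
    """Position of the first pending query that needs obj_id, or infinity."""
    for distance, needed_objects in enumerate(pending_queries):
        if obj_id in needed_objects:
            return distance
    return math.inf


def choose_victim(buffered_objects, pending_queries):
    """Pick the buffered object whose predicted next use is farthest away."""
    return max(
        buffered_objects,
        key=lambda obj_id: predicted_reuse_distance(obj_id, pending_queries),
    )


# Hypothetical buffer contents and queue of queries (sets of needed objects).
buffered = ["idx_title", "idx_author", "idx_abstract"]
pending = [{"idx_author"}, {"idx_title", "idx_author"}]

victim = choose_victim(buffered, pending)
```

Here `idx_abstract` is not needed by any queued query, so its predicted distance is infinite and it is the one chosen for eviction.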

VANETs-Based Real-time Traffic Data Dissemination

2010 IEEE International Conference on Wireless Communications, Networking and Information Security (WCNIS)

Authors: Wenping Chen; Department of Computer Science, Renmin University of China, Beijing; Key Laboratory of Data Engineering and Knowledge Engineering (Renmin University of China), MOE

ISBN: (print) 9781424458509

Traffic congestion is a very serious problem in large cities. With the number of vehicles increasing rapidly, especially in cities whose economies are booming, the situation is getting even worse. In this paper, by leveraging the techniques of Vehicular Ad hoc Networks (VANETs), we present a real-time abnormal traffic data dissemination protocol. Specifically, all vehicles running on the same road segment are regarded as a cluster that generates traffic messages about this segment. To reduce communication cost, only abnormal traffic data is issued and spread to nearby road segments. By combining event-driven and periodic mechanisms, the abnormal traffic messages are disseminated in time to the vehicles that probably need them. We propose a distance-dependent forwarder selection method to disseminate traffic messages. Inside a cluster, messages are forwarded along the segment from one end to the other based on the least-hops principle; among clusters, messages are transmitted in epidemic routing mode, which ensures fast and reliable dissemination. To evaluate the performance of our protocol, we use real traffic data from Beijing at peak hour. The simulation results demonstrate that our protocol is feasible and efficient for a metropolitan-size city.

Keywords: vehicular networks; traffic; real-time
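
The least-hops forwarding inside a cluster can be pictured with a small sketch. The abstract does not spell out the selection rule, so this example assumes a common greedy choice: among the vehicles ahead of the sender and within radio range, the farthest one relays the message. Positions are metres along a one-dimensional road segment, and all names and numbers are hypothetical.

```python
def select_forwarder(sender_pos, vehicle_positions, radio_range):
    """Return the farthest vehicle ahead of the sender within radio range."""
    candidates = [
        pos
        for pos in vehicle_positions
        if sender_pos < pos <= sender_pos + radio_range
    ]
    return max(candidates) if candidates else None


def relay_path(start_pos, vehicle_positions, radio_range):
    """Follow the greedy forwarder choice until no vehicle is reachable."""
    path = [start_pos]
    current = start_pos
    while True:
        next_hop = select_forwarder(current, vehicle_positions, radio_range)
        if next_hop is None:
            return path
        path.append(next_hop)
        current = next_hop


# Hypothetical vehicle positions along one road segment, 250 m radio range.
positions = [0, 120, 200, 260, 430, 480, 700]
path = relay_path(0, positions, 250)
```

With these values the message travels 0, 200, 430, 480, 700, skipping the vehicles at 120 and 260 because a farther relay was reachable at each step.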

Constructing Corpus for Query-oriented XML Text Summarization

2010 International Conference on Management of e-Commerce and e-Government (4th International Conference on Management of e-Commerce and e-Government, ICMeCG 2010)

Authors: Shihan WU, Dexi LIU, Xianpei JIAO; Jiangxi Key Laboratory of Data and Knowledge Engineering, School of Information Technology, Jiangxi University of Finance & Economics, Nanchang, Jiangxi, China

XML retrieval is becoming a focus of study in the fields of information retrieval and databases. Summarizing the results returned by XML search engines will alleviate the reading burden on users. However, as the basis of this study, the construction of a query-oriented XML text summarization corpus has not yet received enough attention. In this paper, we introduce our work on constructing this kind of corpus, including the selection of topics and XML elements/documents, the construction process, and the features of the constructed corpus. Up to now, the corpus has 25 English query topics with 422 elements for summarization, and 32 Chinese topics with 402 elements. For each topic, 4 extracted summaries and 4 generated summaries are produced manually by 4 experts.

Keywords: Query-oriented; XML; Automatic summarization; Corpus

Notice of Retraction: Studies of knowledge management methods from literature

IEEE International Conference on Advanced Management Science (ICAMS)

Authors: Jing Zhang, Liuqi Ye; Business School, Hohai University (HHU), Nanjing, Jiangsu, China; Key Laboratory of Data Engineering and Knowledge Engineering, Renmin University of China, Beijing, China

This article has been retracted by the publisher.

Notice of Retraction: Knowledge management technologies and applications: A literature review

IEEE International Conference on Advanced Management Science (ICAMS)

Authors: Xiaomi An, Wang Wang; Key Laboratory of Data Engineering and Knowledge Engineering, Renmin University of China, Beijing, China; School of Information Resource Management, Renmin University of China, Beijing, China

This article has been retracted by the publisher.

Detecting Spam Comments with Malicious Users’ Behavioral Characteristics

2010 IEEE International Conference on Information Theory and Information Security

Authors: Wenchang Shi, Bin Liang, Zhaohui Liang, Qianqian Wang, Wei Sun; Key Laboratory of Data Engineering and Knowledge Engineering (Renmin University of China), MOE; Schoo; Beijing Venustech Cybervision Co. Ltd, Beijing 100094, China

In recent years, the spread of spam comments has become a main obstacle limiting the development of commercialized social networks. This paper analyzes the differences in behavioral characteristics between normal users and malicious users. Based on these characteristics, we propose several heuristic methods to detect spam comments. These methods evaluate comments from three perspectives: the time-frequency characteristics of comments, the text similarity of comments, and the number of target domains that each user's comments refer to. On our collected dataset, the experimental results indicate that the accuracies of our detection strategy I (strategy for high accuracy) and strategy II (strategy for wide coverage) are 100% and 92.6%, respectively. This preliminary evaluation of the proposed detection methods shows promising results.

Keywords: Books; Conferences; Unsolicited electronic mail; Accuracy; Commercialization; Time series analysis; Filtering
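
The three perspectives named in the abstract above can be turned into per-user features roughly as follows. This is only an illustrative sketch: the abstract gives no formulas or thresholds, so the posting rate, the `difflib` similarity ratio, the URL pattern, and the sample comments are all assumptions rather than the authors' method.

```python
import re
from difflib import SequenceMatcher
from itertools import combinations
from urllib.parse import urlparse

URL_PATTERN = re.compile(r"https?://\S+")


def comment_features(comments):
    """Compute features from a non-empty list of (timestamp_seconds, text)."""
    times = sorted(timestamp for timestamp, _ in comments)
    span_seconds = max(times[-1] - times[0], 1)  # avoid division by zero
    texts = [text for _, text in comments]

    # Average pairwise text similarity across the user's comments.
    pairs = list(combinations(texts, 2))
    if pairs:
        avg_similarity = sum(
            SequenceMatcher(None, a, b).ratio() for a, b in pairs
        ) / len(pairs)
    else:
        avg_similarity = 0.0

    # Distinct domains that the user's comments link to.
    domains = {
        urlparse(url).netloc
        for text in texts
        for url in URL_PATTERN.findall(text)
    }

    return {
        "rate_per_minute": 60 * len(comments) / span_seconds,
        "avg_similarity": avg_similarity,
        "target_domains": len(domains),
    }


# Hypothetical comments from one user: near-identical text, posted quickly.
sample = [
    (0, "Great deals at http://shop.example/a"),
    (20, "Great deals at http://shop.example/b"),
    (40, "Great deals at http://shop.example/c"),
]
features = comment_features(sample)
```

How these features are combined into detection strategies I and II is described in the paper itself; the sketch stops at feature extraction.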

Notice of Retraction: System Formulation of Knowledge Sharing and Exchange in the Organization

International Conference on Management and Service Science, MASS

Authors: Jing Zhang, Liuqi Ye; Business School, Hohai University (HHU), Nanjing, Jiangsu, China; Key Laboratory of Data Engineering and Knowledge Engineering, Ministry of Education, Renmin University of China, Beijing, China

This article has been retracted by the publisher.

Global Top-k aggregate queries based on x-tuple in uncertain database

24th IEEE International Conference on Advanced Information Networking and Applications Workshops, WAINA 2010

Authors: Liu, Dexi; Wan, Changxuan; Xiong, Naixue; Park, Jong Hyuk; Yeoe, Sang-Soo. School of Information Technology, Jiangxi Key Laboratory of Data and Knowledge Engineering, Jiangxi University of Finance and Economics, Nanchang, China; Dept. of Computer Science, Georgia State University, Atlanta, GA, United States; Dept. of Computer Science and Engineering, Seoul National University of Technology, Seoul, Republic of Korea; Division of Computer Engineering, Mokwon University, Daejeon, Republic of Korea

ISBN: (print) 9780769540191

A Top-k aggregate query, which is a powerful technique for dealing with large quantities of data, ranks groups of tuples by their aggregate values and returns the k groups with the highest aggregate values. However, compared to Top-k in traditional databases, queries over uncertain databases are more complicated because of the existence of exponentially many possible worlds. As a powerful semantics for Top-k in uncertain databases, Global Top-k returns the k highest-ranked tuples according to their probabilities of being in the Top-k answers across possible worlds. We propose an x-tuple based method to process Global Top-k aggregate queries in uncertain databases. Our method has two levels: group state generation and G-x-Top-k query processing. In the former level, group states, which satisfy the properties of x-tuples, are generated one after the other according to their aggregate values, while in the latter level, dynamic programming based Global x-tuple Top-k query processing is employed to return the answers. Comprehensive experiments on different data sets demonstrate the effectiveness of the proposed solutions. © 2010 IEEE.

Keywords: Dynamic programming
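
The Global Top-k semantics in the abstract above can be made concrete with a short dynamic-programming sketch. It covers only a simplified case, assumed here for illustration: tuples are independent and have distinct scores, so a tuple is in the Top-k answer exactly when it exists and fewer than k higher-scored tuples exist. The mutually exclusive alternatives of x-tuples and the group-state generation from the paper are not modelled, and the names and data are hypothetical.

```python
def topk_probabilities(tuples, k):
    """Map each tuple name to its probability of being in the Top-k answer.

    tuples: list of (name, score, existence_probability), assumed independent
    with distinct scores.
    """
    ranked = sorted(tuples, key=lambda item: item[1], reverse=True)
    # dist[j] = probability that exactly j higher-scored tuples exist (j < k).
    dist = [1.0] + [0.0] * (k - 1)
    result = {}
    for name, _, prob in ranked:
        result[name] = prob * sum(dist)
        updated = [0.0] * k
        for j in range(k):
            updated[j] += dist[j] * (1 - prob)  # this tuple is absent
            if j + 1 < k:
                updated[j + 1] += dist[j] * prob  # this tuple is present
        dist = updated
    return result


def global_topk(tuples, k):
    """Return the k tuple names with the highest Top-k probabilities."""
    probs = topk_probabilities(tuples, k)
    return sorted(probs, key=probs.get, reverse=True)[:k]


# Hypothetical uncertain tuples: (name, score, existence probability).
data = [("a", 90, 0.5), ("b", 80, 0.5), ("c", 70, 1.0)]
probs = topk_probabilities(data, 2)
answer = global_topk(data, 2)
```

In this example the lowest-scored tuple `c` has the highest Top-2 probability (0.75) because it always exists, which shows how ranking by probability differs from ranking by score alone.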

Pseudo-relevance feedback driven for XML query expansion

Journal of Convergence Information Technology, 2010, Vol. 5, No. 9, pp. 146-156

Authors: Minjuan, Zhong; Changxuan, Wan. School of Information Technology, Jiangxi University of Finance and Economics, Nanchang 330013, China; Jiangxi Key Laboratory of Data and Knowledge Engineering, Jiangxi University of Finance and Economics, Nanchang 330013, China

Pseudo-relevance feedback has been perceived as an effective solution for automatic query expansion. However, a recent study has shown that traditional pseudo-relevance feedback may introduce topic drift and hence harm retrieval performance. It is often crucial to identify the good feedback documents from which useful expansion terms can be added to the query. Compared with traditional query expansion, XML query expansion requires not only content expansion but also structural expansion. This paper presents a solution for both identifying related documents and selecting good expansion information with new content and path constraints. Combined with XML semantic features, a naïve document similarity measurement is proposed. Based on this, a k-median clustering algorithm is first applied to find related documents. Query expansion is then performed in two steps using only the set of related documents: in the first step, a key phrase extraction algorithm is applied to expand the original query, and the second step is structural expansion based on the expanded key phrases. Finally, a full-fledged content-structure query expression that can represent the user's intention is formalized. Experimental results on the IEEE CS collection show that the proposed method can reduce topic drift effectively and obtain better retrieval quality.

Keywords: Clustering algorithms
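
For orientation, the traditional pseudo-relevance feedback that the abstract above takes as its starting point can be sketched as below. This is a generic baseline, not the paper's method: it assumes the top-ranked documents are already available as plain strings and simply adds the most frequent new terms to the query. The clustering, key phrase extraction, and structural expansion steps are not shown, and the function name and data are hypothetical.

```python
from collections import Counter


def expand_query(query_terms, feedback_docs, num_terms):
    """Append the most frequent non-query terms from the feedback documents."""
    counts = Counter(
        term
        for doc in feedback_docs
        for term in doc.lower().split()
        if term not in query_terms
    )
    new_terms = [term for term, _ in counts.most_common(num_terms)]
    return list(query_terms) + new_terms


# Hypothetical query and top-ranked feedback documents.
query = ["xml", "retrieval"]
docs = [
    "xml retrieval ranking model",
    "ranking xml elements",
    "structured retrieval ranking",
]
expanded = expand_query(query, docs, 1)
```

If the feedback documents were off-topic, the added terms would pull the query away from the user's intent, which is the topic drift the abstract describes.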
