Search results - Inner Mongolia University Library

Mining outliers with faster cutoff update and space utilization

PATTERN RECOGNITION LETTERS, 2010, Vol. 31, No. 11, pp. 1292-1301

Authors: Szeto, Chi-Cheong; Hung, Edward. Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China

Finding unusual data objects with Ramaswamy et al.'s distance-based outlier definition is attractive because it requires only a metric distance function between two objects. Unlike many existing algorithms based on the definition of Knorr and Ng, it needs no neighborhood distance threshold. Bay and Schwabacher proposed ORCA, an efficient algorithm for this task that can give near-linear time performance. To further reduce the running time, we propose in this paper two algorithms, RC and RS, which use the following two techniques, respectively: (i) faster cutoff update, and (ii) space utilization after pruning. We tested RC, RS, and RCS (a hybrid approach combining both RC and RS) on several large and high-dimensional real data sets with millions of objects. The experiments show that RCS runs 1.4-2.3 times as fast as ORCA, and that its improvement is relatively insensitive to increases in the data size. (C) 2010 Elsevier B.V. All rights reserved.
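The cutoff idea mentioned in the abstract can be illustrated with a minimal sketch of an ORCA-style nested loop. This is not the RC, RS, or RCS algorithm from the paper, whose details are not given in this record. It assumes that an object's outlier score is the distance to its k-th nearest neighbor, and the names `top_n_outliers`, `euclidean`, `k`, and `n` are hypothetical.

```python
import heapq
import math
import random


def euclidean(a, b):
    """Metric distance between two equal-length numeric tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def top_n_outliers(points, k, n, dist=euclidean, seed=0):
    """Return up to n (score, index) pairs, largest score first.

    Assumption: score = distance to the k-th nearest neighbor.
    """
    order = list(range(len(points)))
    random.Random(seed).shuffle(order)  # random scan order helps pruning start early
    cutoff = 0.0   # score of the weakest outlier among the current top n
    top = []       # min-heap of (score, index)
    for i in range(len(points)):
        nearest = []  # negated distances: a max-heap of the k smallest so far
        pruned = False
        for j in order:
            if j == i:
                continue
            d = dist(points[i], points[j])
            if len(nearest) < k:
                heapq.heappush(nearest, -d)
            elif d < -nearest[0]:
                heapq.heapreplace(nearest, -d)
            # The current k-th smallest distance is an upper bound on the
            # final score, so a value below the cutoff rules the object out.
            if len(nearest) == k and -nearest[0] < cutoff:
                pruned = True
                break
        if pruned or len(nearest) < k:
            continue
        score = -nearest[0]
        if len(top) < n:
            heapq.heappush(top, (score, i))
        elif score > top[0][0]:
            heapq.heapreplace(top, (score, i))
        if len(top) == n:
            cutoff = top[0][0]  # the cutoff can only grow as better outliers appear
    return sorted(top, reverse=True)
```

The faster the cutoff grows, the earlier the inner loop can stop for ordinary objects, which is presumably why the abstract treats cutoff updates as a target for optimization.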

Keywords: Outlier detection; Distance-based outliers; disk-based algorithms; Memory optimization

Distributed disk-based algorithms for model checking very large Markov chains

FORMAL METHODS IN SYSTEM DESIGN, 2006, Vol. 29, No. 2, pp. 177-196

Authors: Bell, Alexander; Haverkort, Boudewijn R. Univ Twente, Dept Elect Engn Math & Comp Sci, NL-7500 AE Enschede, Netherlands

In this paper we present data structures and distributed algorithms for CSL model checking-based performance and dependability evaluation. We show that all the necessary computations are composed of series or sums of matrix-vector products. We discuss sparse storage structures for the required matrices and present efficient sequential and distributed disk-based algorithms for performing these matrix-vector products. We illustrate the effectiveness of our approach in a number of case studies in which continuous-time Markov chains (generated in a distributed way from stochastic Petri net specifications) with several hundreds of millions of states are solved on a workstation cluster with 26 dual-processor nodes. We show details about the memory consumption, the solution times, and the speedup. The distributed message-passing algorithms have been implemented in a tool called PARSECS, which also takes care of the distributed Markov chain generation and can also be used for distributed CTL model checking of Petri nets.
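To make "series or sums of matrix-vector products" over a sparse matrix concrete, here is a small in-memory sketch. It assumes compressed sparse row (CSR) storage, which may differ from the storage structures the paper actually uses, and it ignores the disk-based and distributed aspects entirely. The function names and the `weights` parameter are hypothetical.

```python
def to_csr(dense):
    """Convert a dense row-major matrix to CSR: (values, col_idx, row_ptr)."""
    values, col_idx, row_ptr = [], [], [0]
    for row in dense:
        for j, v in enumerate(row):
            if v != 0:
                values.append(v)
                col_idx.append(j)
        row_ptr.append(len(values))  # end of this row's nonzeros
    return values, col_idx, row_ptr


def csr_matvec(csr, x):
    """Compute y = A x, touching only the stored nonzeros of A."""
    values, col_idx, row_ptr = csr
    y = [0.0] * (len(row_ptr) - 1)
    for i in range(len(y)):
        for p in range(row_ptr[i], row_ptr[i + 1]):
            y[i] += values[p] * x[col_idx[p]]
    return y


def sum_of_products(csr, x, weights):
    """Compute sum over n of weights[n] * A^n x for a square matrix A.

    Each new term costs exactly one matrix-vector product.
    """
    acc = [0.0] * len(x)
    term = list(x)  # A^0 x
    for n, w in enumerate(weights):
        if n > 0:
            term = csr_matvec(csr, term)  # A^n x from A^(n-1) x
        acc = [a + w * t for a, t in zip(acc, term)]
    return acc
```

Because each row's nonzeros are contiguous in `values`, the product reads the matrix sequentially, which is the access pattern that makes streaming a matrix from disk plausible.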

Keywords: Markov chains; matrix-vector product; disk-based algorithms; state-space generation; CSL model checking; distributed algorithms

Optimization and evaluation of shortest path queries

VLDB JOURNAL, 2007, Vol. 16, No. 3, pp. 343-369

Authors: Chan, Edward P. F.; Lim, Heechul. Univ Waterloo, Sch Comp Sci, Waterloo, ON N2L 3G1, Canada

We investigate the problem of how to efficiently evaluate a collection of shortest path queries on massive graphs that are too big to fit in the main memory. To evaluate a shortest path query efficiently, we introduce two pruning algorithms. These algorithms differ in the extent of materialization of shortest path cost and in how the search space is pruned. By grouping shortest path queries properly, batch processing improves the performance of shortest path query evaluation. We also study extensively the fragment sizes, cache sizes and query types, which we show affect the performance of a disk-based shortest path algorithm. The performance and scalability of the proposed techniques are evaluated with large road systems in the Eastern United States. To demonstrate that the proposed disk-based algorithms are viable, we show that their search times are significantly better than that of main-memory Dijkstra's algorithm.
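For orientation, the main-memory Dijkstra baseline and a very simple form of query grouping can be sketched as follows. This is not the paper's pruning or materialization scheme, which this record does not describe. Grouping queries by source vertex is an assumption chosen for illustration, and `dijkstra`, `batch_queries`, and the adjacency-dict graph format are hypothetical.

```python
import heapq
from collections import defaultdict


def dijkstra(graph, source, targets):
    """Shortest-path cost from source to each target.

    graph maps a vertex to a list of (neighbor, non-negative weight) pairs.
    The search stops as soon as every requested target has been settled.
    """
    remaining = set(targets)
    dist = {source: 0.0}
    settled = set()
    heap = [(0.0, source)]
    while heap and remaining:
        d, u = heapq.heappop(heap)
        if u in settled:
            continue  # stale heap entry
        settled.add(u)
        remaining.discard(u)
        for v, w in graph.get(u, ()):
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    # A target that was never settled is unreachable from source.
    return {t: dist[t] if t in settled else float("inf") for t in targets}


def batch_queries(graph, queries):
    """Answer (source, target) queries, sharing one search per distinct source."""
    by_source = defaultdict(list)
    for s, t in queries:
        by_source[s].append(t)
    answers = {}
    for s, ts in by_source.items():
        costs = dijkstra(graph, s, ts)
        for t in ts:
            answers[(s, t)] = costs[t]
    return answers
```

Sharing one search among queries with the same source means the graph data around that source is read once rather than once per query, which is the kind of saving batch processing aims for when the graph lives on disk.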

Keywords: shortest path queries; route queries; query evaluation and optimization; graph pruning; disk-based algorithms; graph algorithms
