Author affiliations: Univ Nacl San Luis, San Luis, Argentina; Univ Santiago Chile; Yahoo! Res Latin Amer, Santiago, Chile
Publication: Journal of Discrete Algorithms
Year, volume, issue: 2009, Vol. 7, No. 1
Pages: 3-17
Core indexing:
Subject classification: 07 [Science]; 0701 [Science - Mathematics]; 070101 [Science - Pure Mathematics]
Funding: FONDECYT, Chile; HPC-EUROPA++ project; CYTED [506PI0293]; Conicet
Keywords: Metric space databases; Query processing and indexing; Parallel and distributed computing
Abstract: Similarity search has proven suitable for searching large collections of unstructured data objects. A number of practical index data structures have been proposed for this purpose, all of them devised to process single queries sequentially. However, in large-scale systems such as Web search engines indexing multimedia content, it is critical to deal efficiently with streams of queries rather than with single queries. In this paper we show how to achieve efficient and scalable performance in this context. To this end we transform a sequential index based on clustering into a distributed one and devise algorithms and optimizations specifically tailored to support high-performance parallel query processing. © 2008 Elsevier B.V. All rights reserved.