文献详情 >Two-dimensional indexing to pr... 收藏

Two-dimensional indexing to provide one-integrated-memory view of distributed memory for a massively-parallel search engine (vol 22, pg 2437, 2018)

二维的索引将为一个大规模平行搜索引擎提供分布式的存储器的 one-integrated-memory 看法

作者：Yun, Tae-Seob Whang, Kyu-Young Kwon, Hyuk-Yoon Kim, Jun-Sung Song, Il-Yeol

作者机构：Korea Adv Inst Sci & Technol Dept Comp Sci Daejeon South Korea Seoul Natl Univ Sci & Technol Dept Global Fus Ind Engn Seoul South Korea Drexel Univ Coll Informat Sci & Technol Philadelphia PA 19104 USA

出版物：《WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS》 (万维网)

年卷期：2019年第22卷第6期

页面：2469-2470页

核心收录：

学科分类：08[工学] 0835[工学-软件工程] 0812[工学-计算机科学与技术（可授工学、理学学位）]

基　　金：This work was supported by the National Research Foundation of Korea(NRF) grant funded by Korean Government(MSIT) (No. 2016R1A2B4015929)

主　　题：Massively-parallel search engine DB-IR integration Pre-join Multiple-keyword search queries Distributed memory

摘要：We propose two-dimensional indexing—a novel in-memory indexing architecture that operates over distributed memory of a massively-parallel search engine. The goal of two-dimensional indexing is to provide a one-integrated-memory view as in a single node system using one large integrated memory. In two-dimensional indexing, we partition the entire index into n× m fragments and distribute them over the memories of multiple nodes in such a way that each fragment is entirely stored in main memory of one node. The proposed architecture is not only scalable as it uses a scaled-out shared-nothing architecture but also is capable of achieving low query response time as it processes queries in main memory. We also propose the concept of the one-memory point, which is the amount of the memory space required to completely store the entire index in main memory providing a one-integrated-memory view. We first prove the effectiveness of two-dimensional indexing with single-keyword queries, and then, extend the notion so as to be able to handle multiple-keyword queries. To handle multiple-keyword queries, we adopt pre-join that materializes a multiple-keyword query a priori as well as a new notion of semi-memory join that obviates extensive communication overhead to perform join across multiple nodes. In experiments using the real-life search query set over a database consisting of 100 million Web documents crawled, we show that two-dimensional indexing can effectively provide a one-integrated-memory view without too much of additional memory compared with the single node system using one large integrated memory. We also show that, with a six-node prototype, in an ideal case, it significantly improves the query processing performance over a disk-based search engine with an equivalent amount of in-memory buffer but without two-dimensional indexing — by up to 535.54 times. This improvement is expected to get larger as the system is scaled-out with a larger number of machines.

本地馆藏 | 借阅须知 | 我要预约

已订购，未入库

sda

目录详情 | 试阅读 |

读者评论与其他读者分享你的观点

学校读者

用户名:未登录

我的评分

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

看过本文的还看了

相关文献

该作者的其他文献

CADAL相关文献

Two-dimensional indexing to provide one-integrated-memory view of distributed memory for a massively-parallel search engine (vol 22, pg 2437, 2018)

读者评论与其他读者分享你的观点

请选择收藏分类：

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

看过本文的还看了

相关文献

该作者的其他文献

CADAL相关文献

Two-dimensional indexing to provide one-integrated-memory view of distributed memory for a massively-parallel search engine (vol 22, pg 2437, 2018)

读者评论 与其他读者分享你的观点

请选择收藏分类： 新增自定义分类 确定 取消

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

读者评论与其他读者分享你的观点

请选择收藏分类：