Search Results - Inner Mongolia University Library

Automatic identification and classification of Palomar Transient Factory astrophysical objects in GLADE

INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2018, Vol. 16, No. 4, pp. 337-349

Authors: Zhao, Weijie; Rusu, Florin; Wu, Kesheng; Nugent, Peter. Affiliations: Univ Calif Merced, 5200 N Lake Rd, Merced, CA 95343, USA; Lawrence Berkeley Natl Lab, 1 Cyclotron Rd, Berkeley, CA 94720, USA

Palomar Transient Factory (PTF) is a comprehensive detection system for the identification and classification of transient astrophysical objects. In this paper, we make two significant contributions to the PTF pipeline. First, we present an experimental study that evaluates a novel implementation of the real-time classifier in GLADE, a parallel data processing system that combines the efficiency of a database with the extensibility of map-reduce. We show how each stage in the classifier maps optimally into GLADE tasks by taking advantage of the unique features of the system: range-based data partitioning, columnar storage, multi-query execution, and in-database support for complex aggregate computation. Second, we introduce a novel parallel similarity join algorithm for advanced transient classification. We implement this algorithm in GLADE and execute it on a massive supercomputer with more than 3,000 threads, achieving more than three orders of magnitude improvement over the PostgreSQL solution.

Keywords: parallel databases; multi-query processing; scientific data analysis; similarity join; astronomical surveys; transient identification


Simultaneous processing of multi-Skyline Queries with MapReduce

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, Vol. E100D, No. 7, pp. 1516-1520

Authors: Kim, Junsu; Lee, Kyong-Ha; Kim, Myoung-Ho. Affiliations: Korea Adv Inst Sci & Technol, Daejeon, South Korea; KISTI, Daejeon, South Korea

With the rapid increase in the number of applications and in data sizes, multi-query processing on the MapReduce framework has gained much attention. Meanwhile, there has been much interest in skyline query processing because of its power in multi-criteria decision making and analysis. Recently, there have been attempts to optimize multi-query processing in MapReduce. However, they are not suited to processing multiple skyline queries efficiently, and they also require modifications to the Hadoop internals. In this paper, we propose an efficient method for processing multi-skyline queries with MapReduce without any modification of the Hadoop internals. Through various experiments, we show that our approach outperforms previous studies by orders of magnitude.

Keywords: multi-query processing; skyline query; MapReduce framework


LCA-based algorithms for efficiently processing multiple keyword queries over XML streams

DATA & KNOWLEDGE ENGINEERING, 2016, Vol. 103, pp. 1-18

Authors: Barros, Evandrino G.; Laender, Alberto H. F.; Moro, Mirella M.; da Silva, Altigran S. Affiliations: Ctr Fed Educ Tecnol Minas Gerais, Belo Horizonte, MG, Brazil; Univ Fed Minas Gerais, Belo Horizonte, MG, Brazil; Univ Fed Amazonas, Manaus, Amazonas, Brazil

In a stream environment, unlike in traditional databases, data arrive continuously, unindexed and potentially unbounded, and queries must be evaluated to produce results on the fly. In this article, we propose two new algorithms (called SLCAStream and ELCAStream) for processing multiple keyword queries over XML streams. Both algorithms process keyword-based queries that require minimal or no schema knowledge to be formulated, follow the lowest common ancestor (LCA) semantics, and provide optimized methods to improve the overall performance. Moreover, SLCAStream, which implements the smallest LCA (SLCA) semantics, outperforms the state-of-the-art, with up to 49% reduction in response time and 36% in memory usage. In turn, ELCAStream is the first to explore the exclusive LCA (ELCA) semantics over XML streams. A comprehensive set of experiments evaluates several aspects related to the performance and scalability of both algorithms, showing that they are effective alternatives for search services over XML streams. (C) 2016 Elsevier B.V. All rights reserved.

Keywords: multi-query processing; keyword-based queries; XML streams; LCA semantics


MKStream: An Efficient Algorithm for processing multiple Keyword Queries over XML Streams

33rd International Conference on Conceptual Modeling (ER)

ISBN: (print) 9783319122069; 9783319122052

In this paper, we tackle the problem of processing various keyword-based queries over XML streams in a scalable way, improving recent multi-query processing approaches. We propose a customized algorithm, called MKStream, that relies on parsing stacks designed for simultaneously matching several queries. Particularly, it explores the possibility of adjusting the number of parsing stacks for a better trade-off between processing time and memory usage. A comprehensive set of experiments evaluates its performance and scalability against the state-of-the-art, and shows that MKStream is the most efficient algorithm for keyword search services over XML streams.

Keywords: multi-query processing; keyword-based queries; XML streams


Optimizing Multiple XML Queries in a Distributed Environment

22nd China National Database Conference

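Authors: He Juzhen (和菊珍); Peng Dunlu (彭敦陆); Wang Xiaoling (王晓玲); Zhou Aoying (周傲英). Affiliations: Department of Computer Science and Engineering, Fudan University; School of Computer Engineering, University of Shanghai for Science and Technology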

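1 Introduction. As a standard data format for information exchange and publishing, XML has been widely adopted in recent years. One example is RSS (RDF Site Summary), a simple XML-based way of sharing content between sites. Its applications include collecting the latest blog content and aggregating news, and may in the future extend to news search, job-listing registration, and similar services. For a data source holding large-scale RSS documents, as the number of user queries keeps increasing...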

Keywords: multi-query processing; XML compression; load balance
