检索结果-内蒙古大学图书馆

ACM SIGMOD International Conference on Management of Data

作者： Kalinin, Alexander Cetintemel, Ugur Zdonik, Stan Brown Univ Providence RI 02912 USA

ISBN: (纸本)9781450335317

Searchlight [9] enables search and exploration of large, multidimensional data sets interactively. It allows users to explore by specifying rich constraints for the "objects" they are interested in identifying. Constraints can express a variety of properties, including a shape of the object (e.g., a waveform interval of length 10-100ms), its aggregate properties (e.g., the average amplitude of the signal over the interval is greater than 10), and similarity to another object (e.g., the distance between the interval's waveform and the query waveform is less than 5). Searchlight allows users to specify an arbitrary number of such constraints, with mixing different types of constraints in the same query. Searchlight enhances the query execution engine of an array DBMS (currently SciDB) with the ability to perform sophisticated search using the power of Constraint Programming (CP). This allows an existing CP solver from Or Tools (an open-source suite of operations research tools from Google) to directly access data inside the DBMS without the need to extract and transform it. This demo will illustrate the rich search and exploration capabilities of Searchlight, and its innovative technical features, by using the real-world MIMIC II data set, which contains waveform data for multi-parameter recordings of ICU patients, such as ABP (Arterial Blood Pressure) and ECG (electrocardiogram). Users will be able to search for interesting waveform intervals by specifying aggregate properties of the corresponding signals. In addition, they will be able to search for intervals similar to already found, where similarity is defined as a distance between the signal sequences.

关键词： Data exploration Query processing Constraint Programming array databases Interactive Results

来源：评论

学校读者我要写书评

暂无评论

Formal representation of the SS-DB benchmark and experimental evaluation in EXTASCID

引用

DISTRIBUTED AND PARALLEL databases 2015年第3期33卷 277-317页

作者： Cheng, Yu Rusu, Florin Univ Calif Merced CA 95343 USA

Evaluating the performance of scientific data processing systems is a difficult task considering the plethora of application-specific solutions available in this landscape and the lack of a generally-accepted benchmark. The dual structure of scientific data coupled with the complex nature of processing complicate the evaluation procedure further. SS-DB is the first attempt to define a general benchmark for complex scientific processing over raw and derived data. It fails to draw sufficient attention though because of the ambiguous plain language specification and the extraordinary SciDB results. In this paper, we remedy the shortcomings of the original SS-DB specification by providing a formal representation in terms of arrayQL algebra operators and arrayQL/SciQL constructs. These are the first formal representations of the SS-DB benchmark. Starting from the formal representation, we give a reference implementation and present benchmark results in EXTASCID, a novel system for scientific data processing. EXTASCID is complete in providing native support both for array and relational data and extensible in executing any user code inside the system by the means of a configurable metaoperator. These features result in significant improvement over SciDB at data loading, extracting derived data, and operations over derived data.

关键词： array databases array algebra array query languages Benchmark evaluation Scientific data processing

来源：评论

学校读者我要写书评

暂无评论

xWCPS: Bridging the Gap Between array and Semi-structured Data 19

xWCPS: Bridging the Gap Between Array and Semi-structured Da...

引用

19th International Conference on Knowledge Engineering and Knowledge Management (EKAW)

作者： Liakos, Panagiotis Koltsida, Panagiota Kakaletris, George Baumann, Peter Athena Res & Innovat Ctr Maroussi 15125 Greece Jacobs Univ Bremen D-28759 Bremen Germany

ISBN: (纸本)9783319179667;9783319179650

The ever growing amount of information collected by scientific instruments and the presence of descriptive metadata accompanying them calls for a unified way of querying over array and semi-structured data. We present xWCPS, a novel query language that bridges the path between these two different worlds, enhancing the expressiveness and user-friendliness of previous approaches.

关键词： array databases Query language Scientific data

来源：评论

学校读者我要写书评

暂无评论

A Distributed Infrastructure for Earth-Science Big Data Retrieval

引用

INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS 2015年第2期24卷 1550002-1550002页

作者： Liakos, Panagiotis Koltsida, Panagiota Kakaletris, George Baumann, Peter Ioannidis, Yannis Delis, Alex Athena Res & Innovat Ctr Maroussi 15125 Greece Jacobs Univ Bremen D-28759 Bremen Germany Univ Athens Athens 15784 Greece

Earth-Science data are composite, multi-dimensional and of significant size, and as such, continue to pose a number of ongoing problems regarding their management. With new and diverse information sources emerging as well as rates of generated data continuously increasing, a persistent challenge becomes more pressing: To make the information existing in multiple heterogeneous resources readily available. The widespread use of the XML data-exchange format has enabled the rapid accumulation of semi-structured metadata for Earth-Science data. In this paper, we exploit this popular use of XML and present the means for querying metadata emanating from multiple sources in a succinct and effective way. Thereby, we release the user from the very tedious and time consuming task of examining individual XML descriptions one by one. Our approach, termed Meta-array Data Search (MAD Search), brings together diverse data sources while enhancing the user-friendliness of the underlying information sources. We gather metadata using different standards and construct an amalgamated service with the help of tools that discover and harvest such metadata;this service facilitates the end-user by offering easy and timely access to all metadata. The main contribution of our work is a novel query language termed xWCPS, that builds on top of two widely-adopted standards: XQuery and the Web Coverage Processing Service (WCPS). xWCPS furnishes a rich set of features regarding the way scientific data can be queried with. Our proposed unified language allows for requesting metadata while also giving processing directives. Consequently, the xWCPS-enabled MAD Search helps in both retrieval and processing of large data sets hosted in an heterogeneous infrastructure. We demonstrate the effectiveness of our approach through diverse use-cases that provide insights into the syntactic power and overall expressiveness of xWCPS. We evaluate MAD Search in a distributed environment that comprises five high-vol

关键词： array databases declarative query language scientific data

来源：评论

学校读者我要写书评

暂无评论

SciDB: A Database Management System for Applications with Complex Analytics

引用

COMPUTING IN SCIENCE & ENGINEERING 2013年第3期15卷 54-62页

作者： Stonebraker, Michael Brown, Paul Zhang, Donghui Becla, Jacek MIT Comp Sci & Artificial Intelligence Lab Cambridge MA 02139 USA MIT Intel Sci & Technol Ctr Cambridge MA 02139 USA

A description and discussion of the SciDB database management system focuses on lessons learned, application areas, performance comparisons against other solutions, and additional approaches to managing data and compl... 详细信息

关键词： array databases complex analytics large-scale matrix operations parallel databases scientific computing scientific databases Scientific computing Scientific database database management systems parallel databases

来源：评论

学校读者我要写书评

暂无评论

Flexible and efficient IR using array databases

引用

VLDB JOURNAL 2008年第1期17卷 151-168页

作者： Cornacchia, Roberto Heman, Sandor Zukowski, Marcin de Vries, Arjen P. Boncz, Peter Ctr Wiskunde & Informat NL-1098 SJ Amsterdam Netherlands

The Matrix Framework is a recent proposal by Information Retrieval (IR) researchers to flexibly represent information retrieval models and concepts in a single multidimensional array framework. We provide computational support for exactly this framework with the array database system SRAM (Sparse Relational array Mapping), that works on top of a DBMS. Information retrieval models can be specified in its comprehension-based array query language, in a way that directly corresponds to the underlying mathematical formulas. SRAM efficiently stores sparse arrays in (compressed) relational tables and translates and optimizes array queries into relational queries. In this work, we describe a number of array query optimization rules. To demonstrate their effect on text retrieval, we apply them in the TREC TeraByte track (TREC-TB) efficiency task, using the Okapi BM25 model as our example. It turns out that these optimization rules enable SRAM to automatically translate the BM25 array queries into the relational equivalent of inverted list processing including compression, score materialization and quantization, such as employed by custom-built IR systems. The use of the high-performance MonetDB/X100 relational backend, that provides transparent database compression, allows the system to achieve very fast response times with good precision and low resource usage.

关键词： information retrieval array databases query optimization database compression

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：