In this demonstration, we show three interrelated tools intended to improve different aspects of the quality of data warehouse solutions. Firstly, the deductive object manager ConceptBase is intended to enrich the sem...
详细信息
ISBN:
(纸本)0818685751
In this demonstration, we show three interrelated tools intended to improve different aspects of the quality of data warehouse solutions. Firstly, the deductive object manager ConceptBase is intended to enrich the semantics of data warehouse solutions by including an explicit enterprise-centered concept of quality. the positive impact of precise multidimensional data models on the client interface is demonstrated by CoDecide, an Internet-based toolkit for the flexible visualization of multiple, interrelated data cubes. Finally, MIDAS is a hybrid data mining system which analyses multi-dimensional data to further enrich the semantics of the meta database, using a combination of neural network techniques, fuzzy logic, and machine learning.
In tertiary storage systems, the data is stored on multiple tape volumes where each tape is further divided into files. Since in many such systemsthe minimum unit of data transfer is a file, it is an important proble...
详细信息
In tertiary storage systems, the data is stored on multiple tape volumes where each tape is further divided into files. Since in many such systemsthe minimum unit of data transfer is a file, it is an important problem to match file sizes withthe access patterns to the data. In general, if the file size is large relative to the query size it will lead to the transfer of large amount of irrelevant data whereas small file sizes will incur an overhead penalty associated with reading each new file. In this work, we analyze the relationship between file sizes and query response times and provide a methodology to compute the optimal file size given information about the distribution of query sizes. Exact closed form solutions for the cost function are given for two common distributions.
In the RasDaMan project a database system for management of multdimensional arrays is being built. It offers a declarative query language extending SQL-92 with operations on arrays of arbitrary base types and a C++ pr...
详细信息
ISBN:
(纸本)0818685751
In the RasDaMan project a database system for management of multdimensional arrays is being built. It offers a declarative query language extending SQL-92 with operations on arrays of arbitrary base types and a C++ programming interface. Integrating arrays in the query language enables the system to process complex queries on high-volume multidimensional data in the database server close to physical data storage. Storage of arrays is done in tiles of arbitrary size. Operations on arrays are transformed into operations on riles during query optimization and execution. these operations are then executed on tiles loaded from mass storage. this paper describes the underlying formal model for tile based operations on multidimensional arrays and its efficient implementation in C++ as part of the RasDaMan system.
A methodology (JAGAth) and a tool based on it for establishing the performance of a distributed system within an empirical framework are presented. A set of performance variables representing the system performance is...
详细信息
ISBN:
(纸本)3540649492
A methodology (JAGAth) and a tool based on it for establishing the performance of a distributed system within an empirical framework are presented. A set of performance variables representing the system performance is derived in terms of a combination of a set of performance measures. these performance measures represent the actual measurements of the events in the system. these performance variables are used to display the system performance status in the form of Kiviat graphs. the causal relationship between the performance variables and the internal system control variables and the workload characteristics can be established. this can be used in a performance 'tuning' system.
We address the issue of designing effective query languages for OLAP databases. the basis of our investigation is MD), a new data model for multidimensional databases that unlike other multidimensional models, is inde...
详细信息
ISBN:
(纸本)0818685751
We address the issue of designing effective query languages for OLAP databases. the basis of our investigation is MD), a new data model for multidimensional databases that unlike other multidimensional models, is independent of any specific implementation and as such provides a clear separation between practical and conceptual aspects. In this framework, we present and compare two query languages, based on different paradigms, for OLAP databases. the first language is algebraic and provides an effective way to manipulate multidimensional data in a procedural fashion. Although this language is clean and powerful, it is clearly not suited for final users. We therefore propose a high-level graphical language that allows the user to specify analytical queries in a natural ann intuitive walt. It turns out that the two languages have the same expressive power.
For the past several years a small team of developers at the Jet Propulsion Laboratory and at the University of Rhode Island have been working on a system to provide Earth scientists with access to remotely held data ...
详细信息
For the past several years a small team of developers at the Jet Propulsion Laboratory and at the University of Rhode Island have been working on a system to provide Earth scientists with access to remotely held data sets stored in various standard formats as though the data were local to the user's application. this capability requires not only interoperability among multiple formats and remote access to data via the Internet, but also the ability to identify and select small subsets of data by space, time, and measured parameter. this demo presents the combination of an http-based client/server application that facilitates Internet access to Earth science data coupled with a Java applet GUI that allows the user to graphically select data based on spatial and temporal coverage plots and scientific parameters. Access to data by values of the measured parameters is feasible via the same indexing schemes used for space and time, and work to include this capability is in progress.
the National Comprehensive Cancer Network (NCCN) is an alliance of 16 of the world's leading cancer centers, formed to help ensure the highest quality cancer care. City of Hope National Medical Center serves as th...
详细信息
the National Comprehensive Cancer Network (NCCN) is an alliance of 16 of the world's leading cancer centers, formed to help ensure the highest quality cancer care. City of Hope National Medical Center serves as the data-coordinating center (DCC) for the first NCCN shared outcomes research project. A Web-based database system has been designed and implemented over the Internet. Cancer centers located all over the United States are submitting data to this system that accommodates varying levels of database expertise and capabilities. the application is browser independent and incorporates several unique design, confidentiality, and security features, such as and the use of `ghost buttons' and a `secret agent' approach to security. New SQL and Web-enabled features in the SAS statistical software package are integrated into this system as well. A demonstration of the NCCN database system will be presented.
We present results providing DB support to biomedicine via federation of SDB Cooperation/lntegration based upon the KEGG GUI to molecular biology. the federation provides a common link to three molecular biology datab...
详细信息
ISBN:
(纸本)0818685751
We present results providing DB support to biomedicine via federation of SDB Cooperation/lntegration based upon the KEGG GUI to molecular biology. the federation provides a common link to three molecular biology databases. the value-added of the federation is freedom from consulting multiple references to ascertain the fill set of enzymatic reactions in a metabolic pathway, and the option of selecting multiple queries to submit to the federated SDBs. Each of the SDBs is extensive, but incomplete. the union of the SDBs, implemented transparently by the federation, is more complete. Each SDB provides a different approach to the options available for data presentation and a different set of web server tools for data analysis. thus, an important part of the value-added of the federation is the cross-fertilization available in the union of the molecular biological content, the presentation of data, and the tools available for analysis.
the proceedings contain 35 papers. the special focus in this conference is on Software Performance Tools and Network Performance. the topics include: A modular and scalable simulation tool for large wireless networks;...
ISBN:
(纸本)3540649492
the proceedings contain 35 papers. the special focus in this conference is on Software Performance Tools and Network Performance. the topics include: A modular and scalable simulation tool for large wireless networks;designing process replication and threading policies;software reliability estimation and prediction tool;reusable software components for performability tools and their utilization for web-based configurable tools;a performance evaluation tool for communication networks with multicast data streams;response times in client-server systems;a queueing model with varying service rate for ABR;simulative performance evaluation of the temporary pseudonym method for protecting location information in GSM networks;a model driven monitoring approach to support the multi-view performance analysis of parallel responsive applications;instrumentation of synchronous reactive systems for performance analysis;a perturbation and reduction based algorithm;a comparison of numerical splitting-based methods for Markovian dependability and performability models;probability, parallelism and the state space exploration problem;an improved multiple variable inversion algorithm for reliability calculation;performance evaluation of web proxy cache replacement policies;performance analysis of a WDM bus network based on GSPN models;scheduling write backs for weakly-connected mobile clients;on choosing a task assignment policy for a distributed server system;structured characterization of the Markov chain of phase-type SPN;performance evaluation of distributed object architectures;an execution driven interconnection network simulator for DSM systems and integrated measurement and analysis tool for internet and its use in wireless in-house environment.
the proceedings contain 22 papers. the special focus in this conference is on Design I, Data Warehouses and Extensible DBMS. the topics include: A comprehensive view of process engineering;aligning legacy information ...
ISBN:
(纸本)354064556X
the proceedings contain 22 papers. the special focus in this conference is on Design I, Data Warehouses and Extensible DBMS. the topics include: A comprehensive view of process engineering;aligning legacy information systems to business processes;automated reverse engineering of legacy 4GL information system applications using the ITOC workbench;adapting function points to object oriented information systems;global cache management for multi-class workloads in data warehouses;architecture and quality in data warehouses;model extensibility of OODBMS for advanced application domains;an environment for designing exceptions in workflows;automating handover in dynamic workflow environments;document-centric groupware for distributed governmental agencies;specifying the reuse context of scenario method chunks;change analysis and management in a reuse-onented software development setting;a filter-mechanism for method-driven trace capture;subject-based organization of the information space in multi-database networks;an interactive networked multimedia applications specification environment with E-LOTOS translator;a user-oriented approach to querying the web;application in electricity deregulation;real-time information system for risk management on motorways;describing business processes with a guided use case approach;building quality into case-based reasoning systems;assembly techniques for method engineering and formalizing materialization using a metaclass approach.
暂无评论