ISBN: 0769516327 (print)
Scientific research relies as much on the dissemination and exchange of data sets as on the publication of conclusions. Accurately tracking the lineage (origin and subsequent processing history) of scientific data sets is thus imperative for the complete documentation of scientific work. However, the lack of a definitive data model for lineage, and the poor fit between current data management tools and scientific software, effectively prevent researchers from determining, preserving, or providing the lineage of the data products they use and create. Based on a comprehensive review of lineage-related research and previous prototype systems, a conceptual framework is presented to help identify and assess basic lineage system components. Within this framework, a direction is outlined for future work on general methods for composing and managing lineage for scientific data.
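To make "lineage (origin and subsequent processing history)" concrete, the following is a minimal sketch of how a derived data set might record its inputs and how its ancestors could be traced. It is not the framework from the paper, which notes that no definitive lineage data model exists; the names `DataSet`, `LineageStep`, and `trace_lineage` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class LineageStep:
    """One processing step that produced a data set (hypothetical structure)."""
    operation: str
    inputs: List[str]


@dataclass
class DataSet:
    """A data set with either an external origin or a deriving step."""
    name: str
    origin: Optional[str] = None              # source of raw data, if any
    derived_by: Optional[LineageStep] = None  # step that produced it, if derived


def trace_lineage(name: str, catalog: Dict[str, DataSet]) -> List[str]:
    """Return the names of all ancestors of `name`, nearest ancestors first."""
    ancestors: List[str] = []
    seen = {name}
    queue = [name]
    while queue:
        current = queue.pop(0)
        step = catalog[current].derived_by
        if step is None:
            continue  # raw data set: nothing further upstream
        for parent in step.inputs:
            if parent not in seen:
                seen.add(parent)
                ancestors.append(parent)
                queue.append(parent)
    return ancestors


# Illustrative catalog: raw -> clean -> summary
catalog = {
    "raw": DataSet("raw", origin="field survey"),
    "clean": DataSet("clean", derived_by=LineageStep("filter", ["raw"])),
    "summary": DataSet("summary", derived_by=LineageStep("aggregate", ["clean"])),
}
lineage_of_summary = trace_lineage("summary", catalog)
```

Here `lineage_of_summary` lists `clean` and then `raw`, the processing history that would have to accompany `summary` for it to be fully documented.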
ISBN: 0769516327 (print)
There are many examples where cooperation among scientists takes place through the exchange of scientific resources, such as data, programs, and mathematical models. This is particularly true for environmental applications. Finding the right resource to apply to an environmental problem is a difficult task, and the decision is usually based on previous experience, so scientists have to cooperate to solve such problems. To facilitate the exchange, reuse, and dissemination of information, we propose an architecture for managing distributed scientific resources. Our proposal combines a mediation-based heterogeneous distributed database system and an enhanced metadata support system for effective management of distributed scientific models and data.
ISBN: 0818679530 (print)
The goal of this panel is to address the issues associated with establishing scientific and statistical databases as an integral part of the educational curriculum within academe. The purpose of this panel is to initiate a dialog among the SSDBM research community, government agencies, and academe to discuss the challenges associated with specifying and implementing a Scientific Database Education (SDBE) curriculum. The panelists bring experience in facilitating and teaching such courses through traditional and distance education models.
ISBN: 0818679530 (print)
Database management systems (DBMS) have to provide certain facilities to meet the requirements of scientific and statistical database management (SSDBM). Reviewing problems and promises of current database technology, the STEP standard is assessed for selected aspects of data management, data access, data exchange, and data modeling. STEP-based solutions are proposed for concrete examples of SSDBM, especially in the context of the scientific data exchange standard FITS. We introduce and discuss EXPRESS, the modeling language of STEP, and SDAI, the corresponding data access interface. The performance of navigational access provided by SDAI is considered a crucial aspect. Exploiting the code generation mechanism used to instantiate SDAI for a given programming language (we call it a generated call interface), an adequate software architecture on top of an ODBMS as well as STEP-specific optimizations are proposed.
To extend the scope of multivariate data visualization, the notion of comparative visualization is introduced: it allows the comparison of visualization methods by interconnecting several different graphic displays. This linking of visualizations, together with the ability to manipulate data interactively, enables an analyst to display the same data set with a number of conceptually different visualization methods simultaneously and to carry out graphical operations across them. Graphical effects in different displays not only reveal information about the data themselves, they also provide the basis for investigating how the different visualization methods relate to each other. With 'VisuLab', we developed a software tool for personal computers to investigate comparative multivariate data visualization.
Described is a prototype database and analysis system developed to support the specific domain of forest canopy research. This effort utilized a multidisciplinary team of information (database systems), statistical analysis, and forest canopy scientists. Both large-scale (Oracle) and smaller-scale (Visual FoxPro) databases were prototyped. A Web-based query interface to the Oracle system was also demonstrated. This paper addresses the FoxPro database and S-Plus statistical analysis interface developed to meet the data analysis, data integration, and data distribution requirements of the originating forest canopy research team. The prototype system employs Visual FoxPro (VFP) as the database engine. Visualization and analytical functions are demonstrated by the use of VFP forms and custom-designed S-Plus procedures. Finally, a Web database server facility is also demonstrated. The authors conclude that value-added support centers could be created to develop and disseminate small-scale database and analysis systems to a specific scientific community.
ISBN: 0769519644 (print)
The CenSSIS Image Database is a scientific database that enables effective data management and collaboration to accelerate fundamental research. This paper describes the design and use of a state-of-the-art relational image database management system, accessible through a standard web-browser interface. The application utilizes a robust security architecture and is designed for efficient data submission. Our database query engine provides complex query capabilities to facilitate fast and efficient data retrieval. The system offers a highly extensible metadata schema, with the option of storing data within a hierarchical format.
The concepts abstracted from reality and represented through the dimensions of a statistical database (SDB) help the user formulate and process queries. By contrast, useful properties involved in a query that cannot be obtained through the dimensions of an SDB (for example, the concept of adjacency) can be represented in a geographic database (GDB). This paper presents a conceptual approach that allows the end user, working in a GDB environment, to use data cubes stored in an SDB. In this context, we need to extend the geographic data structure with some special 'functional attributes'. They support links between the two environments through the geographic dimensions that are always implicitly or explicitly present in an SDB. The main objective of this paper is therefore to propose a solution for answering queries that involve data stored in both environments in a way that is transparent to the user. A query language that integrates multidimensional operators with geographic operators is then proposed. Finally, the main characteristics of the proposed approach are illustrated through some examples.
We describe the design and implementation of a scientific database for the map assembly tasks performed by the geneticists at the University of Michigan Human Genome Center. Our system manages complex genomic data and supports the automation of the associated map assembly tasks. For the former, we present a genomic object model that integrates both experimental and derived data. For the latter, we describe operators to automate some of the analysis steps. To develop a framework for implementing our rule-based approach to physical mapping, we have designed and implemented an active object-oriented database (OODB) system, called Crystal, on GemStone. Crystal seamlessly integrates inference capabilities with complex object modeling and other typical database capabilities as required for physical mapping. We also discuss the implementation of a physical map assembly tool on top of Crystal. In conclusion, we provide a walk-through example that demonstrates how our approach can be used to effectively support physical contig assembly.
ISBN: 0769516327 (print)
This paper discusses issues and solutions for supporting multiple overlapping classifications in database systems. These classifications are commonly found in science, although they are often ignored in computing applications for scientific data, and inappropriate solutions are adopted in their place. Known database models and classification techniques offer some degree of support for multiple overlapping classifications, but do not fully support the basic features we have identified as necessary: trees/graphs, traceability, semantics of classifications, independence of classification and data, and identity of classifications. The approach to the problem adopted by the Prometheus project, based on an extended object-oriented database model and the independence of classification schemes from classified data, is presented and discussed.
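The idea of keeping classification schemes independent of the classified data can be illustrated with a small sketch. This is not the Prometheus model itself, whose details are not given in the abstract; the `Classification` class, its methods, and the sample categories are hypothetical, and the sketch assumes each classification is a simple tree that refers to items only by identifier.

```python
from collections import defaultdict


class Classification:
    """A named tree of categories stored separately from the classified items."""

    def __init__(self, name):
        self.name = name
        self.parent = {}                 # category -> parent category (None for a root)
        self.members = defaultdict(set)  # category -> identifiers of classified items

    def add_category(self, category, parent=None):
        if parent is not None and parent not in self.parent:
            raise KeyError(f"unknown parent category: {parent}")
        self.parent[category] = parent

    def classify(self, item_id, category):
        if category not in self.parent:
            raise KeyError(f"unknown category: {category}")
        self.members[category].add(item_id)

    def path(self, category):
        """Return the chain of categories from the root down to `category`."""
        chain = []
        while category is not None:
            chain.append(category)
            category = self.parent[category]
        return list(reversed(chain))

    def categories_of(self, item_id):
        """Return the categories that directly contain `item_id`, sorted by name."""
        return sorted(c for c, items in self.members.items() if item_id in items)


# Two overlapping classifications of the same specimen; the specimen record
# itself is untouched and is referenced only by its identifier.
by_author_a = Classification("author A")
by_author_a.add_category("Family X")
by_author_a.add_category("Genus P", parent="Family X")
by_author_a.classify("specimen-1", "Genus P")

by_author_b = Classification("author B")
by_author_b.add_category("Family Y")
by_author_b.add_category("Genus Q", parent="Family Y")
by_author_b.classify("specimen-1", "Genus Q")
```

Because each scheme holds only identifiers, a new classification can be added or an old one revised without changing the specimen data, and `path` gives a simple form of traceability within one scheme.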