ISBN: 0769508359 (print)
Data integration has grown in popularity in recent years. This paper describes a data browser for a data integration system in which an intermediate layer of distributed mediators is used to query and integrate data from heterogeneous data sources. The data sources can be regular relational databases but also other data-producing programs. They often have complex data representations and are often object-oriented (OO). The mediator database layer is therefore also object-oriented, to provide a high level of abstraction. An OO query interface is used to access the mediator layer from application programs and by users. For a scalable, component-based architecture, mediators can be used as servers for other mediators. This leads to a distributed mediator architecture in which mediator servers interact with other mediators and data sources. The OO multi-mediator browser GOOVI is presented, which enables maintenance of such distributed mediator databases. With GOOVI, all autonomous mediators in a federation can be viewed, queried, and updated. This multi-mediator browser also provides user interfaces for integrating data through OO views. The paper describes the architecture and functionality of GOOVI.
ISBN: 0769508340; 0769508359 (print)
While surfing the World Wide Web, the user typically has to deal with a broad range of heterogeneity, not only in content but also in conceptual structure, which is reflected in particular by the hyperlink structure. Supported by the protagonists of the Semantic Web, ontology-based approaches to structuring parts of the Internet or Intranets have recently gained attention. These approaches usually rely on internal annotation of Web pages. Since it is not easy to commit distributed Web authors to a common terminology and correct annotation syntax, internal annotation requires central access to all Web pages of interest. The approach is therefore well suited to (closed) Intranets but not to the open Internet. In contrast, this contribution presents an ontology-based approach to external annotation of Web pages. External annotation has the advantage that existing documents need not be edited and is therefore potentially applicable to parts of the Internet that cannot be controlled. The approach is applied to scale up a prototype for semantic navigation of an evidence-based medical information service on the Internet, currently running at a relatively small scale, to data-intensive Extranets.
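The idea of external annotation can be illustrated with a minimal sketch. This is not the system from the paper: the ontology fragment, the example URLs, and the helper names `is_a` and `pages_about` are all hypothetical. The sketch only shows the general principle that annotations are kept in a store separate from the pages, so the pages themselves are never edited.

```python
from typing import Dict, List, Optional, Set

# Hypothetical ontology fragment: concept -> parent concept (None at the root).
PARENT: Dict[str, Optional[str]] = {
    "MedicalTopic": None,
    "Therapy": "MedicalTopic",
    "Diagnosis": "MedicalTopic",
    "DrugTherapy": "Therapy",
}

# External annotations live in their own store, keyed by URL.
# The annotated pages are not modified (hypothetical example URLs).
ANNOTATIONS: Dict[str, Set[str]] = {
    "http://example.org/aspirin.html": {"DrugTherapy"},
    "http://example.org/mri.html": {"Diagnosis"},
    "http://example.org/physio.html": {"Therapy"},
}


def is_a(concept: str, ancestor: str) -> bool:
    """Return True if `concept` equals `ancestor` or lies below it in the ontology."""
    current: Optional[str] = concept
    while current is not None:
        if current == ancestor:
            return True
        current = PARENT.get(current)
    return False


def pages_about(concept: str) -> List[str]:
    """Return URLs annotated with `concept` or any of its sub-concepts."""
    return sorted(
        url
        for url, concepts in ANNOTATIONS.items()
        if any(is_a(c, concept) for c in concepts)
    )


print(pages_about("Therapy"))
```

Querying for a general concept such as `"Therapy"` also returns pages annotated with its sub-concepts, which is the kind of semantic navigation an ontology makes possible on pages whose authors never agreed on a shared vocabulary.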
ISBN: 0769508359 (print)
This paper presents a new approach for interactive visualization of data warehouses and data mining results in an immersed virtual environment. DIVE-ON is a data mining system prototype that is capable of constructing a multidimensional data model on a remote system, transporting pertinent views to a CAVE, creating an immersed virtual environment, and providing an interactive data mining toolset. The main objective of this research is to examine the possibility of effectively mining, visualizing, and manipulating large amounts of distributed multidimensional data with little or no instructional help. To achieve this, DIVE-ON immerses the user in a virtual environment and provides a set of intuitive and effective interaction techniques within the CAVE. Intuitiveness was addressed by exploiting the user's considerable natural experience in interacting with and navigating through a 3-dimensional world, and by understanding the characteristics of a virtual environment that is well suited to the visual analysis of data. The ability to perform OLAP operations intuitively in such an environment gives the user an effective means to conceptualize and gain insight into large volumes of data from several distributed sources.
The usability and learnability of a computer system may be evaluated with predictive models during the design phase. Evaluation may also be done on the executable code or by observing the user in action. In the latter case, data collected in vivo must be processed. The goal is to provide software support for this difficult and time-consuming task. The paper presents an early analysis of, and experience relating to, the automatic evaluation of multimodal user interfaces. To this end, a generic Wizard of Oz platform has been designed to allow the observation and automatic recording of subjects' behavior while they interact with a multimodal interface. It is then shown how recorded data can be analyzed to detect behavioral patterns, and how deviations of such patterns from a data-flow-oriented task model can be exploited by a software usability critic.
ISBN: 9781424425785 (print)
Data-intensive computing generates huge amounts of data across wide-area networks. DataGrid technology tries to manage such distributed data on the Internet and to provide a quick and efficient mechanism for searching and accessing it. Data access on DataGrid systems is difficult because the organizations that manage the storage resources differ in how they manage data and in their policies. In this paper, we propose a new distributed data management scheme and design for data-intensive computing. In particular, we focus on data attributes: we define the pairs of a data attribute and its value as the metadata of the data. In our system, users can find and access data through this metadata. We have been developing prototype systems, and we show how our system is used with applications.
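As an illustration of finding data through attribute/value metadata, here is a minimal in-memory sketch. It is not the prototype described in the abstract: the `Catalog` type, the `find_data` function, and the sample attribute names are hypothetical, and a real system would distribute the catalog across organizations.

```python
from typing import Dict, List

# Hypothetical in-memory catalog: logical data name -> attribute/value metadata.
Catalog = Dict[str, Dict[str, str]]


def find_data(catalog: Catalog, **query: str) -> List[str]:
    """Return the names of entries whose metadata matches every queried attribute."""
    return sorted(
        name
        for name, metadata in catalog.items()
        if all(metadata.get(attr) == value for attr, value in query.items())
    )


# Sample entries with made-up attributes, for illustration only.
catalog: Catalog = {
    "run-001.dat": {"experiment": "climate", "year": "2007", "site": "A"},
    "run-002.dat": {"experiment": "climate", "year": "2008", "site": "B"},
    "scan-17.dat": {"experiment": "genome", "year": "2008", "site": "A"},
}

print(find_data(catalog, experiment="climate", year="2008"))
```

The user names no storage location or organization in the query; only attribute values are given, which is the point of addressing data by metadata.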
ISBN: 9781450391450 (print)
The sixth HUMANIZE workshop on Transparency and Explainability in Adaptive Systems through User Modeling Grounded in Psychological Theory took place in conjunction with the 27th annual meeting of the Intelligent User Interfaces (IUI) community, which was hosted virtually by the University of Helsinki (Finland) on March 22, 2022. The 2022 edition of the workshop was held together with TExSS (Transparency and Explanations in Smart Systems). The workshop provided a venue for researchers from different fields to interact by accepting contributions on the intersection of practical data mining methods and theoretical knowledge for personalization. A total of two papers were accepted for this edition of the workshop.
ISBN: 0769508340; 0769508359 (print)
The goal here is to present a multidimensional visualization methodology and its applications to Visual and Automatic Knowledge Discovery in a coherent paper. Visualization provides insight through images and can be considered as a collection of application-specific mappings: Problem Domain --> Visual Range. For the visualization of multivariate problems, a multidimensional system of Parallel Coordinates (abbr. ||-coords) is constructed, which induces a one-to-one mapping between subsets of N-space and subsets of 2-space. The result is a rigorous methodology for doing and seeing N-dimensional geometry. We start with an overview of the mathematical foundations, where it is seen that, from the display of high-dimensional datasets, the search for multivariate relations among the variables is transformed into a 2-D pattern recognition problem. This is the basis for the application to Visual Knowledge Discovery, which is illustrated in the second part with a real dataset from VLSI production. Then a recent geometric classifier is presented and applied to 3 real datasets. Compared with the results of 23 other classifiers, its results have the least error. The algorithm has quadratic computational complexity in the size and number of parameters, provides comprehensible and explicit rules, does dimensionality selection - where the minimal set of original variables required to state the rule is found - and orders these variables so as to optimize the clarity of separation between the designated set and its complement. Finally, a simple visual economic model of a real country is constructed and analyzed in order to illustrate the special strength of ||-coords in modeling multivariate relations by means of hypersurfaces.
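The mapping behind parallel coordinates can be sketched in a few lines. The sketch below is an illustration under stated assumptions, not code from the paper: the function names are hypothetical, and the axes are assumed to be vertical lines at x = 0, d, 2d, ... with spacing d = `spacing`. A point in N-space becomes a polyline with one vertex per axis. For two adjacent axes, all points on a line x2 = m·x1 + b with m ≠ 1 map to segments that pass through the single point (d/(1−m), b/(1−m)), which is how a linear relation turns into a 2-D pattern.

```python
from typing import List, Sequence, Tuple

Point2D = Tuple[float, float]


def to_polyline(point: Sequence[float], spacing: float = 1.0) -> List[Point2D]:
    """Map an N-dimensional point to polyline vertices, one per parallel axis."""
    return [(i * spacing, value) for i, value in enumerate(point)]


def dual_point(m: float, b: float, spacing: float = 1.0) -> Point2D:
    """Point shared by the segments of all points on the line x2 = m*x1 + b."""
    if m == 1:
        # Segments are parallel in this case, so they meet only at infinity.
        raise ValueError("slope 1 has no finite dual point")
    return (spacing / (1 - m), b / (1 - m))


def y_on_segment(p: Point2D, q: Point2D, x: float) -> float:
    """Height at horizontal position x of the straight line through p and q."""
    (x1, y1), (x2, y2) = p, q
    return y1 + (y2 - y1) * (x - x1) / (x2 - x1)


# Two points on the line x2 = -x1 + 4, drawn between the first two axes.
dx, dy = dual_point(m=-1.0, b=4.0)
for pt in [(1.0, 3.0), (0.0, 4.0)]:
    first, second = to_polyline(pt)
    print(pt, "passes through height", y_on_segment(first, second, dx), "at x =", dx)
```

Both segments cross at the same location, so a set of data points lying on one line shows up as a bundle of segments through one point of the display.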
ISBN: 9781450375139 (print)
The fourth HUMANIZE workshop on Transparency and Explainability in Adaptive Systems through User Modeling Grounded in Psychological Theory took place in conjunction with the 25th annual meeting of the Intelligent User Interfaces (IUI) community in Cagliari, Italy, on March 17, 2020. The workshop provided a venue for researchers from different fields to interact by accepting contributions on the intersection of practical data mining methods and theoretical knowledge for personalization. A total of four papers were accepted for this edition of the workshop.
ISBN: 9798400705090 (print)
The seventh HUMANIZE workshop on Transparency and Explainability in Adaptive Systems through User Modeling Grounded in Psychological Theory took place in conjunction with the 29th annual meeting of the Intelligent User Interfaces (IUI) community, held March 18-21, 2024 in Greenville, South Carolina, USA. The 2024 edition of the workshop was held together with SOCIALIZE (social and cultural integration with personalized interfaces). The workshop provided a venue for researchers from different fields to interact by accepting contributions on the intersection of practical data mining methods and theoretical knowledge for personalization. A total of two papers were accepted for this edition of the workshop.
ISBN: 9781479970384 (print)
For systems that execute a mixture of different data-intensive applications in parallel, there is always the question of how much impact each application has on the storage subsystem. From the perspective of storage, I/O is typically anonymous, as it does not contain user identifiers or similar information. This paper focuses on the analysis of performance data collected on shared system components, such as global file systems, that cannot be mapped back to user activities immediately. Our approach groups user jobs into classes based on their properties and correlates these classes with global timelines. In the paper we give details of the clustering algorithm, describe our measurement environment, and present first results. The results are valuable for tuning HPC storage systems to achieve optimized behavior at a global system level, or for separating users into classes with different I/O demands.
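The step of correlating job classes with a global timeline can be sketched as follows. This is a simplified illustration and not the paper's algorithm: the clustering itself is skipped (class labels are assumed to be given), the `Job` record, the class names, and the sample load values are hypothetical, and plain Pearson correlation is used as a stand-in measure.

```python
from math import sqrt
from typing import Dict, List, NamedTuple, Sequence


class Job(NamedTuple):
    job_class: str  # class label, assumed to come from a prior clustering step
    start: int      # first time step in which the job runs
    end: int        # time step at which the job has finished (exclusive)


def class_timelines(jobs: Sequence[Job], steps: int) -> Dict[str, List[int]]:
    """Count, per class and time step, how many jobs of that class are running."""
    timelines: Dict[str, List[int]] = {}
    for job in jobs:
        line = timelines.setdefault(job.job_class, [0] * steps)
        for t in range(max(job.start, 0), min(job.end, steps)):
            line[t] += 1
    return timelines


def pearson(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson correlation coefficient; returns 0.0 if either series is constant."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


# Made-up anonymous file-system load per time step, and two labeled jobs.
global_load = [10, 50, 50, 10, 10, 10]
jobs = [Job("large-io", 1, 3), Job("small-io", 3, 6)]

timelines = class_timelines(jobs, steps=len(global_load))
for name, line in sorted(timelines.items()):
    print(name, round(pearson(line, global_load), 3))
```

A class whose activity rises and falls together with the anonymous global load is a candidate for having caused it, even though the storage system never sees which user issued the I/O.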