The increase of traffic in large cities cause serious problems, such as traffic accidents, delays at workplaces, stress and other problems, it makes it necessary to use new technologies to identify these zones and mak...
详细信息
This paper presents a tool enabling the visual analysis of multivariate heterogeneous data. large amounts of measured and contextual data are being gathered for a large number of applications, increasing connectivity ...
This paper presents a tool enabling the visual analysis of multivariate heterogeneous data. large amounts of measured and contextual data are being gathered for a large number of applications, increasing connectivity across different data types. While measured data are often quantitative, contextual data tend to be categorical. This results in datasets containing multivariate data with heterogeneous properties. Difference in the natures of these properties raises challenges when combining them for analysis. This paper presents the design of a tool that enables the exploration of multivariate heterogeneous data by combining the strengths of Parallel Coordinates and Parallel Sets. The design relied on the application domain of real-life mobility monitoring that is particularly affected by the challenge mentioned above. To validate the suggested approach this paper presents the result of a usability evaluation, which confirms that the presented design is as efficient as other exiting tools while providing more features for correlation analysis.
Scientific analysis of changes of the Earth's land surface benefit from well characterized, science quality remotely sensed data. This data quality is the result of models that estimate and remove atmospheric cons...
详细信息
ISBN:
(纸本)9798350320107
Scientific analysis of changes of the Earth's land surface benefit from well characterized, science quality remotely sensed data. This data quality is the result of models that estimate and remove atmospheric constituents and account for sun-sensor geometry [1]-[3]. Surface reflectance (SR) in commercial very high resolution (< 5 m;VHR) spaceborne imagery routinely varies for unchanged surface features because of signal variation from the combined effects of atmospheric haze and a range of sun-sensor geometric scenarios of acquisitions [4]. Consistency from this imagery must be sufficient to identify and track the change or stability of fine-scale features that, though small, may be widely distributed across remote domains, and serve as key indicators of critical broad-scale environmental change [5], [6]. Currently commercial SR products are available, but typically the model employed is proprietary and the costs for using these products over a large domain can be significant ( e.g., Planet Surface Reflectance v.2). Here we describe an open source workflow for the scientific community to improve detection of fine-scale change with commercial VHR imagery.
Noise, vibration, and harshness (NVH) simulation represents an important step in modern automotive design. This type of simulation produces large and complex data that is hard to analyze. The data resides in two domai...
详细信息
Thicket is an open-source Python toolkit for Exploratory dataanalysis (EDA) of multi-run performance experiments. It enables an understanding of optimal performance configuration for large-scale application codes. Mo...
详细信息
ISBN:
(纸本)9798400701559
Thicket is an open-source Python toolkit for Exploratory dataanalysis (EDA) of multi-run performance experiments. It enables an understanding of optimal performance configuration for large-scale application codes. Most performance tools focus on a single execution (e.g., single platform, single measurement tool, single scale). Thicket bridges the gap to convenient analysis in multi-dimensional, multi-scale, multi-architecture, and multi-tool performance datasets by providing an interface for interacting with the performance data. Thicket has a modular structure composed of three components. The first component is a data structure for multi-dimensional performance data, which is composed automatically on the portable basis of call trees, and accommodates any subset of dimensions present in the dataset. The second is the metadata, enabling distinction and sub-selection of dimensions in performance data. The third is a dimensionality reduction mechanism, enabling analysis such as computing aggregated statistics on a given data dimension. Extensible mechanisms are available for applying analyses (e.g., top-down on Intel CPUs), data science techniques (e.g., K-means clustering from scikit-learn), modeling performance (e.g., Extra-P), and interactive visualization. We demonstrate the power and flexibility of Thicket through two case studies, first with the open-source RAJA Performance Suite on CPU and GPU clusters and another with a large physics simulation run on both a traditional HPC cluster and an AWS Parallel Cluster instance.
The proceedings contain 75 papers. The topics discussed include: accelerating large language model training with hybrid GPU-based compression;demystifying swarm learning: an emerging decentralized federated learning s...
ISBN:
(纸本)9798350395662
The proceedings contain 75 papers. The topics discussed include: accelerating large language model training with hybrid GPU-based compression;demystifying swarm learning: an emerging decentralized federated learning system;IDIOMS: index-powered distributed object-centric metadata search for scientific data management;Brug: an adaptive memory (re-)allocator;MDSTGCN : multi-scale dynamic spatial-temporal graph convolution network with edge feature embedding for traffic forecasting;fair, efficient multi-resource scheduling for stateless serverless functions with Anubis;STRonG: system topology risk analysis on graphs;federated semi-supervised learning with local and global updating frequency optimization;and towards better QoS and lower costs of P4 EIP gateway at the edge.
With the high growth rate of text data, extracting meaningful information from a large corpus becomes increasingly difficult. Keyword extraction and analysis is a common approach to tackle the problem, but it is non-t...
详细信息
ISBN:
(纸本)9781665439312
With the high growth rate of text data, extracting meaningful information from a large corpus becomes increasingly difficult. Keyword extraction and analysis is a common approach to tackle the problem, but it is non-trivial to identify important words in the text and represent the multifaceted properties of those words effectively. Traditional topic modeling based keyword analysis algorithms require hyper-parameters which are often difficult to tune without enough prior knowledge. In addition, the relationships among the keywords are often difficult to obtain. In this paper, we utilize the attention scores extracted from Transformer-based language models to capture word relationships. We propose a domain-driven attention tuning method, guiding the attention to learn domain-specific word relationships. From the attention, we build a keyword network and propose a novel algorithm, Attention-based Word Influence (AWI), to compute how influential each word is in the network. An interactive visual analytics system, KeywordMap, is developed to support multi-level analysis of keywords and keyword relationships through coordinated views. We measure the quality of keywords captured by our AWI algorithm quantitatively. We also evaluate the usefulness and effectiveness of KeywordMap through case studies.
Traditional fault diagnosis technology or a single defect diagnostic approach cannot match the actual needs of specific and sophisticated large-scale systems such as launch rockets. The wide application of artificial ...
详细信息
We propose TimeTables, a novel prototype system that aims to support data exploration, using embodiment with space-time cubes in virtual reality. TimeTables uses multiple space-time cubes on virtual tabletops, which u...
详细信息
ISBN:
(纸本)9781665496179
We propose TimeTables, a novel prototype system that aims to support data exploration, using embodiment with space-time cubes in virtual reality. TimeTables uses multiple space-time cubes on virtual tabletops, which users can manipulate by extracting time layers or individual buildings to create new tabletop views. The surrounding environment includes a large space for multiple linked tabletops and a storage wall. TimeTables presents information at different time scales by stretching layers to drill down in time. Users can also jump into tabletops to inspect data from an egocentric perspective. We present a use case scenario of energy consumption displayed on a university campus to demonstrate how our system could support data exploration and analysis over space and time. From our experience and analysis we believe the system has a high potential in assisting spatio-temporal data exploration and analysis.
Automated program repair techniques address software errors, vulnerabilities, and defects through automation. With the rapid development of deep learning, deep learning-based automated repair techniques have improved ...
详细信息
暂无评论