ISBN: 9781728128382 (print)
Visualization tools allow visual and interactive data exploration, facilitating the interpretation of complex data sets. The Center for Research and Ambulatory Care of the Sickle Cell Disease (CERPAD), which leads the neonatal sickle cell screening program in Saint-Louis, Senegal, aims to set up a long-term database on which data analysis and reporting tools are built for epidemiological and socioanthropological studies. In this paper, we propose a tool for the proportional visualization of genotypes and phenotypes using rainbow boxes, a recently introduced set visualization technique. We propose an improvement to rainbow boxes featuring area-proportional boxes instead of height-proportional boxes. This new approach is integrated into the SIMENS module of the CERPAD to visualize the proportions of sickle cell data by genotype, by phenotype, and by ethnic group. A qualitative evaluation is provided by two domain experts.
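To make the difference between height-proportional and area-proportional boxes concrete, here is a minimal sketch. It assumes that a box's width is fixed by the number of columns it spans and that only its height is free; the function name `box_height` and the `area_scale` parameter are hypothetical and not taken from the paper.

```python
def box_height(proportion: float, width: float, area_scale: float = 100.0) -> float:
    """Return the height that makes the box area equal area_scale * proportion.

    Assumption: width is imposed by the layout (number of columns spanned),
    so height is the only free dimension.
    """
    if width <= 0:
        raise ValueError("width must be positive")
    return area_scale * proportion / width


# Two hypothetical boxes that represent the same proportion (30%)
# but span a different number of columns.
wide = box_height(0.30, width=3)
narrow = box_height(0.30, width=1)

# With area-proportional sizing both boxes cover the same area,
# even though their heights differ.
wide_area = wide * 3
narrow_area = narrow * 1
```

With height-proportional sizing, by contrast, both boxes would have the same height, so the wider box would look three times larger despite encoding the same proportion.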
ISBN: 9781665469463 (digital)
ISBN: 9781665469463 (print)
Breakthroughs in transformer-based models have revolutionized not only the NLP field, but also vision and multimodal systems. However, although visualization and interpretability tools have become available for NLP models, the internal mechanisms of vision and multimodal transformers remain largely opaque. With the success of these transformers, it is increasingly critical to understand their inner workings, as unraveling these black boxes will lead to more capable and trustworthy models. To contribute to this quest, we propose VL-InterpreT, which provides novel interactive visualizations for interpreting the attentions and hidden representations in multimodal transformers. VL-InterpreT is a task-agnostic and integrated tool that (1) tracks a variety of statistics in attention heads throughout all layers for both vision and language components, (2) visualizes cross-modal and intra-modal attentions through easily readable heatmaps, and (3) plots the hidden representations of vision and language tokens as they pass through the transformer layers. In this paper, we demonstrate the functionalities of VL-InterpreT through the analysis of KD-VLP, an end-to-end pretraining vision-language multimodal transformer-based model, in the tasks of Visual Commonsense Reasoning (VCR) and WebQA, two visual question answering benchmarks. Furthermore, we also present a few interesting findings about multimodal transformer behaviors that were learned through our tool.
ISBN: 9781450375177 (print)
Software engineers have a wide variety of tools and techniques that can help them improve the quality of their code, but many bugs still remain undetected. In this paper we build on the idea that if a particular fragment of code is changed too often, the cause may be a technical or architectural issue, so the fragment requires additional attention from developers. Most teams nowadays use version control systems to track changes in their code and organize cooperation between developers. We propose to use data from version control systems to track the number of changes for each method in a project over a selected time period and to display this information within the IDE's code editor. The paper describes such a tool, called Topias, built as a plugin for IntelliJ IDEA. Its source code is available at https://***/JetBrains-Research/topias. A demonstration video can be found at https://***/watch?v=xsqc4gCTxfA.
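The idea of counting changes from version control data can be sketched as follows. This is a simplified illustration, not the Topias implementation: it counts changes per file rather than per method, and it assumes its input is the text printed by `git log --name-only --pretty=format:` (one path per line, with blank lines between commits). The function name and the sample log are hypothetical.

```python
from collections import Counter


def count_file_changes(log_text: str) -> Counter:
    """Count how many commits touched each path.

    Assumption: log_text is the output of
    `git log --name-only --pretty=format:`, i.e. one path per line.
    """
    counts = Counter()
    for line in log_text.splitlines():
        path = line.strip()
        if path:  # skip the blank separators between commits
            counts[path] += 1
    return counts


# Hypothetical log covering three commits.
sample_log = """src/Parser.java
src/Lexer.java

src/Parser.java

src/Parser.java
README.md
"""

# The most frequently changed path is the candidate for extra attention.
hot = count_file_changes(sample_log).most_common(1)
```

Restricting the log to a time period (for example with `git log --since`) would correspond to the "selected time period" mentioned in the abstract; attributing changes to individual methods additionally requires parsing the source code, which this sketch leaves out.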
ISBN: 9781665440578 (print)
Augmented reality (AR) is well suited for situated visualization (SV), a method to represent data in context, with potential in many situations. At the same time, AR-based SV poses challenges and research prospects. A vital challenge is the users' egocentric viewpoint, which limits their ability to explore all the available information. To create new approaches that overcome this limitation, this work began by understanding the relevance of viewpoints in typical AR-based SV scenarios and characterizing the approaches already proposed to tackle it. The newly proposed approaches should be evaluated in different settings to improve and validate them, and to derive guidelines for some relevant scenarios.
ISBN: 9781665409155 (print)
Infographics communicate information using a combination of textual, graphical and visual elements. This work explores the automatic understanding of infographic images by using a Visual Question Answering technique. To this end, we present InfographicVQA, a new dataset comprising a diverse collection of infographics and question-answer annotations. The questions require methods that jointly reason over the document layout, textual content, graphical elements, and data visualizations. We curate the dataset with an emphasis on questions that require elementary reasoning and basic arithmetic skills. For VQA on the dataset, we evaluate two strong Transformer-based baselines. Both baselines yield unsatisfactory results compared to near-perfect human performance on the dataset. The results suggest that VQA on infographics (images that are designed to communicate information quickly and clearly to the human brain) is ideal for benchmarking machine understanding of complex document images. The dataset is available for download at ***
ISBN: 9781665448970 (print)
Modern software typically provides more than one functionality. These functionalities, or features, are not always organized so that the modules representing them can be used individually. Many software engineering approaches, such as programming language constructs and product line visualization techniques, have been proposed to organize projects as modules. Unfortunately, much legacy software suffers from years or decades of improper coding practices that leave the modules in the code almost undetectable. In such scenarios, it is desirable to identify the modules representing different features so that they can be extracted. In this paper, we propose a novel approach that combines information retrieval and program analysis to allow domain experts to identify slices of the program that represent modules using natural language search terms. We evaluate our approach by building a proof-of-concept tool in C and extracting modules from open source projects.
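One way to picture the combination of a search step with a program analysis step is the rough sketch below. It is an illustration under strong simplifying assumptions, not the paper's method: the retrieval step is reduced to substring matching on function names, and slicing is approximated by reachability in a call graph. The call graph, function names, and helper names are all hypothetical.

```python
from collections import deque


def find_seeds(call_graph: dict, terms: list) -> set:
    """Retrieval step (simplified): functions whose name contains a search term."""
    lowered = [t.lower() for t in terms]
    return {fn for fn in call_graph if any(t in fn.lower() for t in lowered)}


def reachable(call_graph: dict, seeds: set) -> set:
    """Analysis step (simplified): everything the seed functions call, transitively."""
    seen = set(seeds)
    queue = deque(seeds)
    while queue:
        fn = queue.popleft()
        for callee in call_graph.get(fn, ()):
            if callee not in seen:
                seen.add(callee)
                queue.append(callee)
    return seen


# Hypothetical call graph: caller -> list of callees.
call_graph = {
    "main": ["parse_args", "compress_file"],
    "parse_args": [],
    "compress_file": ["read_block", "huffman_encode"],
    "read_block": [],
    "huffman_encode": [],
}

# Candidate module for the search term "compress".
module = reachable(call_graph, find_seeds(call_graph, ["compress"]))
```

In this toy example the candidate module contains `compress_file` and its two callees, while `main` and `parse_args` are left out.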
ISBN: 9781538676165 (print)
The development of digital media technology has led to changes in news communication. An open question is whether, among the many forms of visual expression, there is an effective one that both explains news and information fully and objectively and helps the audience digest and comprehend that information most easily. In this study, we use a controlled experiment, taking the channel effect as the theoretical basis and graphics and animation visualization as the objects of study, to measure the cognitive effect on 60 college students in Wuhan when they are presented with specific news information through the two visual techniques.
The geographical and meteorological conditions of Japan make the country prone to frequent natural disasters such as earthquakes, tsunamis, typhoons, and floods. In the event of a major natural disaster, rapid and acc...
ISBN: 9781538605691 (print)
Executive decisions are core factors in the growth of an organization. While one right decision can make a business soar, one wrong decision can bring it down. With increasing competition in the IT industry, investments, business directives, and business data are increasing exponentially. Hence, business owners must be especially cautious and keep all of these factors in mind when making decisions. This creates a strong need for a tool that measures and monitors the growth of the business and provides evidence that the business is heading in a profitable direction. Although a few methods for measuring company growth exist, they provide only basic information or are restricted in how they analyze business behavior, and therefore fail to give business owners full assurance in decision making. This work provides an efficient, novel solution to these problems by developing a Performance Dashboard. The proposed technique integrates business intelligence, data mining, and data visualization technologies to analyze business trends, business growth, profit, employee performance, customer satisfaction, areas for improvement, and more. The performance dashboard presents this information by understanding business behavior from the organization's start. It acts as an information management tool for tracking metrics, Key Performance Indicators (KPIs), and additional key factors applicable to the business or a specific process. Using data visualization techniques, the dashboard simplifies complex data sets to give users at-a-glance awareness of present performance and to keep track of each department's capability to meet service level targets.
ISBN: 9781538604571 (print)
It is well-established by cognitive neuroscience that human perception of objects constitutes a complex process, where object appearance information is combined with evidence about the so-called object "affordances", namely the types of actions that humans typically perform when interacting with them. This fact has recently motivated the "sensorimotor" approach to the challenging task of automatic object recognition, where both information sources are fused to improve robustness. In this work, the aforementioned paradigm is adopted, surpassing current limitations of sensorimotor object recognition research. Specifically, the deep learning paradigm is introduced to the problem for the first time, developing a number of novel neuro-biologically and neuro-physiologically inspired architectures that utilize state-of-the-art neural networks for fusing the available information sources in multiple ways. The proposed methods are evaluated using a large RGB-D corpus, which is specifically collected for the task of sensorimotor object recognition and is made publicly available. Experimental results demonstrate the utility of affordance information to object recognition, achieving an up to 29% relative error reduction by its inclusion.