Deep learning for automated cell imaging analysis has become a tool of choice to process large amounts of data. But many of these methods lack explainability, slowing down their deployment for tasks such as diagnosis....
详细信息
ISBN:
(纸本)9798350349405;9798350349399
Deep learning for automated cell imaging analysis has become a tool of choice to process large amounts of data. But many of these methods lack explainability, slowing down their deployment for tasks such as diagnosis. We present a prototype-based framework to analyze structural changes which addresses the specific challenges of explainability in the context of cell imaging. Our method relies on classification between two distinct cell populations in a weakly supervised context where no label for individual cells is available. Our model extracts typical features from each population, representing intra-cellular structure, and provides an explanation on its classification decision by creating visualization of the local textures corresponding to the structures of interest. We show a real application where it effectively highlights a change in the organization of the actin content of the cells.
We describe Cartolabe, a web-based multiscale system for visualizing and exploring large textual corpora based on topics, introducing a novel mechanism for the progressive visualization of filtering queries. Initially...
详细信息
We describe Cartolabe, a web-based multiscale system for visualizing and exploring large textual corpora based on topics, introducing a novel mechanism for the progressive visualization of filtering queries. Initially designed to represent and navigate through scientific publications in different disciplines, Cartolabe has evolved to become a generic framework and accommodate various corpora, ranging from Wikipedia (4.5M entries) to the French National Debate (4.3M entries). Cartolabe is made of two modules: The first relies on natural language processing methods, converting a corpus and its entities (documents, authors, and concepts) into high-dimensional vectors, computing their projection on the two-dimensional plane, and extracting meaningful labels for regions of the plane. The second module is a web-based visualization, displaying tiles computed from the multidimensional projection of the corpus using the Umap projection method. This visualization module aims at enabling users with no expertise in visualization and dataanalysis to get an overview of their corpus, and to interact with it: exploring, querying, filtering, panning, and zooming on regions of semantic interest. Three use cases are discussed to illustrate Cartolabe's versatility and ability to bring large-scale textual corpus visualization and exploration to a wide audience.
The work is devoted to the methods of parallel processing of spectral data of astronomical objects, which is an important aspect in many scientific fields, in particular in astronomy when studying physical processes a...
详细信息
Over the past few years, photovoltaic systems are replacing traditional sources of energy. These photovoltaic systems are powered by solar energy, which depends on the amount of incident solar irradiance. However, sol...
详细信息
ISBN:
(纸本)9798350320107
Over the past few years, photovoltaic systems are replacing traditional sources of energy. These photovoltaic systems are powered by solar energy, which depends on the amount of incident solar irradiance. However, solar irradiance has a large amount of variability due to various spatio-temporal factors including region, cloud cover, humidity, and rainfall. In this paper1, we identify the most relevant spatio-temporal features for estimating solar irradiance. Such an accurate identification of weather variables is useful for developing accurate data-driven solar irradiance estimation models. This leads to an improved forecasting performance, thus creating a stable photovoltaic system. We base our analysis on the dataset of three countries, Ireland, Singapore and India. We identify the cloud type as the most relevant weather variables in all regions. The other relevant features holding differential importance across regions include time, relative humidity, solar zenith angle, and pressure.
Textbook question answering (TQA) is the task of correctly answering diagram or nondiagram (ND) questions given large multimodal contexts consisting of abundant essays and diagrams. In real-world scenarios, an explain...
详细信息
Textbook question answering (TQA) is the task of correctly answering diagram or nondiagram (ND) questions given large multimodal contexts consisting of abundant essays and diagrams. In real-world scenarios, an explainable TQA system plays a key role in deepening humans' understanding of learned knowledge. However, there is no work to investigate how to provide explanations currently. To address this issue, we devise a novel architecture toward span-level eXplanations for TQA (XTQA). In this article, spans are the combinations of sentences within a paragraph. The key idea is to consider the entire textual context of a lesson as candidate evidence and then use our proposed coarse-to-fine grained explanation extracting (EE) algorithm to narrow down the evidence scope and extract the span-level explanations with varying lengths for answering different questions. The EE algorithm can also be integrated into other TQA methods to make them explainable and improve the TQA performance. Experimental results show that XTQA obtains the best overall explanation result [mean intersection over union (mIoU)] of 52.38% on the first 300 questions of CK12-QA test splits, demonstrating the explainability of our method (ND: 150 and diagram: 150). The results also show that XTQA achieves the best TQA performance of 36.46% and 36.95% on the aforementioned splits. We have released our code in https://***/dr-majie/opentqa.
The paper aims to explore the differences in the field of smart education by comparing and analyzing the annotation results of international Chinese classroom data. Applying Professor Yanqun Zheng's International ...
详细信息
In recent years, wargame has become an important competition in China. The data research of the MiaoSuan Wargame platform provide powerful data computing support for effective decision-making to commanders, who will f...
详细信息
ISBN:
(纸本)9798350385113;9798350385106
In recent years, wargame has become an important competition in China. The data research of the MiaoSuan Wargame platform provide powerful data computing support for effective decision-making to commanders, who will face a large number of rapidly updated battlefield situation data per second. In this paper, three kinds of damage effect datasets of direct aiming attack, indirect aiming attack and guided aiming attack have been constructed based on the wargame replay big data and the matching map data, and visual dataanalysis has been carried out. The correlation between the damage effect and each parameter have been analyzed. Preparing for the model of future machine learning algorithm training, which can provide damage effect prediction analysis for commanders decision-making to improve damage effect and attack efficiency.
The minimum spanning tree (MST) plays an important role in variant fields, such as chip design and network analysis. With the rapid expansion of vertices in real-life graphs, the bottleneck problem of MST algorithms i...
详细信息
ISBN:
(纸本)9798350387117;9798350387124
The minimum spanning tree (MST) plays an important role in variant fields, such as chip design and network analysis. With the rapid expansion of vertices in real-life graphs, the bottleneck problem of MST algorithms in large-scale graphs grows more prominent. While there have been many FPGA-based accelerators for large-scale graph algorithms such as Graph Random Walk, and various algorithms to accelerate MST on CPUs and GPUs, effectively implementing MST algorithms for large-scale graphs on FPGAs remains quite challenging. This is due to several reasons: (1) The neighbor vertices in the graph require extensive random memory access and the memory access characteristics vary across different stages and iterations. (2) There are a large number of useless computations due to the existence of internal edges within a component (intra-edge). (3) Parallel MST algorithm suffers from significant communication overhead due to the minimum edge data update conflicts and memory read-write conflicts. This paper proposes AMST to accelerate large-scale graph MST computation on FPGA. First, AMST employs a customized hash-based high-degree vertex cache (HDC) to improve memory access efficiency. Second, AMST adopts a graph pruning strategy that skips intra-edge and sorts edges by weight to eliminate useless computation and memory access. Finally, AMST utilizes a sorting networking module and a multi-port HDC to improve parallel efficiency. The experimental results demonstrate that AMST achieves an average performance speedup of 17.52x over CPU and 1.89x over GPU, as well as 74.96x over CPU and 10.45x over GPU on energy efficiency.
All major web browsers support extensions to provide additional functionalities and enhance users' browsing experience while the extensions can access and collect users' data during their web browsing. Althoug...
详细信息
ISBN:
(纸本)9781665493369
All major web browsers support extensions to provide additional functionalities and enhance users' browsing experience while the extensions can access and collect users' data during their web browsing. Although the web extensions inform users of their data practices via multiple forms of notices, prior work has overlooked the critical gap between the actual data practices and the published privacy notices of browser extensions. To fill this gap, we propose ExtPrivA that automatically detects the inconsistencies between browser extensions' data collection and their privacy disclosures. From the privacy policies and Dashboard disclosures, ExtPrivA extracts privacy statements to have a clear interpretation of the privacy practices of an extension. It emulates user interactions to trigger the extension's functionalities and analyzes the initiators of network requests to accurately extract the users' data transferred by the extension from the browser to external servers. Our end-to-end evaluation has shown ExtPrivA to detect inconsistencies between the privacy disclosures and data-collection behavior with an 85% precision. In a large-scale study of 47.2k extensions on the Chrome Web Store, we found 820 extensions with 1,290 flows that are inconsistent with their privacy statements. Even worse, we have found 525 pairs of contradictory privacy statements in the Dashboard disclosures and privacy policies of 360 extensions. These discrepancies between the privacy disclosures and the actual data-collection behavior are deemed as serious violations of the Store's policies. Our findings highlight the critical issues in the privacy disclosures of browser extensions that potentially mislead, and even pose high privacy risks to, end-users.
暂无评论