Building on recent advances in describing redundancy and synergy in multivariate interactions among random variables, we propose an approach to quantify cooperative effects in feature importance, a key technique in ex...
详细信息
Building on recent advances in describing redundancy and synergy in multivariate interactions among random variables, we propose an approach to quantify cooperative effects in feature importance, a key technique in explainable artificial intelligence. Specifically, we introduce an adaptive version of the widely used metric Leave One Covariate Out (LOCO), designed to disentangle high-order effects involving a particular input feature in regression problems. LOCO measures the reduction in prediction error when the feature of interest is added to the set of features used in regression. Unlike the standard approach that computes LOCO using all available features, our method identifies the subsets of features that maximize and minimize LOCO. This results in a decomposition of LOCO into a two-body component and higher-order components (redundant and synergistic), while also identifying the features that contribute to these high-order effects in conjunction with the driving feature. We demonstrate the effectiveness of the proposed method in a benchmark dataset related to wine quality and to proton versus pion discrimination using simulated detector measurements generated by GEANT.
Generative Adversarial Network has high capabilities for generating realistic images and other real-life applications due to its generator-discriminator interactions. Like other neural networks in deep learning, an ef...
详细信息
Supervised deep learning (SDL) has shown remarkable success in various financial applications, such as stock prediction and fraud detection. However, SDL’s reliance on class labels renders it unsuitable for portfolio...
详细信息
ISBN:
(数字)9798350370249
ISBN:
(纸本)9798350370270
Supervised deep learning (SDL) has shown remarkable success in various financial applications, such as stock prediction and fraud detection. However, SDL’s reliance on class labels renders it unsuitable for portfolio management (PM) tasks, where such labels are often unavailable. To address this limitation, we propose a novel two-level architecture based on deep reinforcement learning (DRL) for PM, which does not require class labels. Our approach comprises several local agents that provide trading decisions and uncertainty assessments for individual stocks, and a global agent that makes portfolio management decisions based on the outputs of the local agents. Additionally, we incorporate the concept of explainable AI (XAI) into our framework using the SHAP (Shapley additive explanations) method, enhancing the transparency and interpretability of the global agent’s decisions. Our experimental results demonstrate that the proposed architecture consistently yields profitable outcomes in the market.
The popularity of social media amplified the amount of text data that is used to enrich text classification research with machine learning approach. The modernization in weather information system is needed to produce...
详细信息
Nuanced dialects are a linguistic variant that pose several challenges for NLP models and techniques. One of the main challenges is the limited amount of datasets to enable extensive research and experimentation. We p...
详细信息
作者:
Arai, KeisukeTakai, YuukiDepartment of Mathematics
School of Science and Technology for Future Life Tokyo Denki University 5 Senju Asahi-cho Adachi-ku Tokyo120-8551 Japan Mathematics
Science Data Science and AI Program Academic Foundations Programs Kanazawa Institute of Technology 7-1 Ohgigaoka Ishikawa Nonoichi921-8501 Japan
In this paper, we give an equivalent condition for an abelian variety over a finite field to have multiplication by a quaternion algebra over a number field. We prove the result by combining Tate’s classification of ...
详细信息
3D pathology can potentially improve disease detection, but the datasets are too large to review. We're developing a deep-learning-based triage method to identify the highest-risk 2D sections within 3D pathology d...
详细信息
The possibility to computationally prioritize candidate disease genes capitalizing on existing information has led to a speedup in the discovery of new methods. Many gene discovery techniques exploit network data, lik...
详细信息
To determine whether Ohio college re-opening plans were effective in controlling the spread of COVID-19, cumulative case counts by county were gathered to compare various metrics related to the spread of COVID-19 case...
To determine whether Ohio college re-opening plans were effective in controlling the spread of COVID-19, cumulative case counts by county were gathered to compare various metrics related to the spread of COVID-19 cases between counties with NCAA colleges and counties without NCAA colleges. Various non-parametric statistical tests were used to determine if the samples were similar, and the analysis found the differences were statistically significant. Metropolitan and non-metropolitan groupings were also added to further subdivide the data set, but the analysis found no statistically significant differences in this case.
This paper introduces SARA, a semantic-assisted reinforced active learning framework for enhancing entity alignment (EA) under limited supervision scenarios. SARA addresses the challenges of EA in real-world scenarios...
详细信息
ISBN:
(数字)9798350359312
ISBN:
(纸本)9798350359329
This paper introduces SARA, a semantic-assisted reinforced active learning framework for enhancing entity alignment (EA) under limited supervision scenarios. SARA addresses the challenges of EA in real-world scenarios, including knowledge graph heterogeneity and limited training ground truth. SARA effectively selects valuable entity pairs with limited labeled data by combining reinforced active learning and semantic information. It utilizes a pair-wise language model based on Sentence-BERT to learn informative name embeddings that capture entity name semantics. These embeddings are combined with structural embeddings and trained using a novel semantic-assisted alignment loss. Extensive experiments on benchmark datasets and a real-world dataset demonstrate the superiority of SARA over existing approaches, particularly in limited labeled data scenarios. The paper also provides insights into fine-tuning strategies, presents ablation studies, and conducts sensitivity analyses to validate the effectiveness of SARA.
暂无评论