Deep learning networks are gradually deployed in edge applications, such as phones and cameras, which has a restriction of the computational resources. Thus, to improve the computational efficiency, numerous types of ...
详细信息
Deep learning proved effective in detecting COVID-19 from chest X-ray images. Although computing is the current bottleneck in training and deploying models, few studies focus on developing efficient frameworks while m...
详细信息
In reality, the laborious nature of label annotation leads to the widespread existence of limited labeled data. Moreover, multi-scale data have received widespread attention due to its rich knowledge representation. H...
详细信息
We tackle the problem of Non-stochastic Control (NSC) with the aim of obtaining algorithms whose policy regret is proportional to the difficulty of the controlled environment. Namely, we tailor the Follow The Regulari...
详细信息
Multilingual Neural Machine Translation (NMT) excels in sharing knowledge across languages and transferring insights from high-resource languages to improve performance in low-resource languages. However, its performa...
详细信息
Multilingual Neural Machine Translation (NMT) excels in sharing knowledge across languages and transferring insights from high-resource languages to improve performance in low-resource languages. However, its performance lags in specific domains such as legal and medical. Previous works have focused on adding language-specific and domain-specific adapters to achieve domain adaptation. Although effective, these adapter-based methods only use domain data to train additional parameters, limiting the performance of multilingual NMT. In this paper, we propose CDSTX, a novel approach that achieves robust multilingual domain adaptation without relying on adapters, solely leveraging multilingual models. Specifically, we utilize the pretrained model XLM-R and frozen embeddings to preserve its robust multilingual capabilities and design a two-stage training strategy for domain adaptation. This includes separate training for the decoder and fine-tuning of the entire model, ensuring that the model effectively acquires domain knowledge and correctly represents domain-specific text. Moreover, to better utilize the domain information conveyed implicitly by the training data, we devise special domain tokens at the beginning of the source and target sentences called Source&Target Domain Tags. In addition, back translation is employed to enhance the cross-lingual transfer ability of our approach. Our proposed method is evaluated on two datasets across three translation tasks: Single-domain Multilingual NMT, Multi-domain bilingual NMT, and Multilingual Multi-domain NMT. Notably, in the single-domain multilingual NMT task, CDSTX significantly enhances zero-shot domain translation performance, achieving improvements of up to +30 BLEU points through the utilization of back-translation techniques. Even in the bilingual multi-domain NMT task where specific domain data for the target translation direction is unavailable, our method consistently outperforms all SOTA methods. Moreover, in the
Spherical evolution (SE) is a recently proposed meta-heuristic algorithm. Its special search approach has been proved to be very effective in exploring the search space. SE is very powerful for optimization, but still...
详细信息
Depending on a device's encryption mechanism, a wide variety of tangible details could be exposed. These leaks are used in side-channel analysis, which is used to get keys. Due to deep learning's sensitivity t...
详细信息
Continuous-time Markov processes are governed by the Chapman-Kolmogorov differential equation. We show that replacing the standard time derivative of the governing equation with a Caputo fractional derivative of order...
详细信息
Continuous-time Markov processes are governed by the Chapman-Kolmogorov differential equation. We show that replacing the standard time derivative of the governing equation with a Caputo fractional derivative of order 0<α<1, leads to a fractional differential equation whose solution can describe the state probabilities of a class of non-Markovian stochastic processes. We show that the same state probabilities also solve a system of equations that describe semi-Markov processes in which the sojourn times follow a Mittag-Leffler distribution, contrasting the usual Markov processes with exponentially distributed sojourn times. We apply the fractional framework to the ɛ−SIS epidemic process on any contact graph and we propose a microscopic epidemic description in which infection and curing events follow a Mittag-Leffler distribution and are not independent. We analytically prove that the description exactly solves the fractional extension of the Chapman-Kolmogorov differential equation, and we provide an extensive study of how the dependence between events strongly affects the dynamics of the spreading process. We conclude verifying the proposed framework with Monte Carlo simulations.
Understanding the impact of events reported in news articles plays pivotal roles in various tasks. In this study, we propose a framework for extracting event components from the news and analyzing their multifaceted i...
详细信息
ISBN:
(数字)9798350377088
ISBN:
(纸本)9798350377095
Understanding the impact of events reported in news articles plays pivotal roles in various tasks. In this study, we propose a framework for extracting event components from the news and analyzing their multifaceted impact. The framework employs transformers techniques for event extraction and leverages a biography knowledge base to assess the potential impact of events across social, political, economic, and culture (SPEC) dimensions. Our framework achieves an F1-score of 78.12% in event identification as well as significant accuracy score in evaluating the impact levels prediction. By integrating results from event extraction and impact analysis, the level of multifaceted influence of the event along with the event information are effectively visualized. This comprehensive approach enhances understanding of multi-perspective interpretation and supports insights for various applications.
Deep learning (DL) has transformed the field of medical image processing, enabling unprecedented accuracy and efficiency in various applications. It has been widely utilized for medical image analysis of different ana...
Deep learning (DL) has transformed the field of medical image processing, enabling unprecedented accuracy and efficiency in various applications. It has been widely utilized for medical image analysis of different anatomical regions. In this article, we provide an overview of commonly used deep learning methods, including convolutional neural networks (CNNs), recurrent neural networks (RNNs), and generative adversarial networks (GANs). We briefly overview their applications in medical image analysis, such as image classification, object detection/localization, segmentation, generation, and registration. We also highlight the strengths and limitations of each method and identify the challenges that still need to be addressed, including the limited availability of annotated data, variability in medical images, and the interpretability issue. Finally, we discuss future research directions, including developing explainable deep learning methods and integrating multi-modal data.
暂无评论