Zika virus (ZIKV) and Dengue virus (DENV) infections cause severe disease in humans and are significant socio-economic burden worldwide. These flavivirus infections are difficult to diagnose serologically due to antig...
详细信息
Zika virus (ZIKV) and Dengue virus (DENV) infections cause severe disease in humans and are significant socio-economic burden worldwide. These flavivirus infections are difficult to diagnose serologically due to antigenic overlap. The phylogenetic analysis shows that ZIKV clusters with DENVs at a higher node of the phylogenetic tree with significant genomic and structural similarity. Our study aims to identify gene biomarkers for the classification of Dengue and Zika viral infections using machine learning algorithms and bioinformatics analysis. The gene expression count matrix for single-cell RNA sequencing dataset GSE110496 was analyzed using binary classifiers, namely Logistic regression, Support Vector Machines, Random Forest, and Decision trees. The GSE110496 dataset represents a unique study of the transcriptional and translational dynamics of DENV and ZIKV infections at 4-, 12-, 24-, and 48-h time points for human hepatoma (Huh7) cells. Out of which 24-h time point has been analyzed in this study, at the optimal threshold of viral molecules. Feature selection was performed using two different approaches Random Forest Classifier (RFC) for gene ranking and Recursive Feature Elimination (RFE). Out of which RFE, showed more accuracy and precision. The classification accuracy of 89.4% and the precision of 90% were obtained using selected 10 gene features. SCY1 Like Pseudokinase 3 (SCYL3), Chromosome 1 Open Reading Frame 112 (C1orf112), Complement factor H (CFH), Heme-binding protein 1 (HEBP1), Cadherin 1 (CDH1), Nibrin (NBN), Histone deacetylase 5 (HDAC5), nuclear receptor subfamily 0, group B, member 2 (NR0B2), Annexin A9 (ANXA9) and Alcohol dehydrogenase 6 (ADH6) are the proposed gene biomarkers in this study. The functional analysis of the reported biomarkers was performed using KEGG and GO with the WEB-based Gene SeT AnaLysis Toolkit (WebGestalt). The relationship of the selected biomarkers with DENV and ZIKV infections analyzed using a gene–gene interaction n
Pre-trained multilingual language models (PMLMs) such as mBERT and XLM-R have shown good cross-lingual transferability. However, they are not specifically trained to capture cross-lingual signals concerning sentiment ...
详细信息
Pre-trained multilingual language models (PMLMs) such as mBERT and XLM-R have shown good cross-lingual transferability. However, they are not specifically trained to capture cross-lingual signals concerning sentiment words. This poses a disadvantage for low-resource languages (LRLs) that are under-represented in these models. To better fine-tune these models for sentiment classification in LRLs, a novel intermediate task fine-tuning (ITFT) technique based on a sentiment lexicon of a high-resource language (HRL) is introduced. The authors experiment with LRLs Sinhala, Tamil and Bengali for a 3-class sentiment classification task and show that this method outperforms vanilla fine-tuning of the PMLM. It also outperforms or is on-par with basic ITFT that relies on an HRL sentiment classification dataset.
When data privacy is imposed as a necessity,Federated learning(FL)emerges as a relevant artificial intelligence field for developing machine learning(ML)models in a distributed and decentralized *** allows ML models t...
详细信息
When data privacy is imposed as a necessity,Federated learning(FL)emerges as a relevant artificial intelligence field for developing machine learning(ML)models in a distributed and decentralized *** allows ML models to be trained on local devices without any need for centralized data transfer,thereby reducing both the exposure of sensitive data and the possibility of data interception by malicious third *** paradigm has gained momentum in the last few years,spurred by the plethora of real-world applications that have leveraged its ability to improve the efficiency of distributed learning and to accommodate numerous participants with their data *** virtue of FL,models can be learned from all such distributed data sources while preserving data *** aim of this paper is to provide a practical tutorial on FL,including a short methodology and a systematic analysis of existing software ***,our tutorial provides exemplary cases of study from three complementary perspectives:i)Foundations of FL,describing the main components of FL,from key elements to FL categories;ii)Implementation guidelines and exemplary cases of study,by systematically examining the functionalities provided by existing software frameworks for FL deployment,devising a methodology to design a FL scenario,and providing exemplary cases of study with source code for different ML approaches;and iii)Trends,shortly reviewing a non-exhaustive list of research directions that are under active investigation in the current FL *** ultimate purpose of this work is to establish itself as a referential work for researchers,developers,and data scientists willing to explore the capabilities of FL in practical applications.
Understanding the structural growth of paediatric brains is a key step in the identification of various neuro-developmental disorders. However, our knowledge is limited by many factors, including the lack of automated...
详细信息
Negative sampling has emerged as an effective technique that enables deep learning models to learn better representations by introducing the paradigm of 'learn-to-compare.' The goal of this approach is to add ...
详细信息
In this study, we outline the design and implementation of a portable massively parallel asynchronous solver for time-dependent partial differential equations (PDEs). The solver is implemented using Kokkos library for...
详细信息
Pangenome reference graphs are useful in genomics because they compactly represent the genetic diversity within a species, a capability that linear references lack. However, efficiently aligning sequences to these gra...
详细信息
With the rapid advancement of machine learning (ML) models and their widespread application across various sectors such as intrusion detection, medical diagnosis, natural language processing, and autonomous driving, t...
详细信息
A supervised ranking model, despite its effectiveness over traditional approaches, usually involves complex processing - typically multiple stages of task-specific pre-training and fine-tuning. This has motivated rese...
详细信息
This work addresses the problem of deriving improved quantitative susceptibility mapping (QSM) from magnetic resonance (MR) acquisitions. Deep learning based models that map measured MR local phase field to QSM maps, ...
详细信息
暂无评论