As the need for achieving optimal performance in prediction models, several recent and complex models have been developed. However, many of these models operate as black boxes, providing little insight into their pred...
详细信息
ISBN:
(数字)9798350376210
ISBN:
(纸本)9798350376227
As the need for achieving optimal performance in prediction models, several recent and complex models have been developed. However, many of these models operate as black boxes, providing little insight into their predictive results. In recent years, three gradient-boosting methods based on Decision Trees have been recommended, i.e., XGBoost, CatBoost, and LightGBM. These methods have proven to deliver competitive performance with fast training times. Nevertheless, in critical domains like security, there is a growing need for increased transparency among stakeholders. One pressing concern in the security field is the proliferation of phishing websites. To address this issue, we propose an explainable machine learning-based approach using gradient-boosting methods with hyperparameter optimization on three phishing website datasets. Our best methods surpass the state-of-the-art phishing website detection methods, achieving accuracy rates of 97.45%, 99.16%, and $\mathbf{9 7. 8 5 \%}$ for the UCI (2015), Mendeley (2018), and Mendeley (2020) datasets, respectively. Subsequently, we implement posthoc explainability using SHAP and LIME for the selected dataset. The experimental results indicate that three features, i.e., length_url, directory_length, and time_domain_activation, are consistently identified as the most influential features in the dataset. Moreover, our proposed approach demonstrates promising results for detecting phishing websites with both high accuracy and explainability.
Sudden cardiac arrest (SCA) poses a significant health challenge, necessitating accurate predictions of neurological outcomes in comatose patients, where good outcomes are defined as the recovery of most cognitive fun...
详细信息
Dynamic programming is a fundamental algorithm that can be found in our daily lives easily. One of the dynamic programming algorithm implementations consists of solving the 0/1 knapsack problem. A 0/1 knapsack problem...
详细信息
This paper introduces HAAQI-Net, a non-intrusive music audio quality assessment model for hearing aid users. Unlike traditional methods such as Hearing Aid Audio Quality Index (HAAQI), which requires intrusive referen...
详细信息
This paper introduces HAAQI-Net, a non-intrusive music audio quality assessment model for hearing aid users. Unlike traditional methods such as Hearing Aid Audio Quality Index (HAAQI), which requires intrusive reference signal comparisons, HAAQI-Net offers a more accessible and computationally efficient alternative. Leveraging a bidirectional long short-term memory architecture with attention mechanisms and features extracted from a pre-trained BEATs model, it can predict HAAQI scores directly from music audio clips and hearing loss patterns. The experimental results demonstrate that, compared to the traditional HAAQI as the reference, HAAQI-Net achieves a linear correlation coefficient (LCC) of 0.9368, a Spearman's rank correlation coefficient (SRCC) of 0.9486, and a mean squared error (MSE) of 0.0064, while significantly reducing the inference time from 62.52 seconds to 2.54 seconds. Furthermore, a knowledge distillation strategy was applied, reducing the parameters by 75.85% and inference time by 96.46%, while maintaining strong performance (LCC: 0.9071, SRCC: 0.9307, MSE: 0.0091). To expand its capabilities, HAAQI-Net was adapted to predict subjective human scores, mean opinion score (MOS), by fine-tuning. This adaptation significantly improved the prediction accuracy. Furthermore, the robustness of HAAQI-Net was evaluated under varying sound pressure level (SPL) conditions, revealing optimal performance at a reference SPL of 65 dB, with the accuracy gradually decreasing as SPL deviated from this point. The advancements in subjective score prediction, SPL robustness, and computational efficiency position HAAQI-Net as a reliable solution for music audio quality assessment, significantly contributing to the development of efficient and accurate models in audio signal processing and hearing aid technology.
Multivariate time series (MTS) data, when sampled irregularly and asynchronously, often present extensive missing values. Conventional methodologies for MTS analysis tend to rely on temporal embeddings based on timest...
详细信息
The Human-Centered Internet of Things(HC-IoT)is fast becoming a hotbed of security and privacy *** users can establish a common session key through a trusted server over an open communication channel using a three-par...
详细信息
The Human-Centered Internet of Things(HC-IoT)is fast becoming a hotbed of security and privacy *** users can establish a common session key through a trusted server over an open communication channel using a three-party authenticated key *** of the early authenticated key agreement systems relied on pairing,hashing,or modular exponentiation processes that are computationally intensive and *** order to address this problem,this paper offers a new three-party authenticated key agreement technique based on fractional chaotic *** new scheme uses fractional chaotic maps and supports the dynamic sensing of HC-IoT devices in the network architecture without a password *** projected security scheme utilized a hash function,which works well for the resource-limited HC-IoT *** results show that our new technique is resistant to password guessing attacks since it does not use a ***,our approach provides users with comprehensive privacy protection,ensuring that a user forgery attack causes no ***,our new technique offers better security features than the techniques currently available in the literature.
This research aims to help the user monitor their heart’s condition and notify other people in case of an emergency due to an abnormal heartbeat (normal heartbeat around 60-100 beats per minute). From the heart rate ...
详细信息
Microsatellite instability (MSI) is a pivotal genetic marker influencing the efficacy of immunotherapy in colorectal cancer. Traditional MSI examination often requires additional genetic or immunohistochemical tests, ...
详细信息
ISBN:
(数字)9798331518622
ISBN:
(纸本)9798331518639
Microsatellite instability (MSI) is a pivotal genetic marker influencing the efficacy of immunotherapy in colorectal cancer. Traditional MSI examination often requires additional genetic or immunohistochemical tests, whereas histology images, widely available in colorectal cancer diagnosis, offer a valuable alternative for MSI prediction. Although Transformer-based models have demonstrated promising outcomes in predicting MSI from histology images, they are hampered by traditional local attention mechanisms that struggle to capture long-range interdependencies and establish a comprehensive global receptive field. In this study, we introduce DiNAT-MSI, a novel framework for histology-based MSI prediction that incorporates the Dilated Neighborhood Attention Transformer (DiNAT). This model enhances global context recognition and substantially expands receptive fields, all without additional computational burden. Our results demonstrate that DiNAT-MSI achieves a superior patientwise AUROC compared to ResNet18 and Swin Transformer, along with commendable explainability. Our work not only illustrates a more accessible diagnostic tool for leveraging histological data but also underscores the potential of Transformerbased models with sophisticated attention designs in advancing precision medicine for colorectal cancer patients.
Superior and quality human resources are based on healthy human resources with indicators of adequate nutritional intake according to age development. However, the world still faces the problem of hunger and malnutrit...
Superior and quality human resources are based on healthy human resources with indicators of adequate nutritional intake according to age development. However, the world still faces the problem of hunger and malnutrition today. According to a UNICEF report, the number of people suffering from malnutrition in the world will reach 767.9 million people in 2021. The World Health Organization (WHO) said that malnutrition is a dangerous threat to the health of the world's population. Stunting also has an impact in Indonesia, the prevalence of toddlers experiencing stunting in Indonesia is 24.4% in 2021. The solution created is to classify and cluster the prevalence of stunting to produce a pattern that can be used as best practice to be transmitted to other affected areas. The algorithm used is Euclid. The Euclid algorithm can cluster stunting prevalence data into 4 clusters with the very little category at 79%, the little category at 67%, the many categories at 51%, and the very much category at 21%. The results of the classification and clustering of the best stunting prevalence in cluster one with a very small number, can be used as a source of accurate and updated information that can be used by the government in its efforts to optimize stunting handling in each district/city based on artificial intelligence which can provide handling and optimization patterns. stunting in every district/city.
Electronic Health Records (EHRs) are a cornerstone of modern healthcare analytics, offering rich datasets for various disease analyses through advanced deep learning algorithms. However, the pervasive issue of missing...
详细信息
ISBN:
(数字)9798350371499
ISBN:
(纸本)9798350371505
Electronic Health Records (EHRs) are a cornerstone of modern healthcare analytics, offering rich datasets for various disease analyses through advanced deep learning algorithms. However, the pervasive issue of missing values in EHRs significantly hampers the development and performance of these models. Addressing this challenge is crucial for enhancing clinical decision-making and patient care. Existing methods for handling missing data, ranging from simple imputation to more sophisticated approaches, often fall short of capturing the temporal dynamics inherent in EHRs. To bridge this gap, we introduce the Deep Stochastic Time-series Imputation (Deep STI) algorithm, an innovative end-to-end deep learning model that seamlessly integrates a sequence-to-sequence generative network with a prediction network. Deep STI is designed to leverage the observed time-series data in EHRs, learning to infer missing values from the temporal context with high accuracy. We evaluated Deep STI on the liver cancer data from the National Taiwan University Hospital (NTUH), Taiwan. Our results showed that Deep STI achieved better 5-year hepatocellular carcinoma predictions (19.21% in the area under the precision-recall curve) than extreme gradient boosting (18.15%) and Transformer (18.09%). The ablation study also illustrates the efficacy of our generative architecture design compared to regular imputations. This approach not only promises to improve the reliability of disease analysis in the presence of incomplete data but also sets a new standard for utilizing EHRs in predictive healthcare. Our work aims to advance the field of healthcare analytics and open new avenues for research in deep learning applications to EHRs.
暂无评论