Intelligent agents (IAs) are highly autonomous software applications designed for performing tasks in a broad spectrum of virtual environments by circulating freely around them, possibly in numerous copies, and taking...
详细信息
An imbalanced dataset often challenges machine learning, particularly classification methods. Underrepresented minority classes can result in biased and inaccurate models. The Synthetic Minority Over-Sampling Techniqu...
详细信息
An imbalanced dataset often challenges machine learning, particularly classification methods. Underrepresented minority classes can result in biased and inaccurate models. The Synthetic Minority Over-Sampling Technique (SMOTE) was developed to address the problem of imbalanced data. Over time, several weaknesses of the SMOTE method have been identified in generating synthetic minority class data, such as overlapping, noise, and small disjuncts. However, these studies generally focus on only one of SMOTE’s weaknesses: noise or overlapping. Therefore, this study addresses both issues simultaneously by tackling noise and overlapping in SMOTE-generated data. This study proposes a combined approach of filtering, clustering, and distance modification to reduce noise and overlapping produced by SMOTE. Filtering removes minority class data (noise) located in majority class regions, with the k-nn method applied for filtering. The use of Noise Reduction (NR), which removes data that is considered noise before applying SMOTE, has a positive impact in overcoming data imbalance. Clustering establishes decision boundaries by partitioning data into clusters, allowing SMOTE with modified distance metrics to generate minority class data within each cluster. This SMOTE clustering and distance modification approach aims to minimize overlap in synthetic minority data that could introduce noise. The proposed method is called “NR-Clustering SMOTE,” which has several stages in balancing data: (1) filtering by removing minority classes close to majority classes (data noise) using the k-nn method;(2) clustering data using K-means aims to establish decision boundaries by partitioning data into several clusters;(3) applying SMOTE oversampling with Manhattan distance within each cluster. Test results indicate that the proposed NR-Clustering SMOTE method achieves the best performance across all evaluation metrics for classification methods such as Random Forest, SVM, and Naїve Bayes, compared t
With the advancement of cloud-based file management platforms, sharing and storing files in the company's area can be easily achieved through a variety of file-sharing platforms. However, these platforms often lac...
详细信息
Data integration from all health care facilities in Indonesia to the Satu Sehat platform is an obligation that must be carried out, but it turns out that until early October 2024, around 57% of health care facilities ...
详细信息
This research proposes a novel approach to addressing the issue of signature authenticity identification by implementing Deep Metric Learning (DML). The aim of this study is to develop a DML model with high accuracy i...
详细信息
Cardiovascular disease(CVD)remains a leading global health challenge due to its high mortality rate and the complexity of early diagnosis,driven by risk factors such as hypertension,high cholesterol,and irregular puls...
详细信息
Cardiovascular disease(CVD)remains a leading global health challenge due to its high mortality rate and the complexity of early diagnosis,driven by risk factors such as hypertension,high cholesterol,and irregular pulse *** diagnostic methods often struggle with the nuanced interplay of these risk factors,making early detection *** this research,we propose a novel artificial intelligence-enabled(AI-enabled)framework for CVD risk prediction that integrates machine learning(ML)with eXplainable AI(XAI)to provide both high-accuracy predictions and transparent,interpretable *** to existing studies that typically focus on either optimizing ML performance or using XAI separately for local or global explanations,our approach uniquely combines both local and global interpretability using Local Interpretable Model-Agnostic Explanations(LIME)and SHapley Additive exPlanations(SHAP).This dual integration enhances the interpretability of the model and facilitates clinicians to comprehensively understand not just what the model predicts but also why those predictions are made by identifying the contribution of different risk factors,which is crucial for transparent and informed decision-making in *** framework uses ML techniques such as K-nearest neighbors(KNN),gradient boosting,random forest,and decision tree,trained on a cardiovascular ***,the integration of LIME and SHAP provides patient-specific insights alongside global trends,ensuring that clinicians receive comprehensive and actionable *** experimental results achieve 98%accuracy with the Random Forest model,with precision,recall,and F1-scores of 97%,98%,and 98%,*** innovative combination of SHAP and LIME sets a new benchmark in CVD prediction by integrating advanced ML accuracy with robust interpretability,fills a critical gap in existing *** framework paves the way for more explainable and transparent decision-making in he
This paper presents a roadmap for the architecture of an intelligent online virtual university that promises to redefine distance learning through its 3D immersive environment. This Unity-based solution offers a rich,...
详细信息
Our study examines how formatted text influences reading behavior concerning the smartphone screen regions users utilize on social media comments. Aiming to research visual attention in scrollable content, we investig...
详细信息
The Faculty of computer Science, Universitas Brawijaya (Filkom UB) is committed to providing quality services for the users especially internal and external stakeholders, one of which is through the HaloFilkom service...
详细信息
ISBN:
(纸本)9798350379914
The Faculty of computer Science, Universitas Brawijaya (Filkom UB) is committed to providing quality services for the users especially internal and external stakeholders, one of which is through the HaloFilkom service. HaloFilkom services have limitations in terms of time. HaloFilkom services are not available 24 hours due to limited working hours. Questions asked by users are not answered directly. This weakness in the HaloFilkom system can be overcome by using a chatbot system. Chatbot is an interactive system that works with natural human language and can work 24 hours. Thus, the current study explores the basic chatbot model by classifying the Q&A in the closed domain knowledge. The dataset in this research is in the form of pairs of questions and answers regarding various topics at the Filkom UB. The knowledge is preprocessed using text preprocessing which includes case folding, tokenization, padding, and tensorization. One of the chatbot models is a generative model. Creating a generative chatbot model can be done using the Seq2Seq model mechanism which consists of an encoder and decoder. The model created consists of four different architectures, namely a model with an LSTM encoder without attention and with attention and a BiLSTM model encoder without attention and with attention. Hyperparameter tuning was conducted to obtain the best hyperparameter combination. The experiment results show the best hyperparameter combination obtained is hidden size 448, drop out rate 0.5, learning rate 0.001, batch size 64, and teacher force 0. The model with the best loss is obtained with a BiLSTM encoder architecture without an attention mechanism with a train loss of 0.120. The model with the highest BLEU Score was obtained by a model with a BiLSTM encoder architecture without an attention mechanism with a BLEU Score of 0.8587 on the training data. Testing using prompt testing obtained an average BLEU Score of 0.3745 on the BiLSTM encoder without an attention mechanism mo
Forecasting electricity demand is an essential part of the smart grid to ensure a stable and reliable power grid. With the increasing integration of renewable energy resources into the grid, forecasting the demand for...
详细信息
Forecasting electricity demand is an essential part of the smart grid to ensure a stable and reliable power grid. With the increasing integration of renewable energy resources into the grid, forecasting the demand for electricity is critical at all levels, from the distribution to the household. Most existing forecasting methods, however, can be considered black-box models as a result of deep digitalization enablers, such as deep neural networks, which remain difficult to interpret by humans. Moreover, capture of the inter-dependencies among variables presents a significant challenge for multivariate time series forecasting. In this paper we propose eXplainable Causal Graph Neural Network (X-CGNN) for multivariate electricity demand forecasting that overcomes these limitations. As part of this method, we have intrinsic and global explanations based on causal inferences as well as local explanations based on post-hoc analyses. We have performed extensive validation on two real-world electricity demand datasets from both the household and distribution levels to demonstrate that our proposed method achieves state-of-the-art performance.
暂无评论