A user story is commonly applied in requirement elicitation, particularly in agile software development. User story is typically composed in semi-formal natural language, and often follow a predefined template. The us...
详细信息
Effort estimation is essential for successful software project planning, budgeting, and risk identification. However, the techniques used to estimate effort are often inaccurate, outdated, and only consider technical ...
详细信息
This study presents a preliminary investigation into the application of deep learning techniques for the identification of traffic density from unmanned aerial vehicle (UAV) images. The primary objective is to categor...
详细信息
In the current context of research, it investigates user authentication through keystroke dynamics using the neural network model which focuses on Convolutional Neural Network (CNN), Gated Recurrent Unit (GRU) and Lon...
详细信息
The aim of this review is to explore the relationship between the architecture of microservices and antisocial behavior. Microservice architecture is a software design approach that involves decomposing a system into ...
详细信息
In the contemporary business landscape, the success of a company is intricately linked to the engagement and satisfaction of its workforce. This study analyzes the signifi-cance of developing a contented and engaged e...
详细信息
Modern electronics relies heavily on circuit boards, which must be designed and produced using a variety of procedures in order to be filled with electrical components. This procedure includes drilling and plating hol...
详细信息
The rapid development and progress of artificial intelligence algorithms in the last decade has opened up many new possibilities and fields for its application. The field of human-computer interaction is not only not ...
详细信息
An imbalanced dataset often challenges machine learning, particularly classification methods. Underrepresented minority classes can result in biased and inaccurate models. The Synthetic Minority Over-Sampling Techniqu...
详细信息
An imbalanced dataset often challenges machine learning, particularly classification methods. Underrepresented minority classes can result in biased and inaccurate models. The Synthetic Minority Over-Sampling Technique (SMOTE) was developed to address the problem of imbalanced data. Over time, several weaknesses of the SMOTE method have been identified in generating synthetic minority class data, such as overlapping, noise, and small disjuncts. However, these studies generally focus on only one of SMOTE’s weaknesses: noise or overlapping. Therefore, this study addresses both issues simultaneously by tackling noise and overlapping in SMOTE-generated data. This study proposes a combined approach of filtering, clustering, and distance modification to reduce noise and overlapping produced by SMOTE. Filtering removes minority class data (noise) located in majority class regions, with the k-nn method applied for filtering. The use of Noise Reduction (NR), which removes data that is considered noise before applying SMOTE, has a positive impact in overcoming data imbalance. Clustering establishes decision boundaries by partitioning data into clusters, allowing SMOTE with modified distance metrics to generate minority class data within each cluster. This SMOTE clustering and distance modification approach aims to minimize overlap in synthetic minority data that could introduce noise. The proposed method is called “NR-Clustering SMOTE,” which has several stages in balancing data: (1) filtering by removing minority classes close to majority classes (data noise) using the k-nn method;(2) clustering data using K-means aims to establish decision boundaries by partitioning data into several clusters;(3) applying SMOTE oversampling with Manhattan distance within each cluster. Test results indicate that the proposed NR-Clustering SMOTE method achieves the best performance across all evaluation metrics for classification methods such as Random Forest, SVM, and Naїve Bayes, compared t
As global energy demand grows, renewable sources offer a key alternative to fossil fuels. However, integrating these sources into power grids presents challenges, especially with supply unpredictability. This paper de...
详细信息
暂无评论