Machine learning (ML) techniques have been used to solve real-world problems for decades. In the field of medical sciences, these approaches have been found to be useful in the diagnosis and prognosis of a variety of ...
详细信息
Machine learning (ML) techniques have been used to solve real-world problems for decades. In the field of medical sciences, these approaches have been found to be useful in the diagnosis and prognosis of a variety of medical disorders. However, when dealing with voluminous, inconsistent, and higher-dimensional data, conventional ML approaches have failed to deliver the expected results. Researchers have suggested hybrid solutions to resolve these problems, which have been found to be more effective than conventional methods because these systems integrate their merits while reducing their drawbacks. In the current research article, hybrid model has been presented by coupling feature optimization with prediction approach. The proposed hybrid model has two stages: the first involves implementing the relieff algorithm for optimum feature selection in erythemato-squamous diseases, and the second involves implementing k-nearest neighbor (KNN) for prediction of those selected optimum features. The experimentation was carried out on bench mark dataset for erythemato-squamous diseases. The presented hybrid model was also assessed with conventional KNN approach based on various metrics such as classification accuracy, kappa coefficient, recall, precision, and f-score.
Real-time explosive detectors must be developed to facilitate the rapid implementation of appropriate protective measures against terrorism. We report a simple yet efficient methodology to classify three explosives an...
详细信息
Real-time explosive detectors must be developed to facilitate the rapid implementation of appropriate protective measures against terrorism. We report a simple yet efficient methodology to classify three explosives and three non-explosives by using laser-induced breakdown spectroscopy. However, the similarity existing among the spectral emissions collected from the explosives resulted in the difficulty of separating samples. We calculated the weights of lines by using the relieff algorithm and then selected six line regions that could be identified from the arrangement of weights to calculate the area of each line region. A multivariate statistical method involving support vector machines was followed for the construction of the classification model. Several models were constructed using full spectra, 13 lines, and 100 lines selected by the arrangement of weights and areas of the selected line regions. The highest correct classification rate of the model reached 100% by using the six line regions.
In classification, a large number of features often make it difficult to select appropriate classification features. In such situations, feature selection or dimensionality reduction methods play an important role in ...
详细信息
ISBN:
(纸本)9781509033324
In classification, a large number of features often make it difficult to select appropriate classification features. In such situations, feature selection or dimensionality reduction methods play an important role in classification. relieff algorithm is one of the most successful filtering feature selection methods. In this paper, some shortcomings of the relieff algorithm are improved, on the problem of poor stability of neighbor samples selection, proposing the method of using the average value of multiple random selection to improve the anti-volatility of the algorithm. And redundant analysis is added to the relieff algorithm to eliminate the redundant features. The experimental results show that the improved relieff algorithm can effectively establish the classification feature sets, achieve the better classification accuracy.
作者:
Yong, XinGao, Yue-linNorth Minzu Univ
Sch Comp Sci & Engn Wenchang North St Yinchuan 750021 Ningxia Peoples R China North Minzu Univ
Ningxia Prov Key Lab Intelligent Informat & Data Wenchang North St Yinchuan 750021 Ningxia Peoples R China
Feature selection has become popular in data mining tasks currently for its ability of improving the performance of the algorithm and gaining more information about the dataset. Although the firefly algorithm is a wel...
详细信息
Feature selection has become popular in data mining tasks currently for its ability of improving the performance of the algorithm and gaining more information about the dataset. Although the firefly algorithm is a well-performed heuristic algorithm, there is still much room for improvement as to the feature selection problem. In this research, an improved firefly algorithm designed for feature selection with the relieff-based initialization method and the weighted voting mechanism is proposed. First of all, a feature grouping initialization method that combines the results of the relieff algorithm and the cosine similarity is designed to take place of random initialization. Then, the direction of the firefly is modified to move toward the optimal solution. Finally, inspired by the ensemble algorithm, a weighted voter is proposed to build recommended positions for fireflies, which is also integrated with the elite crossover operator and the mutation operator to improve the diversity of the population. Selected from the mixed swarm, a new population is constructed to replace the original population in the next stage. To verify the effectiveness of the algorithm proposed in this paper, 18 datasets are utilized and 9 comparison algorithms (e.g., Black Hole algorithm, Grey Wolf Optimizer and Pigeon Inspired Optimizer) from state-of-the-art related works are selected for the simulating experiments. The experimental results demonstrate the superiority of the proposed algorithm applied to the feature selection problem.
Multi-label learning has become an important area of research due to the increasing number of modern applications that contain multi-label data. The multi-label data are structured in a more complex way than single-la...
详细信息
Multi-label learning has become an important area of research due to the increasing number of modern applications that contain multi-label data. The multi-label data are structured in a more complex way than single-label data. Consequently the development of techniques that allow the improvement in the performance of machine learning algorithms over multi-label data is desired. The feature weighting and feature selection algorithms are important feature engineering techniques which have a beneficial impact on the machine learning. The relieff algorithm is one of the most popular algorithms to feature estimation and it has proved its usefulness in several domains. This paper presents three extensions of the relieff algorithm for working in the multi-label learning context, namely relieff-ML, PPT-relieff and Rrelieff-ML. PPT-relieff uses a problem transformation method to convert the multi-label problem into a single-label problem. relieff-ML and Rrelieff-ML adapt the classic relieff algorithm in order to handle directly the multi-label data. The proposed relieff extensions are evaluated and compared with previous relieff extensions on 34 multi-label datasets. The results show that the proposed relieff extensions improve preceding extensions and overcome some of their drawbacks. The experimental results are validated using several nonparametric statistical tests and confirm the effectiveness of the proposal for a better multi-label learning. (C) 2015 Elsevier B.V. All rights reserved.
This article introduces a novel approach to improve electronic nose classification accuracy by optimizing sensor arrays and aligning features. This involves selecting the best sensor combinations and reducing redundan...
详细信息
This article introduces a novel approach to improve electronic nose classification accuracy by optimizing sensor arrays and aligning features. This involves selecting the best sensor combinations and reducing redundant information for better odor recognition. We employ a feature alignment algorithm to address the discrepancies that impede model sharing between electronic nose devices. Our research focuses on overcoming challenges associated with material selection and the constraints of transferring classification models across different electronic nose devices for drug classification. We fabricated six SnO2-based MEMS gas sensors using physical vapor deposition. The relieff algorithm was employed to rank and score each sensor's contribution to drug classification, identifying the optimal sensor array. We then applied feature alignment from transfer learning to enhance model sharing among three inconsistent devices. This study resolves the issue of electronic noses being hard to use on the same database due to hardware inconsistencies in batch production, laying the groundwork for future mass production.
With the rapid growth of the Internet of Things (IoT), securing interconnected devices is becoming increasingly critical. This paper introduces the LightShield intrusion detection system (IDS) to enhance intrusion det...
详细信息
With the rapid growth of the Internet of Things (IoT), securing interconnected devices is becoming increasingly critical. This paper introduces the LightShield intrusion detection system (IDS) to enhance intrusion detection in IoT environments using high-performance computing. LightShield features preprocessing of IoT data, relieff algorithm for feature selection, and a novel detection model based on LightGBM, a gradient boosting framework. The system leverages GPU acceleration for faster model validation, enabling real-time monitoring. By adapting to IoT characteristics, LightShield provides flexible, scalable defense against evolving cyber threats. Results show its potential to improve security in IoT ecosystems, offering valuable insights into anomaly-based intrusion detection and the future of secure IoT networks. The binary classification model displayed exceptional precision with a 99.82% accuracy in detecting potential attacks, and the multiclass classification model achieved a commendable 97.25% accuracy in classifying distinct attack types.
Common bacterial blight (CBB) is the most destructive bacterial disease affecting the production of common beans, and timely detection of CBB is crucial to limiting its spread. In this study, correlation analysis and ...
详细信息
Common bacterial blight (CBB) is the most destructive bacterial disease affecting the production of common beans, and timely detection of CBB is crucial to limiting its spread. In this study, correlation analysis and the relieff algorithm were used to select vegetation indices (VIs) and texture features (TFs) that are sensitive to CBB. The CBB monitoring model based on support vector machine regression (SVR), random forest regression (RFR), and K-nearest neighbor regression (KNNR) was established using the selected the VIs, TFs, and their combinations. Then, the impact of the spatial resolution on the disease monitoring accuracy was evaluated. In addition, the early infection monitoring model was further optimised. The results show that in the early infection stage, when the spatial resolution was 0.07 m, the window size was 7 x 7, and the independent variable was a combination of VIs and TFs, the R2 of the monitoring model constructed via SVR was 0.72, which was 14.3% higher than that obtained for a 3 x 3 window (0.63). In the middle and late infection stages, the optimal spatial resolution was 0.1 m, and the monitoring model constructed using RFR and a combination of VIs and TFs performed the best, with R2 values of 0.81 and 0.88, respectively. The research results indicate that selecting an appropriate spatial resolution and window size can effectively improve the model's CBB monitoring ability and can provide a reference for accurate monitoring of large-scale CBB of common beans using airborne or spaceborne imaging spectroscopy technology.
The current of the residential series arc fault is affected by the load type, and the fault feature change is not obvious and contains noise. Therefore, the extraction of fault features will affect the arc fault detec...
详细信息
The current of the residential series arc fault is affected by the load type, and the fault feature change is not obvious and contains noise. Therefore, the extraction of fault features will affect the arc fault detection results. To solve this problem, an improved salp swarm optimization algorithm combined with variational mode decomposition is proposed to extract the characteristics of current signal, improve the decomposition effect of current signal, and construct a dataset that can fully reflect the characteristics of arc fault. The relieff algorithm is designed to combine minimum redundancy maximum relevance to reduce the feature dimension and eliminate redundancy. Finally, the random forest model is used to diagnose the fault, which can quickly detect the fault and does not cause overfitting. Experiments based on the self-built sample set prove that the arc fault can be quickly detected under different acquisition frequencies and noise environments, and the detection rate is more than 95%.
relieff algorithm was used to analyze the weight of each water quality evaluation factor, and then based on the Relevance Vector Machine (RVM), Particle Swarm Optimization (PSO) was used to optimize the kernel width f...
详细信息
relieff algorithm was used to analyze the weight of each water quality evaluation factor, and then based on the Relevance Vector Machine (RVM), Particle Swarm Optimization (PSO) was used to optimize the kernel width factor and hyperparameters of RVM to build a water quality evaluation model, and the experimental results of RVM, PSO-RVM, relieff-RVM and PSO-relieff-RVM were compared. The results show that relieff algorithm, combined with threshold value, selects 5 evaluation factors with significant weight from 8 evaluation factors, which reduces the amount of data used in the model, CSI index is used to calculate the separability of each evaluation factor combination. The results show that the overall separability of the combination is best when the evaluation factor with significant weight is reserved. When different water quality evaluation factors were included, the evaluation accuracy of PSO-relieff-RVM model reached 95.74%, 14.23% higher than that of RVM model, which verified the effectiveness of PSO algorithm and relieff algorithm, and had a higher guiding significance for the study of water quality grade evaluation. It has good practical application value.
暂无评论