检索结果-内蒙古大学图书馆

A Comparative Study of Predictive Analysis Using Machine Learning techniques: Performance Evaluation of Manual and AutoML Algorithms

引用

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS 2025年第1期16卷 12-31页

作者： Rezaul, Karim Mohammed Jewel, Md. Sudhan, Anjali Khan, Mifta Uddin Fernando, Maharage Roshika Sathsarani Siddiquee, Kazy Noor e Alam Jannat, Tajnuva Rahman, Muhammad Azizur Islam, Md Shabiul Wrexham Univ Fac Arts Sci & Technol Wrexham LL11 2AW Wales Ctr Appl Res Software & IT CARSIT 80a Ashfield St London E1 2BJ England Multimedia Univ Fac Engn FOE Cyberjaya 63100 Malaysia Cardiff Metropolitan Univ Dept Comp Sci Llandaff CampusWestern Ave Cardiff CF5 2YB Wales

In this study, we have compared manual machine learning with automated machine learning (AutoML) to see which performs better in predictive analysis. Using data from past football matches, we tested a range of algorithms to forecast game outcomes. By exploring the data, we discovered patterns and team correlations, then cleaned and prepped the data to ensure the models had the best possible inputs. Our findings show that AutoML, especially when using logistic regression can outperform manual methods in prediction accuracy. The big advantage of AutoML is that it automates the tricky parts, like data cleaning, feature selection, and tuning model parameters, saving time and effort compared to manual approaches, which require more expertise to achieve similar results. This research highlights how AutoML can make predictive analysis easier and more accurate, providing useful insights for many fields. Future work could explore using different data types and applying these techniques to other areas to show how adaptable and powerful machine learning can be.

关键词： Machine learning predictive analytics sports forecasting automated machine learning (AutoML) feature engineering model evaluation data pre-processing algorithm comparison football analytics sports betting team performance metrics exploratory data analysis (EDA) cross-validation techniques

来源：评论

学校读者我要写书评

暂无评论

Water Quality Management Using Hybrid Machine Learning and Data Mining Algorithms: An Indexing Approach

引用

IEEE ACCESS 2022年 10卷 119692-119705页

作者： Aslam, Bilal Maqsoom, Ahsen Cheema, Ali Hassan Ullah, Fahim Alharbi, Abdullah Imran, Muhammad No Arizona Univ Sch Informat Comp & Cyber Syst Flagstaff AZ 86011 USA COMSATS Univ Islamabad Dept Civil Engn Islamabad 47040 Pakistan Univ Southern Queensland Sch Surveying & Built Environm Springfield Cent Qld Australia King Saud Univ Community Coll Dept Comp Sci Riyadh 11437 Saudi Arabia Federat Univ Inst Innovat Sci & Sustainabil Brisbane Qld Australia

One of the key functions of global water resource management authorities is river water quality (WQ) assessment. A water quality index (WQI) is developed for water assessments considering numerous quality-related variables. WQI assessments typically take a long time and are prone to errors during sub-indices generation. This can be tackled through the latest machine learning (ML) techniques renowned for superior accuracy. In this study, water samples were taken from the wells in the study area (North Pakistan) to develop WQI prediction models. Four standalone algorithms, i.e., random trees (RT), random forest (RF), M5P, and reduced error pruning tree (REPT), were used in this study. In addition, 12 hybrid data-mining algorithms (a combination of standalone, bagging (BA), cross-validation parameter selection (CVPS), and randomizable filtered classification (RFC)) were also used. Using the 10-fold cross-validation technique, the data were separated into two groups (70:30) for algorithm creation. Ten random input permutations were created using Pearson correlation coefficients to identify the best possible combination of datasets for improving the algorithm prediction. The variables with very low correlations performed poorly, whereas hybrid algorithms increased the prediction capability of numerous standalone algorithms. Hybrid RT-Artificial Neural Network (RT-ANN) with RMSE = 2.319, MAE = 2.248, NSE = 0.945, and PBIAS = -0.64 outperformed all other algorithms. Most algorithms overestimated WQI values except for BA-RF, RF, BA-REPT, REPT, RFC-M5P, RFC-REPT, and ANN-Adaptive Network-Based Fuzzy Inference System (ANFIS).

关键词： Water quality index machine learning hybrid data-mining algorithms cross-validation techniques North Pakistan

来源：评论

学校读者我要写书评

暂无评论

Quantitative structure-activity relationship analysis of human neutrophil elastase inhibitors using shuffling classification and regression trees and adaptive neuro-fuzzy inference systems

引用

SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2012年第5-6期23卷 505-520页

作者： Asadollahi-Baboli, M. Babol Univ Technol Dept Sci Babol Sar Iran

The purpose of this study was to develop quantitative structure-activity relationship models for N-benzoylindazole derivatives as inhibitors of human neutrophil elastase. These models were developed with the aid of classification and regression trees (CART) and an adaptive neuro-fuzzy inference system (ANFIS) combined with a shuffling cross-validation technique using interpretable descriptors. More than one hundred meaningful descriptors, representing various structural characteristics for all 51 N-benzoylindazole derivatives in the data set, were calculated and used as the original variables for shuffling CART modelling. Five descriptors of average Wiener index, Kier benzene-likeliness index, subpolarity parameter, average shape profile index of order 2 and folding degree index selected by the shuffling CART technique have been used as inputs of the ANFIS for prediction of inhibition behaviour of N-benzoylindazole derivatives. The results of the developed shuffling CART-ANFIS model compared to other techniques, such as genetic algorithm (GA)-partial least square (PLS)-ANFIS and stepwise multiple linear regression (MLR)-ANFIS, are promising and descriptive. The satisfactory results (r(p)(2) = 0.845, Q(LOO)(2) = 0.861, r(L25%O)(2) = 0.829, RMSELOO = 0.305 and RMSEL25%O = 0.336) demonstrate that shuffling CART-ANFIS models present the relationship between human neutrophil elastase inhibitor activity and molecular descriptors, and they yield predictions in excellent agreement with the experimental values.

关键词： human neutrophil elastase N-benzoylindazole derivatives classification and regression trees adaptive neuro-fuzzy inference systems cross-validation techniques

来源：评论

学校读者我要写书评

暂无评论

cross-validated density estimates based on Kullback-Leibler information

引用

JOURNAL OF NONPARAMETRIC STATISTICS 2004年第3-4期16卷 493-513页

作者： Berlinet, A Brunel, E Univ Montpellier 2 Lab Probabil & Stat F-34095 Montpellier 5 France Univ Paris 05 MAPS F-75270 Paris France

The convergence of measure estimates in the sense of Kullback-Leibler divergence is required in many applications in decision and information theory. Recently, modified histograms have been shown to have good properties with respect to information divergences. For these estimates deterministic optimal bandwidths have been given, but no automatic smoothing procedure has been shown to be asymptotically optimal. In the present article, we consider the Kullback-Leibler cross-validation method for selecting the bin width of modified histograms. We analyze the behavior of the Kullback-Leibler divergence and of its expectation and prove that the cross-validated estimate is asymptotically optimal with respect to the Kullback-Leibler divergence.

关键词： functional estimation binned data barron density estimates histograms Kullback-Leibler divergence automatic smoothing parameter cross-validation techniques

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：