Mobile IPv6 is a key technology to enable the mobility of devices on next-generation internet protocol networks. Home agents provide simple services to registered mobile nodes. In addition, the use of multiple domesti...
详细信息
Universitat Polit`ecnica de Val`encia (UPV) faces challenges in managing its Alfresco document repository, which contains 600,000 PDF files, of which only 100,000 are correctly categorised. Manual classification is la...
详细信息
ISBN:
(纸本)9783031777301;9783031777318
Universitat Polit`ecnica de Val`encia (UPV) faces challenges in managing its Alfresco document repository, which contains 600,000 PDF files, of which only 100,000 are correctly categorised. Manual classification is laborious and error-prone, hindering information retrieval and advanced search capabilities. This project presents an automated pipeline that integrates optical character recognition (OCR) and machinelearning to efficiently classify documents. Our approach distinguishes between scanned and digital documents, accurately extracts text and categorises it into 51 predefined categories using models such as BERT and RF. By improving document organisation and accessibility, this work optimises UPV's document management and paves the way for advanced search technologies and real-time classification systems.
This study investigates the use of machinelearning models to predict surface roughness (Ra) in milling multi-grade aluminum alloys without prior knowledge of optimal cutting parameters. A diverse milling dataset enco...
详细信息
This study investigates the use of machinelearning models to predict surface roughness (Ra) in milling multi-grade aluminum alloys without prior knowledge of optimal cutting parameters. A diverse milling dataset encompassing material properties and cutting parameters from various aluminum alloy grades was compiled from research articles. Four machinelearning algorithms, Extreme Gradient Boosting (XGB), Random Forest (RFR), Catalogical Gradient Boosting (CAT), and Gradient Boosting Regression (GBR), were employed to develop the predictive model. The dataset underwent cleaning, imputation, and outlier removal to ensure data quality. Feature engineering incorporated material properties and cutting parameters for model training. Performance metrics such as RMSE, MAPE, and R2 were used to assess the models' accuracy. The SHapley Additive exPlanations (SHAP) technique was employed to interpret the models and identify influential features. GBR achieved the highest prediction accuracy with an RMSE of 0.2507 mu m, MAPE of 23.36%, and R2 of 0.8709. Thermal conductivity, feed rate, and cutting speed were consistently identified as the most influential factors, although their rankings differed slightly. This study successfully developed a GBR model for effective Ra prediction in aluminum alloy milling, supporting advancements in smart manufacturing by enabling accurate surface quality prediction and data-driven process optimization through machinelearning.
This article presents the research results on the architectures and components of machinelearning model orchestration systems aimed at solving problems of spatial data analysis. The stages of the life cycle of models...
详细信息
In order to address the interference of concept drift on the results of multi-label learning algorithms, a hybrid kernel extreme learningmachine is used as the foundation for the classification algorithm. Concept dri...
详细信息
Quality of data and complexity of decision boundaries in high-dimensional data streams that are collected from cyber-physical power systems can greatly influence the process of learning from data and diagnosing faults...
详细信息
Quality of data and complexity of decision boundaries in high-dimensional data streams that are collected from cyber-physical power systems can greatly influence the process of learning from data and diagnosing faults in such critical systems. These systems generate massive amounts of data that overburden the system with excessive computational costs. Another issue is the presence of noise in recorded measurements that poses a challenge to the learning process, leading to a degradation in the performance of fault diagnosis. Furthermore, the diagnostic model is often provided with a mixture of redundant measurements that may deviate it from learning normal and fault distributions. This paper presents the effect of feature engineering on mitigating the aforementioned challenges in learning from data streams collected from cyber-physical systems. A data-driven fault diagnosis framework for a 118-bus power system is constructed by integrating feature selection, dimensionality reduction methods, and decision models. A comparative study is enabled accordingly to compare several advanced techniques in both domains. Dimensionality reduction and feature selection methods are compared both jointly and separately. Finally, experiments are concluded, and a setting is suggested that enhances data quality for fault diagnosis.
Bariatric surgery has emerged as an effective treatment option for individuals with severe obesity, offering not only weight loss but also remarkable improvements in metabolic health and endocrine function. Efficient ...
详细信息
ISBN:
(纸本)9783031803543;9783031803550
Bariatric surgery has emerged as an effective treatment option for individuals with severe obesity, offering not only weight loss but also remarkable improvements in metabolic health and endocrine function. Efficient management of the patient's length of stay (LOS) in the hospital is critical to optimizing healthcare resources and ensuring patient well-being. The objective of this study was to analyze post-operative LOS following bariatric surgery using machinelearning (ML) algorithms and determine their predictive performance. data from 757 patients undergoing bariatric surgery from 2019 to 2022 in a single institution were collected and analyzed. The ML algorithms used included Decision Tree (DT), Random Forest (RF), and Gradient Boosted Trees (GBT). The results showed that RF and GBT had comparable accuracy (71.7% and 71.1% respectively) and outperformed DT (62.0%). RF showed better overall performance, while GBT showed higher precision for predicting shorter LOS (less than 5 days). The results highlight the potential of machinelearning algorithms in predicting post-operative LOS, aiding in healthcare resource allocation and personalized patient care.
Accurate calculations are essential in forecasting. Therefore, it is important to select the appropriate forecasting method, which can be determined by testing the accuracy level of the forecasting results using the M...
详细信息
Wastewater treatment is important for pollutant reduction and reclaimed water production. machinelearning is increasing applied in environmental field for deciphering variables' relationships and processing large...
详细信息
Wastewater treatment is important for pollutant reduction and reclaimed water production. machinelearning is increasing applied in environmental field for deciphering variables' relationships and processing large datasets. However, multifarious sewage treatment systems, technologies and data processing methods led to the widespread application of machinelearning in wastewater treatment. Here, we evaluated a total of 398 publications focus on machinelearning-based wastewater treatment from 1993 to 2022 using bibliometric method. We aimed to provide a quantitative analysis on research hotpots, global trends and development prospects of wastewater treatment. Results showed that the related topic began in 1993 and publications' number was significantly increased since 2018. In the past three decades, modeling-based prediction and optimization has always been a research hotspot in wastewater treatment, although the continuous increasing of multifarious research topics in this field. As the international collaboration network core, China published 22.9% of the literatures, followed by the United States (13.1%) and Spain (9.36%). Water Research is the most productive journal with 22 publications containing research articles and review papers. Pollutant and antibiotics removal prediction, and neutral network based regression prediction are three independent research categories. Future research focus will still be on modeling-based wastewater treatment prediction and optimization. The findings provide an important reference and international overview to recognize the potential opportunity for researchers whom are working on machinelearning based wastewater treatment and related projects.
作者:
Han, BoHong Kong Baptist Univ
Ctr Adv Intelligence Project TMLR Grp Dept Comp SciRIKEN Hong Kong Peoples R China
Trustworthy machinelearning (TML) under imperfect data has recently brought much attention in the data-centric fields of machinelearning (ML) and artificial intelligence (AI). Specifically, there are mainly three ty...
ISBN:
(纸本)9781956792041
Trustworthy machinelearning (TML) under imperfect data has recently brought much attention in the data-centric fields of machinelearning (ML) and artificial intelligence (AI). Specifically, there are mainly three types of imperfect data along with their challenges for ML, including i) label-level imperfection: noisy labels;ii) feature-level imperfection: adversarial examples;iii) distribution-level imperfection: out-of-distribution data. Therefore, in this paper, we systematically share our insights and solutions of TML to handle three types of imperfect data. More importantly, we discuss some new challenges in TML, which also open more opportunities for future studies, such as trustworthy foundation models, trustworthy federated learning, and trustworthy causal learning.
暂无评论