Objective: The authors performed several tree-based algorithms and an association rules mining as data mining tools to find useful determinants for neurological outcomes in out-of-hospital cardiac arrest (OHCA) patien...
详细信息
Objective: The authors performed several tree-based algorithms and an association rules mining as data mining tools to find useful determinants for neurological outcomes in out-of-hospital cardiac arrest (OHCA) patients as well as to assess the effect of the first-aid and basic characteristics in the EMS system. Patients and Methods: This was a retrospective cohort study. The outcome was Cerebral Performance Categories grading on OHCA patients at hospital discharge. Decision tree-based models inclusive of C4.5 algorithm, classification and regression tree and random forest were built to determine an OHCA patient's prognosis. Association rules mining was another data mining method which we used to find the combination of prognostic factors linked to the outcome. Results: The total of 3520 patients were included in the final analysis. The mean age was 67.53 (+/- 18.4) year-old and 63.4% were men. To overcome the imbalance outcome issue in machine learning, the random forest has a better predictive ability for OHCA patients in overall accuracy (91.19%), weighted precision (88.76%), weighted recall (91.20%) and F1 score (0.9) by oversampling adjustment. Under association rules mining, patients who had any witness on the spot when encountering OHCA or who had ever ROSC during first-aid would be highly correlated with good CPC prognosis. Conclusion: The random forest has a better predictive ability for OHCA patients. This paper provides a role model applying several machine learning algorithms to the first-aid clinical assessment that will be promising combining with Artificial Intelligence for applying to emergency medical services.
The north coast of Java is the center of economic activity in Indonesia. This area is dynamic and sensitive to various geo-bio-physical aspects. Therefore, a vulnerability study in this area is necessary. This study p...
详细信息
The north coast of Java is the center of economic activity in Indonesia. This area is dynamic and sensitive to various geo-bio-physical aspects. Therefore, a vulnerability study in this area is necessary. This study proposes a machine learning tree-based algorithms modeling approach for Coastal Vulnerability Assessment (CVA) and mapping. The tree-based algorithms used are Gradient tree Boost (GTB), Classification and Regression trees (CART), and Random Forest (RF). The study utilized the Google Earth Engine (GEE) platform and twelve variables as input. The prediction results of each of these modeling algorithms have been compared and evaluated to determine the most optimal performance and accuracy. Reference data was obtained from the Ministry of Maritime Affairs and Fisheries of the Republic of Indonesia (KKP). Approximately 70% of the reference data was allocated for training, while the remaining 30% was designated for validation. The CVA assessment yielded overall accuracies of 80.22%, 77.40%, and 71.18% based on the RF, GTB, and CART algorithms, respectively. Meanwhile, the Kappa Index for these three algorithms was 0.72, 0.67, and 0.58, indicating that the models have adequately classified the data. The research outcomes are anticipated to offer insights into the potential utilization of machine learning technology for vulnerability assessment and mapping, contributing to the management of coastal environmental issues.
Nowadays, the ASIC design is increasing in complexity, and PPA targets are pushed to the limit. The lack of physical information at the early design stages hinders precise timing predictions and may lead to design re ...
详细信息
Nowadays, the ASIC design is increasing in complexity, and PPA targets are pushed to the limit. The lack of physical information at the early design stages hinders precise timing predictions and may lead to design re -spins. In previous work, we successfully improved timing prediction at the post -placement stage using the Random Forest model, achieving 91.25% cell delay accuracy. Building upon this, we further investigate the potential of Ensemble tree-based algorithms, specifically focusing on " Extremely Randomized trees "and " Gradient Boosting ", to close the gap in cell delay accuracy. In this paper, we enrich the training dataset with new 16 nm industrial designs. The results demonstrate a substantial improvement, with an average cell delay accuracy of 92.01% and 84.26% on unseen data. The average Root-Mean-Square-Error is significantly reduced from 12.11 to 3.23 and 7.76 on unseen data.
Open caissons are widely used in foundation engineering because of their load-bearing efficiency and adaptability in diverse soil conditions. However, accurately predicting their undrained bearing capacity in layered ...
详细信息
Open caissons are widely used in foundation engineering because of their load-bearing efficiency and adaptability in diverse soil conditions. However, accurately predicting their undrained bearing capacity in layered soils remains a complex challenge. This study presents a novel application of five ensemble machine (ML) algorithms—random forest (RF), gradient boosting machine (GBM), extreme gradient boosting (XGBoost), adaptive boosting (AdaBoost), and categorical boosting (CatBoost)—to predict the undrained bearing capacity factor ( N c ) of circular open caissons embedded in two-layered clay on the basis of results from finite element limit analysis (FELA). The input dataset consists of 1188 numerical simulations using the Tresca failure criterion, varying in geometrical and soil parameters. The FELA was performed via OptumG2 software with adaptive meshing techniques and verified against existing benchmark studies. The ML models were trained on 70 % of the dataset and tested on the remaining 30 %. Their performance was evaluated using six statistical metrics: coefficient of determination ( R² ), mean absolute error ( MAE ), root mean squared error ( RMSE ), index of scatter ( IOS ), RMSE-to-standard deviation ratio ( RSR ), and variance explained factor ( VAF ). The results indicate that all the models achieved high accuracy, with R² values exceeding 97.6 % and RMSE values below 0.02. Among them, AdaBoost and CatBoost consistently outperformed the other methods across both the training and testing datasets, demonstrating superior generalizability and robustness. The proposed ML framework offers an efficient, accurate, and data-driven alternative to traditional methods for estimating caisson capacity in stratified soils. This approach can aid in reducing computational costs while improving reliability in the early stages of foundation design.
This research work is concerned with the predictability of ensemble and singular tree-based machine learning algorithms during the recession and prosperity of the two companies listed in the Tehran Stock Exchange in t...
详细信息
This research work is concerned with the predictability of ensemble and singular tree-based machine learning algorithms during the recession and prosperity of the two companies listed in the Tehran Stock Exchange in the context of big data. In this regard, the main issue is that economic managers and the academic community require predicting models with more accuracy and reduced execution time;moreover, the prediction of the companies recession in the stock market is highly significant. Machine learning algorithms must be able to appropriately predict the stock return sign during the market downturn and boom days. Addressing the stated challenge will upgrade the quality of stock purchases and, subsequently, will increase profitability. In this article, the proposed solution relies on the utilization of tree-based machine learning algorithms in the context of big data. The proposed solution exploits the decision tree algorithm, which is a traditional and singular tree-based learning algorithm. Furthermore, two modern and ensemble tree-based learning algorithms, random forest and gradient boosted tree, has been utilized for predicting the stock return sign during recession and prosperity. The mentioned cases were implemented by applying the machine learning tools in python programming language and PYSPARK library that is used explicitly for the big data context. The utilized research data of the current study are the shares information of two companies of the Tehran Stock Exchange. The obtained results reveal that the applied ensemble learning algorithms have performed better than the singular learning algorithms. Additionally, adding 23 technical features to the initial data and subsequent applying of the PCA feature reduction method have demonstrated the best performance among other modes. In the meantime, it has been concluded that the initial data do not possess the proper resolution or generalizability, either during prosperity or recession.
The inclusion of human behavioral factors in travel demand analysis is a new trend in transportation planning. Perceived security (PS) is one of the crucial behavioral factors governing urban travelers' choices an...
详细信息
The inclusion of human behavioral factors in travel demand analysis is a new trend in transportation planning. Perceived security (PS) is one of the crucial behavioral factors governing urban travelers' choices and demand analysis, exclusively for active modes of transportation like bicycles and vulnerable road users like women. However, no scientific attempt has been made to explain and quantify PS for women cyclists. We estimate women cyclists’ PS by considering individual, social, and built environment features from the data collected in Tehran, Iran. Fifty-two women were recruited as the test participants in a VR-based bicycle experimental system (VRBES). For analysis, we focused on the tree-based machine learning (ML) methods besides an econometric model. In the end, the random forest (RF) algorithm is chosen and improved for the best performance (with test-R2=0.69) on the dataset, and feature importance showed population, a social feature, is the most important.
In this study, linear and tree-based algorithms were utilized to train the hydrogen-assisted fatigue crack growth test data. Through accuracy verification, it was found that the Gradient Boosting Regression (GB) belon...
详细信息
In this study, linear and tree-based algorithms were utilized to train the hydrogen-assisted fatigue crack growth test data. Through accuracy verification, it was found that the Gradient Boosting Regression (GB) belonging to tree-based algorithms had the best predictive performance. For the model trained by the GB algorithm, SHapley Additive exPlanations (SHAP) values were employed to evaluate the impact of stress intensity factor range, hydrogen pressure, ultimate tensile strength, stress ratio, frequency and chemical compositions on the hydrogenassisted fatigue crack growth. The results show that material properties, experimental conditions and environment all have an effect on the crack growth rate. Besides, the feature influence patterns derived from machine learning models are consistent with the literature, demonstrating that the model has the potential to accurately predict the hydrogen-assisted fatigue crack growth rate of Cr-Mo steel.
A number of machine learning (ML) algorithms using tree-based methodologies were developed to forecast the ultimate strain (epsilon cu) of circular columns wrapped in aramid fiber-reinforced polymer (AFRP). This effor...
详细信息
A number of machine learning (ML) algorithms using tree-based methodologies were developed to forecast the ultimate strain (epsilon cu) of circular columns wrapped in aramid fiber-reinforced polymer (AFRP). This effort aimed to develop and evaluate ML techniques for the assessment of epsilon cu via the use of a dataset including 156 experimental results from 19 published research papers. This purpose led to the development of the decision tree (DT). The hyperparameters of DT, as established by the Beluga whale optimization algorithm (BWOA) and the Golden jackal optimization algorithm (GJOA) (DT(B) and DT(G)), greatly influence DT's effectiveness. The practical novelty of this research lies in its development of more accurate and reliable predictive models for the epsilon cu\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varepsilon }_{cu}$$\end{document} of circular columns wrapped in AFRP by leveraging advanced optimization algorithms. These innovations enhance the precision of structural design and safety assessments, providing engineers with a robust tool for optimizing material selection, reducing costs, and improving the safety and performance of AFRP-wrapped structures across various conditions. According to forecast reliability and measurement variability, U95 evaluates uncertainty accurately. This shows a measure of dependability for informed decision-making. In learning and assessment, DT(G) had the lowest U95 index values-0.2893 and 0.2261. These values were below DT(B)'s 0.323 and 0.2476 throughout training and assessment. based on the variance percentage that was used, the variation percentage among the two models for these metrics-which is at least 8% and sometimes 42%-demonstrates the predictive power and dependability of the DT(G).
This paper is concerned with near-optimal approximation of a given univariate function with elements of a polynomially enriched wavelet frame, a so-called quarklet frame. Inspired by hp-approximation techniques of Bin...
详细信息
This paper is concerned with near-optimal approximation of a given univariate function with elements of a polynomially enriched wavelet frame, a so-called quarklet frame. Inspired by hp-approximation techniques of Binev, we use the underlying tree structure of the frame elements to derive an adaptive algorithm that, under standard assumptions concerning the local errors, can be used to create approximations with an error close to the best tree approximation error for a given cardinality. We support our findings by numerical experiments demonstrating that this approach can be used to achieve inverse-exponential convergence rates.
This research aimed to investigate the effectiveness of Polyethylene-Terephthalate (PET) as a reinforcement material for sandy soils in enhancing the shear strength. To achieve this, different concentrations of PET we...
详细信息
This research aimed to investigate the effectiveness of Polyethylene-Terephthalate (PET) as a reinforcement material for sandy soils in enhancing the shear strength. To achieve this, different concentrations of PET were tested, and 118 sets of data were collected. Parameters such as relative density, normal stress in direct shear strength test, and types of PET elements (1 x 1, 1 x 5, and fiber) were also recorded. Subsequently, four decision tree-oriented machine learning (ML) methods-decision tree (DT), random forest (RF), AdaBoost, and XGBoost-were applied to construct models capable of forecasting enhancements in shear strength. The evaluation of these models' effectiveness was conducted using four established statistical metrics: R2, RMSE, VAF, and A-10. The results showed that AdaBoost results in the highest prediction accuracy among other algorithms, representing the high modelling performance of the algorithm in dealing with complex nonlinear problems. The conducted sensitivity analysis also revealed that relative density is the most crucial parameter for all the algorithms in predicting the output, followed by PET percentage and normal stress. Furthermore, to make the developed model in this study more practical and easy to use, a Graphical User Interface (GUI) was created, enabling the engineers and researchers to perform the analysis straightforwardly.
暂无评论