Accurate detection and isolation of possible faults are indispensable for operating complex industrial processes more safely, effectively, and economically. In this paper, we propose a fault isolation method for steam...
详细信息
Accurate detection and isolation of possible faults are indispensable for operating complex industrial processes more safely, effectively, and economically. In this paper, we propose a fault isolation method for steam boilers in thermal power plants via classification and regression tree (CART)-based variable ranking. In the proposed method, binary classificationtrees are constructed by applying the CART algorithm to a training dataset which is composed of normal and faulty samples for classifier learning then, to perform faulty variable isolation, variable importance values for each input variable are extracted from the constructed trees. The importance values for non-faulty variables are not influenced by faulty variables, because the values are extracted from the trees with decision boundaries only in the original input space;the proposed method does not suffer from smearing effect. Furthermore, the proposed method, based on the nonparametric CART classifier, can be applicable to nonlinear processes. To confirm the effectiveness, the proposed and comparison methods are applied to two benchmark problems and 250 MW drum-type steam boiler. Experimental results show that the proposed method isolates faulty variables more clearly without the smearing effect than the comparison methods.
Deep learning models have become popular in the analysis of tabular data, as they address the limitations of decision trees and enable valuable applications like semi -supervised learning, online learning, and transfe...
详细信息
Deep learning models have become popular in the analysis of tabular data, as they address the limitations of decision trees and enable valuable applications like semi -supervised learning, online learning, and transfer learning. However, these deep -learning approaches often encounter a trade-off. On one hand, they can be computationally demanding when dealing with large-scale or high -dimensional datasets. On the other hand, they may lack interpretability and may not be suitable for small-scale datasets. In this study, we propose a novel interpretable neural network called Neural classification and regression tree (NCART) to overcome these challenges. NCART is a modified version of Residual Networks that replaces fully -connected layers with multiple differentiable oblivious decision trees. By integrating decision trees into the architecture, NCART maintains its interpretability while benefiting from the end -to -end capabilities of neural networks. The simplicity of the NCART architecture makes it well -suited for datasets of varying sizes and reduces computational costs compared to state-of-the-art deep learning models. Extensive numerical experiments demonstrate the superior performance of NCART compared to existing deep learning models, establishing it as a strong competitor to tree -based models. The code is available at https://***/Luojiaqimath/NCART.
Groundwater is considered one of the most valuable fresh water resources. The main objective of this study was to produce groundwater spring potential maps in the Koohrang Watershed, Chaharmahal-e-Bakhtiari Province, ...
详细信息
Groundwater is considered one of the most valuable fresh water resources. The main objective of this study was to produce groundwater spring potential maps in the Koohrang Watershed, Chaharmahal-e-Bakhtiari Province, Iran, using three machine learning models: boosted regressiontree (BRT), classification and regression tree (CART), and random forest (RF). Thirteen hydrological-geological-physiographical (HGP) factors that influence locations of springs were considered in this research. These factors include slope degree, slope aspect, altitude, topographic wetness index (TWI), slope length (LS), plan curvature, profile curvature, distance to rivers, distance to faults, lithology, land use, drainage density, and fault density. Subsequently, groundwater spring potential was modeled and mapped using CART, RF, and BRT algorithms. The predicted results from the three models were validated using the receiver operating characteristics curve (ROC). From 864 springs identified, 605 (approximate to 70 %) locations were used for the spring potential mapping, while the remaining 259 (approximate to 30 %) springs were used for the model validation. The area under the curve (AUC) for the BRT model was calculated as 0.8103 and for CART and RF the AUC were 0.7870 and 0.7119, respectively. Therefore, it was concluded that the BRT model produced the best prediction results while predicting locations of springs followed by CART and RF models, respectively. Geospatially integrated BRT, CART, and RF methods proved to be useful in generating the spring potential map (SPM) with reasonable accuracy.
Background and objective: Variations in the risk factors for sarcopenia can lead to differences in the likelihood of developing sarcopenia among older adults;however, few studies have explored the interactions among t...
详细信息
Background and objective: Variations in the risk factors for sarcopenia can lead to differences in the likelihood of developing sarcopenia among older adults;however, few studies have explored the interactions among the risk factors. This study examined the interactions among risk factors and identified a discriminative pathway for groups at risk of sarcopenia in community-dwelling older adults. Methods: A cross-sectional study was conducted between July and August 2019 to recruit 200 older adults from an outpatient department of a hospital providing care for older people. Data on various risk factors, namely demographics (age, gender, education, comorbidities, and body mass index [BMI]), dietary habits (weekly consumption of milk, coffee, and meat), lifestyle behaviours (vitamin D supplementation, smoking, drinking, and physical activity), and depression symptoms were collected. Sarcopenia was defined according to the Asian Working Group for Sarcopenia criteria. A classification and regression tree (CART) model was used to examine interactions among these factors and identify groups at risk of sarcopenia. Findings: The prevalence of sarcopenia was 38.5%. The CART model identified two end groups at differential risks of sarcopenia, with a minimum of one and a maximum of three risk factors. In the first group, low BMI (<18.5 kg/m(2)) was a predominant risk factor for sarcopenia among older people. In the second group, older adults with a normal BMI, aged >= 68 years, and without a regular walking habit had a higher probability of developing sarcopenia than did their counterparts. Conclusions: The interactive effects among older age, BMI, and walking may cause different probabilities of developing sarcopenia in the older population. Implications for practice: Older adults with a low or normal BMI but without a regular walking habit could be a predominant risk group for sarcopenia. The appropriate maintenance of body weight and regular walking activity is suggested to
BackgroundPneumonia complicated by septic shock is associated with significant morbidity and mortality. classification and regression tree methodology is an intuitive method for predicting clinical outcomes using bina...
详细信息
BackgroundPneumonia complicated by septic shock is associated with significant morbidity and mortality. classification and regression tree methodology is an intuitive method for predicting clinical outcomes using binary splits. We aimed to improve the prediction of in-hospital mortality in patients with pneumonia and septic shock using decision tree *** and regressiontree models were applied to all patients with pneumonia-associated septic shock in the international, multicenter Cooperative Antimicrobial Therapy of Septic Shock database between 1996 and 2015. The association between clinical factors (time to appropriate antimicrobial therapy, severity of illness) and in-hospital mortality was evaluated. Accuracy in predicting clinical outcomes, sensitivity, specificity, and area under receiver operating curve of the final model was evaluated in training (n=2111) and testing datasets (n=2111).ResultsThe study cohort contained 4222 patients, and in-hospital mortality was 51%. The mean time from onset of shock to administration of appropriate antimicrobials was significantly higher for patients who died (17.2h) compared to those who survived (5.0h). In the training dataset (n=2111), a tree model using Acute Physiology and Chronic Health Evaluation II Score, lactate, age, and time to appropriate antimicrobial therapy yielded accuracy of 73% and area under the receiver operating curve 0.75. The testing dataset (n=2111) had accuracy of 69% and area under the receiver operating curve *** mortality (51%) in patients with pneumonia complicated by septic shock is high. Increased time to administration of antimicrobial therapy, Acute Physiology and Chronic Health Evaluation II Score, serum lactate, and age were associated with increased in-hospital mortality. classification and regression tree methodology offers a simple prognostic model with good performance in predicting in-hospital mortality.
Purpose: To build and validate a decision tree model using classification and regression tree (CART) analysis to distinguish lipoma and lipoma variants from well-differentiated liposarcoma of the extremities and super...
详细信息
Purpose: To build and validate a decision tree model using classification and regression tree (CART) analysis to distinguish lipoma and lipoma variants from well-differentiated liposarcoma of the extremities and superficial trunk. Methods: This retrospective study included patients who underwent surgical resection and preoperative contrast-enhanced MR imaging for lipoma, lipoma variants, and well-differentiated liposarcoma in two tertiary referral centers. Six MRI findings (tumor size, anatomical location, tumor depth, shape, enhancement pattern, and presence of intermingled muscle fibers) and two demographic factors (patient age and sex) were assessed to build a classificationtree using CART analysis with minimal error cross-validation pruning based on a complexity parameter. Results: The model building cohort consisted of 231 patients (186 lipoma and lipoma variants and 45 well-differentiated liposarcoma) from one center, while the validation cohort consisted of 157 patients (136 lipoma and lipoma variants and 21 well-differentiated liposarcoma) from another center. In the CART analysis, the contrast enhancement pattern (no enhancement or thin septal enhancement versus thick septal, nodular, confluent hazy, or solid enhancement) was the first partitioning predictor, followed by a maximal tumor size of 12.75 cm. The tree model allowed distinction of lipoma and lipoma variants from well-differentiated liposarcoma in both the model building cohort (C-statistics, 0.955;sensitivity 80 %, specificity 94.62 %, accuracy 91.77 %) and the external validation cohort (C-statistics, 0.917;sensitivity 66.67 %, specificity 95.59 %, accuracy 91.72 %). Conclusion: The distinction of lipoma and lipoma variants from well-differentiated liposarcoma can be achieved with the simple classificationtree model.
The purpose of this work is to evaluate the ensemble data preprocessing (DP) strategy composing the selected variant of normalization, parametric time warping and baseline correction techniques in varying sequences fo...
详细信息
The purpose of this work is to evaluate the ensemble data preprocessing (DP) strategy composing the selected variant of normalization, parametric time warping and baseline correction techniques in varying sequences for modelling a gas chromatography-mass spectrometry (GC-MS) data via classification and regression tree (CART) algorithm. Firstly, the relative merits between single-DP and ensemble-DP strategies were carefully compared using the best-performing sub-retention time (RT) windows reported elsewhere. Then, all the preprocessed subdatasets were assessed based on predictive capability estimated via the CART algorithm. Performances of CART models were estimated from 50 pairs of training and testing samples that were prepared by a stratified random resampling method. Then, the three shortlisted sub-datasets were further evaluated using increased pairs of training and testing samples. Additionally, the most discriminative RT points were also identified using the three sub-datasets. Eventually, the most desired CART model was constructed using the shortlisted RT points after being treated by the most outstanding DP strategy. Results showed that 3-DP strategies tended to outperform the 1-DP and 2-DP strategies. However, the sequence of application must be carefully optimized as not all the 3-DP strategies induced positive impacts. It was found that the data aligned before baseline correction or normalization will likely outperform those being first normalized or baseline corrected. In conclusion, the untargeted GC-MS data of neat gasoline preferably be first aligned, followed by normalization, and ended by baseline correction.
This study aims to predict amounts of waste generated from detached house by their types and materials when they are constructed and *** achieve this objective, required data was collected based on material informatio...
详细信息
This study aims to predict amounts of waste generated from detached house by their types and materials when they are constructed and *** achieve this objective, required data was collected based on material information of the buildings before *** using the established database, CART analysis was conducted, with the buildings' types and materials as analysis factors, to identify what affects the generation of waste concrete and to estimate indicators of waste *** results were as ***, the most influential factor on generation of waste concrete was types of ***, generation of waste concrete of RC-type and wood-type buildings was not affected by materials, while that of masonry-type buildings was affected by roof ***-type buildings were divided into two categories by roof materials: i) buildings with slab and slab with roof tiles as roof materials and ii) buildings with roof tiles and slate as roof ***, amounts of waste concrete of RC-type and wood-type buildings were 0.324m/m and 0.018 m/m, respectively, and those of masonry-type buildings with roof materials of slab and slab with roof tiles and the other masonrytype buildings were 0.127 m/m and 0.040 m/m, respectively.
OBJECTIVE To evaluate several methods of predicting prostate cancer-related outcomes, i.e. nomograms, look-up tables, artificial neural networks (ANN), classification and regression tree (CART) analyses and risk-group...
详细信息
OBJECTIVE To evaluate several methods of predicting prostate cancer-related outcomes, i.e. nomograms, look-up tables, artificial neural networks (ANN), classification and regression tree (CART) analyses and risk-group stratification (RGS) models, all of which represent valid alternatives. METHODS We present four direct comparisons, where a nomogram was compared to either an ANN, a look-up table, a CART model or a RGS model. In all comparisons we assessed the predictive accuracy and performance characteristics of both models. RESULTS Nomograms have several advantages over ANN, look-up tables, CART and RGS models, the most fundamental being a higher predictive accuracy and better performance characteristics. CONCLUSION These results suggest that nomograms are more accurate and have better performance characteristics than their alternatives. However, ANN, look-up tables, CART analyses and RGS models all rely on methodologically sound and valid alternatives, which should not be abandoned.
IntroductionAmong women with a fetus with a non-cephalic presentation, external cephalic version (ECV) has been shown to reduce the rate of breech presentation at birth and cesarean birth. Compared with ECV at term, b...
详细信息
IntroductionAmong women with a fetus with a non-cephalic presentation, external cephalic version (ECV) has been shown to reduce the rate of breech presentation at birth and cesarean birth. Compared with ECV at term, beginning ECV prior to 37weeks' gestation decreases the number of infants in a non-cephalic presentation at birth. The purpose of this secondary analysis was to investigate factors associated with a successful ECV procedure and to present this in a clinically useful format. Material and methodsData were collected as part of the Early ECV Pilot and Early ECV2 Trials, which randomized 1776 women with a fetus in breech presentation to either early ECV (34-36weeks' gestation) or delayed ECV (at or after 37weeks). The outcome of interest was successful ECV, defined as the fetus being in a cephalic presentation immediately following the procedure, as well as at the time of birth. The importance of several factors in predicting successful ECV was investigated using two statistical methods: logistic regression and classification and regression tree (CART) analyses. ResultsAmong nulliparas, non-engagement of the presenting part and an easily palpable fetal head were independently associated with success. Among multiparas, non-engagement of the presenting part, gestation less than 37weeks and an easily palpable fetal head were found to be independent predictors of success. These findings were consistent with results of the CART analyses. ConclusionsRegardless of parity, descent of the presenting part was the most discriminating factor in predicting successful ECV and cephalic presentation at birth.
暂无评论