Bioinformatics models greatly contribute to individualized assessments of cancer patients. However, considerable research neglected some critical technological points, including comparisons of multiple modeling algori...
详细信息
Bioinformatics models greatly contribute to individualized assessments of cancer patients. However, considerable research neglected some critical technological points, including comparisons of multiple modeling algorithms, evaluating gain effects of constructed model, comprehensive bioinformatics analyses and validation of clinical cohort. These issues are worthy of emphasizing, which will better serve future cancer research.
In the era of globalization, cross-border e-commerce has emerged as a significant driver of international trade, necessitating efficient logistics pathways to enhance delivery speed and reduce costs. This study presen...
详细信息
ISBN:
(纸本)9798400707032
In the era of globalization, cross-border e-commerce has emerged as a significant driver of international trade, necessitating efficient logistics pathways to enhance delivery speed and reduce costs. This study presents a comprehensive approach to designing and simulating cross-border e-commerce logistics pathways using machine learning algorithms. We first identify the key factors influencing logistics efficiency, including geographical, regulatory, and demand variability aspects. Utilizing a dataset comprising historical shipping data, we apply various machinelearning techniques, such as decision trees, random forests, and neural networks, to model and predict optimal logistics routes. The proposed model integrates real-time data processing and predictive analytics to dynamically adapt to changing conditions and optimize the logistics pathways in real-time. A Python-based simulation framework is developed to visualize and test the effectiveness of the proposed logistics pathways under different scenarios. The results demonstrate significant improvements in delivery times and cost reductions when compared to traditional logistics strategies. This research not only contributes to the field of logistics and supply chain management but also provides practical insights for e-commerce businesses seeking to enhance their operational efficiency in the competitive cross-border market. Future work will focus on refining the model with additional variables and exploring the integration of blockchain technology to further enhance transparency and traceability in cross-border logistics operations.
Background: This study aimed to apply eight machine learning algorithms to develop the optimal model to predict amputation-free survival (AFS) after first revascularization in patients with peripheral artery disease (...
详细信息
Background: This study aimed to apply eight machine learning algorithms to develop the optimal model to predict amputation-free survival (AFS) after first revascularization in patients with peripheral artery disease (PAD).Methods: Among 2130 patients from 2011 to 2020, 1260 patients who underwent revascularization were randomly assigned to training set and validation set in an 8:2 ratio. 67 clinical parameters were analyzed by lasso regression analysis. Logistic regression, gradient boosting machine, random forest, decision tree, eXtreme gradient boosting, neural network, Cox regression, and random survival forest (RSF) were applied to develop prediction models. The optimal model was compared with GermanVasc score in testing set comprising patients from 2010. Results: The postoperative 1/3/5-year AFS were 90%, 79.4%, and 74.1%. Age (HR:1.035, 95%CI: 1.015-1.056), atrial fibrillation (HR:2.257, 95%CI: 1.193-4.271), cardiac ejection fraction (HR:0.064, 95%CI: 0.009-0.413), Rutherford grade >= 5 (HR:1.899, 95%CI: 1.296-2.782), creatinine (HR:1.03, 95%CI: 1.02-1.04), surgery duration (HR:1.03, 95%CI: 1.01-1.05), and fibrinogen (HR:1.292, 95%CI: 1.098-1.521) were independent risk factors. The optimal model was developed by RSF algorithm, with 1/3/5-year AUCs in training set of 0.866 (95% CI:0.819-0.912), 0.854 (95% CI:0.811-0.896), 0.844 (95% CI:0.793-0.894), in validation set of 0.741 (95% CI:0.580-0.902), 0.768 (95% CI:0.654-0.882), 0.836 (95% CI:0.719-0.953), and in testing set of 0.821 (95%CI: 0.711-0.931), 0.802 (95%CI: 0.684-0.919), 0.798 (95%CI: 0.657-0.939). The c-index of the model outperformed GermanVasc Score (0.788 vs 0.730). A dynamic nomogram was published on shinyapp (https://wyy ***/amputation/). Conclusion: The optimal prediction model for AFS after first revascularization in patients with PAD was developed by RSF algorithm, which exhibited outstanding prediction performance.
The leaves of Morus alba Linn., which is also known as white mulberry, have been commonly used in many of traditional systems of medicine for centuries. In traditional Chinese medicine (TCM), mulberry leaf is mainly u...
详细信息
The leaves of Morus alba Linn., which is also known as white mulberry, have been commonly used in many of traditional systems of medicine for centuries. In traditional Chinese medicine (TCM), mulberry leaf is mainly used for anti-diabetic purpose due to its enrichment in bioactive compounds such as alkaloids, flavonoids and polysaccharides. However, these components are variable due to the different habitats of the mulberry plant. Therefore, geographic origin is an important feature because it is closely associated with bioactive ingredient composition that further influences medicinal qualities and effects. As a low-cost and non-invasive method, surface enhanced Raman spectrometry (SERS) is able to generate the overall fingerprints of chemical compounds in medicinal plants, which holds the potential for the rapid identification of their geographic origins. In this study, we collected mulberry leaves from five representative provinces in China, namely, Anhui, Guangdong, Hebei, Henan and Jiangsu. SERS spectrometry was applied to characterize the fingerprints of both ethanol and water extracts of mulberry leaves, respectively. Through the combination of SERS spectra and machine learning algorithms, mulberry leaves were well discriminated with high accuracies in terms of their geographic origins, among which the deep learningalgorithm convolutional neural network (CNN) showed the best performance. Taken together, our study established a novel method for predicting the geographic origins of mulberry leaves through the combination of SERS spectra with machine learning algorithms, which strengthened the application potential of the method in the quality evaluation, control and assurance of mulberry leaves.
Determining the mechanical properties of plastic concrete samples through experimental investigation is costly and time-consuming. This research used supervised machinelearning (ML) techniques, including Decision Tre...
详细信息
On account of the wide application of the autonomous vehicle system, accurate and reliable location information is of great significance. Since the GPS and the IMU information doesn't perform well on their own res...
详细信息
ISBN:
(纸本)9798400711848
On account of the wide application of the autonomous vehicle system, accurate and reliable location information is of great significance. Since the GPS and the IMU information doesn't perform well on their own respectively, however when merging the GPS and the IMU data from the sensors, the high precision of the navigation system is achieved. While the Extended Kalman Filter is the most prevalent in fusing the data, in this paper, algorithms of machinelearning, the Random Forest Regression, the Decision Tree Regression, the Multiple Linear Regression and the Extreme Gradient Boosting for instance, algorithms of neural network, the Multiple Perceptron Neural Network for instance is applied for sensor fusion. The results from the experiment demonstrate a promising direction for machinelearning and neural network in application in the sensor merging field.
Thyroid disease is the general concept for a medical problem that prevents one's thyroid from producing enough hormones. Thyroid disease can affect everyone-men, women, children, adolescents, and the elderly. Thyr...
详细信息
Thyroid disease is the general concept for a medical problem that prevents one's thyroid from producing enough hormones. Thyroid disease can affect everyone-men, women, children, adolescents, and the elderly. Thyroid disorders are detected by blood tests, which are notoriously difficult to interpret due to the enormous amount of data necessary to forecast results. For this reason, this study compares eleven machine learning algorithms to determine which one produces the best accuracy for predicting thyroid risk accurately. This study utilizes the Sick-euthyroid dataset, acquired from the University of California, Irvine's machinelearning repository, for this purpose. Since the target variable classes in this dataset are mostly one, the accuracy score does not accurately indicate the prediction outcome. Thus, the evaluation metric contains accuracy and recall ratings. Additionally, the F1-score produces a single value that balances the precision and recall when an uneven distribution class exists. Finally, the F1-score is utilized to evaluate the performance of the employed machine learning algorithms as it is one of the most effective output measurements for unbalanced classification problems. The experiment shows that theANNClassifier with an F1-score of 0.957 outperforms the other nine algorithms in terms of accuracy.
This work presents performance analysis of machine learning algorithms such as logistic regression, naive bayes, decision tree, k nearest neighbour, random forest, support vector machine, and extreme gradient boosting...
详细信息
This work presents performance analysis of machine learning algorithms such as logistic regression, naive bayes, decision tree, k nearest neighbour, random forest, support vector machine, and extreme gradient boosting in heart disease prediction. machine learning algorithms are implemented in python using Scikit learn library in Jupiter notebook. Experiments are conducted by training and testing machine learning algorithms using kaggle heart disease dataset under six test cases. Performance of machine learning algorithms are evaluated using accuracy, precision, recall, F1 score and ROC as metrics. Results show random forest reported high accuracy, precision, recall, F1 score and ROC in heart disease prediction compared to other machine learning algorithms in all six test cases. Results show RF is effective in heart disease prediction in Case 3 with 80% train data and 20% test data.
Timely, accurate estimates of forest aboveground carbon density (AGC) are essential for understanding the global carbon cycle and providing crucial reference information for climate-change-related policies. To date, a...
详细信息
Timely, accurate estimates of forest aboveground carbon density (AGC) are essential for understanding the global carbon cycle and providing crucial reference information for climate-change-related policies. To date, airborne LiDAR has been considered as the most precise remote-sensing-based technology for forest AGC estimation, but it suffers great challenges from various uncertainty sources. Stratified estimation has the potential to reduce the uncertainty and improve the forest AGC estimation. However, the impact of stratification and how to effectively combine stratification and modeling algorithms have not been fully investigated in forest AGC estimation. In this study, we performed a comparative analysis of different stratification approaches (non-stratification, forest type stratification (FTS) and dominant species stratification (DSS)) and different modeling algorithms (stepwise regression, random forest (RF), Cubist, extreme gradient boosting (XGBoost) and categorical boosting (CatBoost)) to identify the optimal stratification approach and modeling algorithm for forest AGC estimation, using airborne LiDAR data. The analysis of variance (ANOVA) was used to quantify and determine the factors that had a significant effect on the estimation accuracy. The results revealed the superiority of stratified estimation models over the unstratified ones, with higher estimation accuracy achieved by the DSS models. Moreover, this improvement was more significant in coniferous species than broadleaf species. The ML algorithms outperformed stepwise regression and the CatBoost models based on DSS provided the highest estimation accuracy (R-2 = 0.8232, RMSE = 5.2421, RRMSE = 20.5680, MAE = 4.0169 and Bias = 0.4493). The ANOVA of the prediction error indicated that the stratification method was a more important factor than the regression algorithm in forest AGC estimation. This study demonstrated the positive effect of stratification and how the combination of DSS and the CatBo
A printed circuit board (PCB) surface can fail by corrosion due to various environmental factors. This paper focuses on machinelearning (ML) techniques to build predictive models to forecast PCB surface failure due t...
详细信息
A printed circuit board (PCB) surface can fail by corrosion due to various environmental factors. This paper focuses on machinelearning (ML) techniques to build predictive models to forecast PCB surface failure due to electrochemical migration (ECM) and leakage current (LC) levels under corrosive conditions containing the combination of six critical factors. The modeling methodology in this paper used common supervised ML algorithms by accomplishing significant evaluation metrics to show the performance of each algorithm. The conclusion of this study presents that ML algorithms can create predictive models to forecast PCB failures and estimate LC values effectively and quickly.
暂无评论