检索结果-内蒙古大学图书馆

Investigating landslide data balancing for susceptibility mapping using generative and machine learning models

LANDSLIDES 2025年第1期22卷 189-204页

作者： Jiang, Yuhang Wang, Wei Zou, Lifang Cao, Yajun Xie, Wei-Chau Hohai Univ Geotech Res Inst Nanjing 210098 Jiangsu Peoples R China Hohai Univ Key Lab Minist Educ Geomech & Embankment Engn Nanjing 210098 Jiangsu Peoples R China Hohai Univ Sch Earth Sci & Engn Nanjing 211100 Jiangsu Peoples R China Univ Waterloo Dept Civil & Environm Engn 200 Univ Ave West Waterloo ON N2L 3G1 Canada

With the development and application of machine learning, significant advances have been made in landslide susceptibility mapping. However, due to challenges in actual field landslide investigations, current landslide susceptibility mapping is usually characterized by insufficient landslide samples (positive samples) and low reliability of non-landslide samples (negative samples). Considering Lianghe County in Yunnan Province, China, as an example, this paper aims to research the effectiveness of three oversampling models in generating positive samples for landslides: conditional tabular generative adversarial networks (CTGAN), generative adversarial networks (GAN), and the traditional Synthetic Minority Oversampling Technique (SMOTE) algorithms. Additionally, three machine learning methods, including 1D Convolutional Neural Network-Long Short-Term Memory Neural Network (CNN-LSTM), Random Forest (RF), and Gradient Boosting Decision Tree (GBDT) classifiers, are used for landslide susceptibility assessment. We also devise a non-landslide data (negative samples) screening method utilizing a self-trained support vector machine within a semi-supervised framework. The results show that by training on the dataset after negative sample screening, the AUC values for the 1D-CNN-LSTM, RF, and GBDT models have shown significant improvement, increasing from (0.778, 0.869, 0.849) to (0.837, 0.936, 0.877). Compared with the original training set, the prediction accuracy of the three machine learning models is improved after training on the augmented data by CTGAN, GAN, and SMOTE models. The RF model, augmented with 200 positive samples generated by CTGAN, achieves the highest prediction accuracy in the study (AUC = 0.962). The 1D CNN-LSTM model achieves its highest prediction accuracy (AUC = 0.953) when augmented with 200 positive samples from GAN. Similarly, the GBDT model reaches its highest prediction accuracy (AUC = 0.928) when augmented with 200 positive samples created by SM

关键词： Landslide susceptibility mapping conditional tabular generative adversarial networks Convolutional Neural Network Long Short-Term Memory Neural Network Self-training semi-supervised SVM algorithm

来源：评论

学校读者我要写书评

暂无评论

Data-driven estimates of the strength and failure modes of CFRP-steel bonded joints by implementing the CTGAN method

引用

ENGINEERING FRACTURE MECHANICS 2024年 299卷

作者： Wang, Songbo Stratford, Tim Li, Yang Li, Biao Hubei Univ Technol Sch Civil Engn Architecture & Environm Wuhan 430068 Peoples R China Hubei Univ Technol Key Lab Intelligent Hlth Percept & Ecol Restorat R Minist Educ Wuhan 430068 Peoples R China Univ Edinburgh Inst Infrastructure & Environm Sch Engn Edinburgh EH9 3FG Scotland

The bond strength between the CFRP and steel usually dominates the final strengthened effectiveness. However, the CFRP-steel bond strength is affected by various geometric and material properties and exhibits different failure modes, making accurate predictions challenging. This study utilises data -driven machine learning (ML) methods to predict the strength and failure modes of CFRP-steel joints. An experimental dataset consisting of 178 single -lap shear test results was first built, after which the conditional tabular generative adversarial networks (CTGAN) method was applied to augment the limited available data. Four broadly used ML algorithms: Support Vector Machines (SVM), K -Nearest Neighbours (KNN), Decision Trees (DT) and Artificial Neural networks (ANN) were applied. The ANN regression model achieved the best performance in predicting joint strength (R-test(2) = 0.95), while the SVM classification model achieved the best performance in predicting failure modes (accuracy >= 92.3 %). The SHapley Additive exPlanations analysis further revealed that the Young's modulus of the adhesive was most significant to the joint strength, while the tensile strength of the adhesive was most significant to the failure modes. The ultimately constructed ML models and the corresponding analyses presented can benefit practical structural engineering applications and provide insights into the optimal CFRP-steel joint design.

关键词： CFRP-steel bonded joint Joint strength Failure modes Data-driven machine learning conditional tabular generative adversarial networks SHAP feature importance

来源：评论

学校读者我要写书评

暂无评论

Enhanced classification of hydraulic testing of directional control valves with synthetic data generation

引用

PRODUCTION ENGINEERING-RESEARCH AND DEVELOPMENT 2023年第5期17卷 669-678页

作者： Neunzig, Christian Moellensiep, Dennis Hartmann, Melanie Kuhlenkoetter, Bernd Moeller, Matthias Schulz, Juergen Ruhr Univ Bochum Chair Prod Syst Industriestr 38C D-44894 Bochum Germany Bosch Rexroth AG Bexbacher Str 72 D-66424 Homburg Germany

Production environments bring inherent system challenges that are reflected in the high-dimensional production data. The data is often nonstationary, is not available in sufficient size and quality, and is class imbalanced due to the predominance of good parts. Data-driven manufacturing analytics requires data of sufficient quantity and quality. In order to predict quality characteristics, production data is collected across processes in the industrial use case at Bosch Rexroth AG for the purpose of inferring results in hydraulic final inspection using machine learning methods. Since high quality data generation is costly, synthetic data generation methodologies offer a promising alternative to improve prediction models and thus generate safer, more accurate predictions for manufacturing companies. Among the synthetic data generation methodologies used, variational autoencoders compared to generative adversarial networks and synthetic minority oversampling technique methods are best suited to synthesize the feature with highest feature importance from a small sample data set compared to the production data and improve the prediction for the target variable.

关键词： Predictive quality Quality control Machine learning conditional tabular generative adversarial networks tabular variational autoencoder Semi-supervised learning

来源：评论

学校读者我要写书评

暂无评论

CTGAN-based oversampling and cost-sensitive deep neural network to predict smart money activity in stock market

引用

International Journal of Information Technology (Singapore) 2025年第3期17卷 1489-1499页

作者： Baro, Pranita Borah, Malaya Dutta Department of Computer Science and Engineering National Institute of Technology Assam Silchar 788010 India

Investment in the stock market has become a trend in today’s era. The primary force moving the market in a specific direction is the large buying and selling of hedge funds, pension funds, banks, etc. This paper proposes a deep learning strategy to predict smart money activity. Initially, a framework composed of technical analysis and quantitative analysis is considered to create a dataset. Then using this framework a comprehensive dataset is built from the data available in the National Stock Exchange of India (https://***/). In the proposed approach, minority samples in the dataset are first oversampled using conditional tabular generative adversarial Network based approach to even out class imbalance in the real-life dataset. Using the modified dataset, a cost-sensitive deep neural network is trained to predict smart money activity in the stock market. The proposed method is assessed using different evaluation metrics and the findings validate the superiority of the proposed methodology. © Bharati Vidyapeeth's Institute of Computer Applications and Management 2024.

关键词： conditional tabular generative adversarial networks Cost-sensitive Imbalance classification Neural network Smart money

来源：评论

学校读者我要写书评

暂无评论

Warts Disease Treatment Suggestion Using Images and Numerical Data 3

Warts Disease Treatment Suggestion Using Images and Numerica...

引用

International Conference on Advances in Computing, Communication and Applied Informatics (ACCAI)

作者： Yatin, Daketi Avinash, Challa Hemavathi, D. SRM Inst Sci & Technol Dept Data Sci & Business Syst Sch Comp Chennai 603203 Tamil Nadu India

ISBN: (纸本)9798350389432;9798350389449;9798350389456

To classify wart treatment methods, this research paper examines the effectiveness of using machine learning (ML) and deep learning algorithms in conjunction with numerical and image data. Human papillomavirus (HPV)-induced warts are a common dermatological concern. Several factors can affect the severity and spread of these lesions. Making use of both picture and numerical data, the study suggests a thorough method for classifying treatments. The paper shows that the suggested methodology is effective through thorough experimentation. The study achieves remarkable classification accuracy, specifically 91.67% for cryotherapy and 85% for immunotherapy datasets, by utilising machine learning and deep learning algorithms. Notably, accuracy rates of 76% for cryotherapy and 74% for immunotherapy are obtained by combining synthetic and raw data, demonstrating the potential benefits of integrating various data sources. The study adds a comprehensive framework that makes accurate classification of wart treatments possible. The model provides a comprehensive understanding of wart types and treatment outcomes by combining image analysis with numerical data. This creative method uses both quantitative and visual data to enable users to make well-informed decisions. All things considered, the study highlights the potential of AI and ML methods to improve the classification of wart treatments, offering dermatologists and other medical professionals a useful tool. This study is a major step towards more individualised and data-driven dermatological care strategies, which could lead to better patient outcomes and more effective treatments.

关键词： Warts cryotherapy immunotherapy treatment prediction machine learning convolution neural network conditional tabular generative adversarial networks

来源：评论

学校读者我要写书评

暂无评论

Improving Intrusion Detection for Imbalanced Network Traffic using generative Deep Learning

引用

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS 2022年第4期13卷 959-967页

作者： Alqarni, Amani A. El-Alfy, El-Sayed M. Univ Hafr Al Batin Coll Comp Sci & Engn Dept Comp Sci & Engn Hafar al Batin 39524 Saudi Arabia King Fahd Univ Petr & Minerals Informat & Comp Sci Dept Interdisciplinary Res Ctr Intelligent Secure Syst Coll Comp & Math Dhahran 31261 Saudi Arabia

Network security has become a serious issue since networks are vulnerable and subject to increasing intrusive activities. Therefore, network intrusion detection systems (IDSs) are an essential component to defend against these activities. One of the biggest issues encountered by IDSs is the class imbalance problem which leads to a biased performance by most machine learning models to normal activities (majority class). Several techniques were proposed to overcome the class imbalance problem such as resampling, cost-sensitive, and ensemble learning techniques. Other issues related to intrusion detection data include mixed data types, and non-Gaussian and multimodal distributions. In this study, we employed a conditional tabular generative adversarial network (CTGAN) model with common machine learning algorithms to construct more effective detection systems while addressing the imbalance issue. CTGAN can generate samples of the minority class during training to make the dataset more balanced. To assess the effectiveness of the proposed IDS, we combined CTGAN with three machine learning algorithms: support vector machine (SVM), K-nearest neighbor (KNN), and decision tree (DT). The imbalanced NSLKDD dataset was used and several experiments were conducted. The results showed that CTGAN can improve the performance of imbalance learning for intrusion detection with SVM and DT. On the other hand, KNN showed no improvement in the performance since it is less sensitive to the class imbalance problem. Moreover, the results proved that CTGAN can capture the distribution of discrete features better than continuous features.

关键词： Intrusion detection machine learning imbalance learning conditional tabular generative adversarial networks

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：