检索结果-内蒙古大学图书馆

Euro-Asia Conference on Frontiers of Computer Science and Information Technology (FCSIT)

作者： Liu, Guoqiang Xiao, Ruochen Li, Wenkai Zhang, Youcheng Blended Learning MIT Apple Sin Project Cambridge MA 02139 USA

ISBN: (纸本)9781665463539

We propose using the Sequence Classification modeling, shap algorithm and masked-language modeling (MLM) for the task of text style transfer. To tackle cases when no parallel source-target pairs are available, we train Sequence Classification model based on Bert model with SST-2 task of GLUE for both source and target domain;and we use shap values, which are computed based on Sequence Classification model we gained, to detect and then delete words associated with original attributes. The deleted tokens are replaced by MLM trained with the target domain to retrieve new phrases associated with the target attributes. Based on this, we detect the part of speech (POS) of each word in the sentence in order to replace the suitable positions without much impact on the semantics. Additionally, we use GloVe to determine semantic similarity between the word generated by MLM and the original word so that we can trade off content versus attribute by using grid search to gain their weighting percentage. The experiments show that our methods improve style conversion rate by 9.7% and get a semantic similarity compared to original contents 28.2% on average higher than best previous system.

关键词： Masked language modeling shap algorithm Glove Spacy Sequence Classification modeling

来源：评论

学校读者我要写书评

暂无评论

Remote sensing inversion of water quality parameters (TSM, Chl-a, and CDOM) in subtidal seaweed beds and surrounding waters

引用

ECOLOGICAL INDICATORS 2024年 167卷

作者： Chen, Jianqu Wang, Kai Li, Xunmeng Zhao, Xu Cheng, Xiaopeng Liu, Zhangbin Zhang, Jian Zhang, Shouyu Shanghai Ocean Univ Coll Oceanog & Ecol Sci Shanghai 201306 Peoples R China Shanghai Ocean Univ Engn Technol Res Ctr Marine Ranching Shanghai 201306 Peoples R China Zhejiang Ocean Univ Natl Engn Res Ctr Marine Aquaculture Zhoushan 316002 Peoples R China Guangdong Prov Key Lab Marine Biotechnol Shantou Peoples R China Minist Nat Resources Key Lab Marine Ecol Conservat & Restorat Fujian Prov Key Lab Marine Ecol Conservat & Restor Xiamen Peoples R China Chinese Acad Fishery Sci East China Sea Fisheries Res Inst Shanghai 200090 Peoples R China Kyushu Univ Fac Agr Lab Marine Environm Sci Fukuoka 8190395 Japan Hokkaido Univ Grad Sch Environm Sci Div Biosphere Sci Sapporo 0600811 Japan

Due to environmental factors such as water transparency, subtidal seaweed beds are often challenging to observe directly via satellite. However, the presence of seaweed beds can lead to variations in the concentrations of total suspended matter (TSM), chlorophyll-a (Chl-a), and chromophoric dissolved organic matter (CDOM) in the surrounding waters. This study focuses on the seaweed beds around Gouqi Island, Zhejiang, integrating several months of in-situ water quality sampling data with PlanetScope satellite imagery to develop inversion models for water quality parameters using Random Forest (RF), Gradient Boosting Decision Tree (GBDT), and Support Vector Regression (SVR) algorithms. By analyzing the differences in water quality parameters between areas with seaweed beds and those without, we explored the underlying causes of these variations and proposed an indirect method for estimating the distribution range of underwater seaweed. This research not only provides a new perspective and technical approach for marine resource management but also contributes significant foundational data and scientific evidence for the conservation of coastal zone ecosystems.

关键词： Seaweed bed Water quality inversion Machine learning regression shap algorithm Indirect estimation method

来源：评论

学校读者我要写书评

暂无评论

Machine learning driven post-impact damage state prediction for performance-based crashworthiness design of bridge piers

引用

ENGINEERING STRUCTURES 2023年第1期292卷

作者： Zhou, Chang Xie, Yazhou Wang, Wenwei Zheng, Yuzhou Southeast Univ Sch Transportat Nanjing 211189 Peoples R China McGill Univ Dept Civil Engn Montreal PQ H3A 0C3 Canada Army Engn Univ PLA Sch Field Engn Nanjing 210042 Peoples R China

This study applies machine learning (ML) methods to predict post-impact damage states of reinforced concrete (RC) bridge piers under vehicle collision. 251 datasets of various vehicle-bridge collision scenarios are synthesized for training and testing six supervised ML models, including K-Nearest Neighbors (KNN), Support Vector Machine (SVM), Decision Tree, Random Forest, eXtreme Gradient Boosting Trees (XGBoost), and Artificial Neural Network (ANN). Comparisons on confusion matrices indicate that SVM, Random Forest, XGBoost, and ANN possess superior and comparable classification capabilities. ML models also achieve a much higher level of accuracy when compared with existing empirical models in the literature. Furthermore, the shapley additive explanations (shap) algorithm is utilized to interpret and explain the prediction process of ML models. In particular, the shapley value of each feature captures its positive or negative contribution for the ML model to predict each damage state, where the most influential design variables include impact speed, truck mass, engine mass, and pier diameter. To facilitate the performance-based crashworthiness design of RC bridge piers, an endto-end interactive software is devised to automatically predict impact damage states using the top three ML models against any given design scenario. Real-time interactive illustrations are also provided to elucidate the shapley value contribution of each design parameter for the Random Forest model to reach each damage state. Finally, the final damage state is selected to have the highest likelihood of damage among the three ML model predictions.

关键词： Vehicle -bridge collision Post -impact damage state Machine learning Model interpretation shap algorithm Software development

来源：评论

学校读者我要写书评

暂无评论

Building gender-specific sexually transmitted infection risk prediction models using CatBoost algorithm and NHANES data

引用

BMC MEDICAL INFORMATICS AND DECISION MAKING 2024年第1期24卷 1页

作者： Hu, Mengjie Peng, Han Zhang, Xuan Wang, Lefeng Ren, Jingjing Zhejiang Univ Sch Med Affiliated Hosp 1 Dept Gen Practice Hangzhou 310003 Peoples R China Hangzhou Med Coll Zhejiang Prov Peoples Hosp Affiliated Peoples Hosp Clin Res Inst Hangzhou Peoples R China Zhejiang Univ Sch Med Affiliated Hosp 1 Dept Cardiol Hangzhou 310003 Peoples R China Zhejiang Univ Coll Med Affiliated Hosp 1 Kidney Dis Ctr Hangzhou 310003 Peoples R China

Background and aimsSexually transmitted infections (STIs) are a significant global public health challenge due to their high incidence rate and potential for severe consequences when early intervention is neglected. Research shows an upward trend in absolute cases and DALY numbers of STIs, with syphilis, chlamydia, trichomoniasis, and genital herpes exhibiting an increasing trend in age-standardized rate (ASR) from 2010 to 2019. Machine learning (ML) presents significant advantages in disease prediction, with several studies exploring its potential for STI prediction. The objective of this study is to build males-based and females-based STI risk prediction models based on the CatBoost algorithm using data from the National Health and Nutrition Examination Survey (NHANES) for training and validation, with sub-group analysis performed on each STI. The female sub-group also includes human papilloma virus (HPV) *** study utilized data from the National Health and Nutrition Examination Survey (NHANES) program to build males-based and females-based STI risk prediction models using the CatBoost algorithm. Data was collected from 12,053 participants aged 18 to 59 years old, with general demographic characteristics and sexual behavior questionnaire responses included as features. The Adaptive Synthetic Sampling Approach (ADASYN) algorithm was used to address data imbalance, and 15 machine learning algorithms were evaluated before ultimately selecting the CatBoost algorithm. The shap method was employed to enhance interpretability by identifying feature importance in the model's STIs risk *** CatBoost classifier achieved AUC values of 0.9995, 0.9948, 0.9923, and 0.9996 and 0.9769 for predicting chlamydia, genital herpes, genital warts, gonorrhea, and overall STIs infections among males. The CatBoost classifier achieved AUC values of 0.9971, 0.972, 0.9765, 1, 0.9485 and 0.8819 for predicting chlamydia, genital herpes, genital warts, gonorrhea

关键词： Sexually transmitted infections CatBoost algorithm NHANES data shap algorithm

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：