检索结果-内蒙古大学图书馆

22nd Annual international conference on Computational Science (ICCS)

作者： Kuk, Michal Bobek, Szymon Nalepa, Grzegorz J. AGH Univ Sci & Technol Krakow Poland Jagiellonian Univ Jagiellonian Human Ctr Artificial Intelligence La Krakow Poland Jagiellonian Univ Inst Appl Comp Sci Krakow Poland

ISBN: (纸本)9783031087578;9783031087561

Explainable Artificial Intelligence (XAI) aims at introducing transparency and intelligibility into the decision-making process of AI systems. In recent years, most efforts were made to build XAI algorithms that are able to explain black-box models. However, in many cases, including medical and industrial applications, the explanation of a decision may be worth equally or even more than the decision itself. This imposes a question about the quality of explanations. In this work, we aim at investigating how the explanations derived from black-box models combined with XAI algorithms differ from those obtained from inherently interpretable glass-box models. We also aim at answering the question whether there are justified cases to use less accurate glass-box models instead of complex black-box approaches. We perform our study on publicly available datasets.

关键词： Explainable AI Machine learning Artificial intelligence data mining

来源：评论

学校读者我要写书评

暂无评论

Effective Intrusion Detection in WSNs: Evaluating Machine Learning models on WSN DS and WUSTL EHMS-2020 datasets

Effective Intrusion Detection in WSNs: Evaluating Machine Le...

引用

international conference on Electrical, Computer and Communication Engineering (ECCE)

作者： Ziaur Rahman Saiful Islam Department of Electronics & Telecommunication Engineering Chittagong University of Engineering & Technology(CUET) Chittagong Bangladesh

ISBN: (数字)9798350357509

ISBN: (纸本)9798350357516

This study addresses the detection of DoS (Denial of Service) attacks in WSNs (Wireless Sensor Networks), which are constrained by limited resources and node vulnerabilities. The primary goal is to improve the precision and reliability of IDSs through advanced machine learning techniques. Firstly, data preprocessing involved handling missing values, applying feature scaling (standardization and min-max normalization), encoding labels, and balancing the dataset using the SMOTE (Synthetic Minority Over-sampling Technique). Secondly, feature selection was conducted using Recursive Feature Elimination (RFE) to ensure high-quality input data. The study applies a range of machine learning algorithms, including MLP Classifier, Decision Tree, K-Nearest Neighbors, Gaussian Naive Bayes, SGD Classifier, Logistic Regression, Random Forest, and Voting Classifier, to assess their performance for the classification task. WSN-DS and WUSTL-EHMS-2020 datasets are used to evaluate effective intrusion detection systems (IDSs) for WSNs. Results showed that the Ensembles Learning methods achieved the highest accuracy, while tree-based models were particularly effective in detecting DoS attacks. These findings highlight the need for further research on real-time and hybrid detection strategies to enhance IDS performance in WSNs.

关键词： Wireless sensor networks Logistic regression Machine learning algorithms Computational modeling Intrusion detection Standardization Nearest neighbor methods Real-time systems Reliability Random forests

来源：评论

学校读者我要写书评

暂无评论

Incentivizing Smart Vehicles for Autonomous Driving HD Maps using predictive analytics

Incentivizing Smart Vehicles for Autonomous Driving HD Maps ...

引用

2023 international conference on Machine Vision, Image processing and Imaging Technology, MVIPIT 2023

作者： Khattak, Muhammad Ilyas Hui, Yuan Ullah, Inam Khan, Ajmal Ahmad, Ayaz Shandong University School of Control Science and Engineering Shandong province Jinan City China Depart. of Computer Science Shandong province Jinan City China Woosong University AI & Big Data Depart. Daejeon Korea Republic of Depart. of Electrical and Computer Engineering Wah Cantt Pakistan

ISBN: (纸本)9798350306545

Recent advancements in distributed data storage, colloborative processing capabilities, and their architectural design have a profound influence on our daily lives, simplifying the process of hosting computation-intensive software applications. A single software hosting device, for instance, an Onboard Computation Unit (OBCU) of an autonomous vehicle, on the other hand, can't keep up with the growing need for diverse requirements of computing architectures and processing power of such applications. For example, the real-time handling and generation of HD maps is one such crucial and highly computation-intensive application for autonomous vehicles. For this purpose, an efficient solution is to rely on the computing power of adjacent resource-rich nodes. Hence, the use of a latency-aware predictive analytics-based computation task offloading method has a vital significance, especially with the advent of Vehicular Edge and Fog Computing (VEFC). VEFC models pool redundant computational resources in close proximity. It, however, faces several challenges. The stochastic nature of vehicular networks, intricate heterogeneity at several levels, and the uncertainty of time-sensitive applications for the timely completion of tasks impede the VEFC's efficiency. We developed a computation offloading framework that timely alleviates the computing-resource-deficit devices by deploying vehicles as fog-nodes commensurate with the minimum latency requirements of advanced vehicular applications, especially computation-intensive HD map handling, and generation in real-time. Our technique is a greedy heuristic method that works in small, discrete steps instead of solving the entire optimization problem holistically. We used Monte Carlo simulation to show that our approach had an overall response time of under-300 milliseconds compared to other baseline methods. © 2023 IEEE.

关键词： Computation offloading

来源：评论

学校读者我要写书评

暂无评论

TabMentor: Detect Errors on Tabular data with Noisy Labels 9th

TabMentor: Detect Errors on Tabular Data with Noisy Labels

引用

19th international conference on Advanced data Mining and Applications, ADMA 2023

作者： Zhang, Yaru Qin, Jianbin Wang, Yaoshu Ali, Muhammad Asif Ji, Yan Mao, Rui Shenzhen Institute of Computing Sciences Shenzhen University Shenzhen China King Abdullah University of Science and Technology Thuwal Saudi Arabia

ISBN: (纸本)9783031466700

Existing supervised methods for error detection require access to clean labels in order to train the classification models. This is difficult to achieve in practical scenarios. While the majority of the error detection algorithms ignore the effect of noisy labels, in this paper, we design effective techniques for error detection when both data and labels contain noise. Nevertheless, we present TabMentor, a novel deep-learning model for error detection on tabular data with noisy training labels. TabMentor introduces a deep model for the prediction, i.e., Tabclassifier that suggests the most salient features for the decision step, enabling efficient learning. For feature extraction, it uses existing error detection algorithms, along with some raw features from the datasets. To reduce the negative effect of noisy training labels on the model, TabMentor uses another deep model, i.e., Teachernet, to supervise the training of Tabclassifier. During the training process, both Teachernet and Tabclassifier dynamically learn curriculum from data, allowing Tabclassifier to focus more on clean labeled samples. Performance evaluation using five different data sets shows that the TabMentor excels over the best baseline error detection system by 0.05 to 0.11 in terms of F1 scores. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2023.

关键词： Error detection

来源：评论

学校读者我要写书评

暂无评论

Clinical Text Classification in Cancer Real-World data in Spanish 10th

Clinical Text Classification in Cancer Real-World Data in Sp...

引用

10th international Work-conference on Bioinformatics and Biomedical Engineering (IWBBIO)

作者： Moreno-Barea, Francisco J. Mesa, Hector Ribelles, Nuria Alba, Emilio Jerez, Jose M. Univ Malaga Escuela Tecn Super Ingn Informat Dept Lenguajes & Ciencias Computac Malaga Spain Hosp Univ Reg & Virgen de la Victoria Unidad Gest Clin Interctr Oncol Malaga Spain

ISBN: (纸本)9783031349522;9783031349539

Healthcare systems currently store a large amount of clinical data, mostly unstructured textual information, such as electronic health records (EHRs). Manually extracting valuable information from these documents is costly for healthcare professionals. For example, when a patient first arrives at an oncology clinical analysis unit, clinical staff must extract information about the type of neoplasm in order to assign the appropriate clinical specialist. Automating this task is equivalent to text classification in natural language processing (NLP). In this study, we have attempted to extract the neoplasm type by processing Spanish clinical documents. A private corpus of 23, 704 real clinical cases has been processed to extract the three most common types of neoplasms in the Spanish territory: breast, lung and colorectal neoplasms. We have developed methodologies based on state-of-the-art text classification task, strategies based on machine learning and bag-of-words, based on embedding models in a supervised task, and based on bidirectional recurrent neural networks with convolutional layers (C-BiRNN). The results obtained show that the application of NLP methods is extremely helpful in performing the task of neoplasm type extraction. In particular, the 2-BiGRU model with convolutional layer and pre-trained fastText embedding obtained the best performance, with a macro-average, more representative than the micro-average due to the unbalanced data, of 0.981 for precision, 0.984 for recall and 0.982 for F1-score.

关键词： Text Classification Natural Language processing Electronic Health Records Neoplasm cancer Spanish

来源：评论

学校读者我要写书评

暂无评论

Exploration of Applying Privacy Protection algorithms in Financial Fraud Detection: A Comparative Study of SecureBoost and Neural Networks

Exploration of Applying Privacy Protection Algorithms in Fin...

引用

Intelligent Systems and Computational Networks (ICISCN), international conference on

作者： Fei Wang Xue Liu School of Management Shanghai University China Basic Course Department Wuhan Donghu University Wuhan China

ISBN: (数字)9798331529246

ISBN: (纸本)9798331529253

This study explores the significant impact of corporate financial information disclosure on investor decision-making and economic policy-making. Financial fraud may lead to information distortion and disrupt market order, therefore the identification of financial fraud has always been a research focus. Previous studies have mainly relied on financial and non-financial data disclosed by enterprises, while Internet information is more indicative in identifying financial fraud. However, using Internet data will face copyright problems, and crawler technology is not the optimal solution. Information disclosure and transaction costs also limit its economic feasibility. To address these issues, this article adopts privacy preserving machine learning technology, which avoids legal, technical, and economic barriers by generating model parameters instead of using raw data. Based on 16112 samples from 2012 to 2020, this paper collects financial, non-financial and Internet information, and constructs three models: Model 1 is only based on financial and non-financial data, Model 2 adds Internet information on this basis, and Model 3 combines two privacy protection algorithms - SecureBoost and vertical neural network. The experimental results show that Model 2 improves accuracy by 7% to 10% compared to Model 1, while Model 3 further optimizes model performance while ensuring data privacy. This paper theoretically and empirically verifies the necessity of introducing Internet information, and the application potential of privacy protection machine learning technology in financial fraud detection.

关键词： data privacy Analytical models Privacy Machine learning algorithms Neural networks Finance data models Internet Fraud Protection

来源：评论

学校读者我要写书评

暂无评论

Classification and action rules in identification and self-care assessment problems

引用

TECHNOLOGY AND HEALTH CARE 2022年第1期30卷 257-269页

作者： Zdrodowska, Malgorzata Dardzinska-Glebocka, Agnieszka Bialystok Tech Univ Fac Mech Engn Inst Biomed Engn Ul Wiejska 45C PL-15351 Bialystok Poland Bialystok Tech Univ Fac Mech Engn Inst Engn Mech Bialystok Poland

BACKGROUND: Disability, especially in children, is a very important and current problem. Lack of proper diagnosis and care increases the difficulty for children to adapt to disabilities. Disabled children have many problems with basic activities of daily living. Therefore, it is very important to support diagnosticians and physiotherapists in recognizing self-care problems in children. OBJECTIVE: The aim of this paper is to extract classification and action rules, useful for those who work with children with disabilities. METHODS: First, features and their impact on the accuracy of classification are determined. Then, two models are built: one with all features and one with selected ones. For these models the classification rules are extracted. Finally, action rules are mined and the next step in treatment process is predicted. RESULTS: Seventeen features with the greatest impact on classifying a child into a particular group of self-care problems were identified. Based on the implemented algorithms, decision and action rules were obtained. CONCLUSIONS: The obtained model, selected attributes and extracted classification and action rules can support the work of therapists and direct their work to those areas of disability where even a minimal reduction of features would be of great benefit to the children.

关键词： Disability ICF-CY self-care problem classification rules action rules feature selection classification data mining

来源：评论

学校读者我要写书评

暂无评论

Application of Nonstationary Time Series Prediction to Shanghai Stock Index Based on SVM 22

Application of Nonstationary Time Series Prediction to Shang...

引用

3rd Asia-Pacific conference on Image processing, Electronics and Computers, IPEC 2022

作者： Yang, Chun Ou, Kaiman Hong, Shaoyong School of Accounting Guangzhou Huashang College Guangzhou China School of Data Science Guangzhou Huashang College Guangzhou China

ISBN: (纸本)9781450395786

With the development of computer software and hardware system, machine learning methods are more and more used in various industries of social development. In the aspect of stock index prediction, the current prediction method has gradually changed from the traditional statistical analysis method to the artificial intelligence analysis method. Based on the original sample data, this paper uses support vector machine regression (SVR) model to predict the opening price of Shanghai stock index. The parameters of SVR model are optimized and debugged by grid search method (grid), particle swarm optimization (PSO) and genetic algorithm (GA). The analysis results show that the three types of support vector machine prediction models based on the original sample data can fully reflect the time-varying law of stock index and have high prediction accuracy. Among them, genetic algorithm support vector machine regression (GA-SVR) model shows that the minimum root mean square error (RMSE) is 14.730 and the minimum average absolute percentage error (MAPE) is 0.375%. GA-SVR model has good prediction effect and has certain significance for the prediction of stock price. © 2022 ACM.

关键词： Genetic algorithms

来源：评论

学校读者我要写书评

暂无评论

LLM-PBE: Assessing data Privacy in Large Language models 50th

LLM-PBE: Assessing Data Privacy in Large Language Models

引用

50th international conference on Very Large data Bases, VLDB 2024

作者： Li, Qinbin Hong, Junyuan Xie, Chulin Tan, Jeffrey Xin, Rachel Hou, Junyi Yin, Xavier Wang, Zhun Hendrycks, Dan Wang, Zhangyang Li, Bo He, Bingsheng Song, Dawn University of California Berkeley United States University of Texas at Austin United States University of Illinois Urbana-Champaign United States National University of Singapore Singapore Center for AI Safety United States University of Chicago United States

Large Language models (LLMs) have become integral to numerous domains, significantly advancing applications in data management, mining, and analysis. Their profound capabilities in processing and interpreting complex language data, however, bring to light pressing concerns regarding data privacy, especially the risk of un intentional training data leakage. Despite the critical nature of this issue, there has been no existing literature to offer a comprehensive assessment of data privacy risks in LLMs. Addressing this gap, our paper introduces LLM-PBE, a toolkit crafted specifically for the systematic evaluation of data privacy risks in LLMs. LLM-PBE is designed to analyze privacy across the entire lifecycle of LLMs, incorporating diverse attack and defense strategies, and handling various data types and metrics. Through detailed experimentation with multiple LLMs, LLM-PBE facilitates an in-depth exploration of data privacy concerns, shedding light on influential factors such as model size, data characteristics, and evolving temporal dimensions. This study not only enriches the understanding of privacy issues in LLMs but also serves as a vital resource for future research in the field. Aimed at enhancing the breadth of knowledge in this area, the findings, resources, and our full technical report are made available at https://***/, providing an open platform for academic and practical advancements in LLM privacy assessment. © 2024, VLDB Endowment. All rights reserved.

关键词： Information leakage

来源：评论

学校读者我要写书评

暂无评论

A Comprehensive Overview on data Augmentation Techniques for Medical Images

A Comprehensive Overview on Data Augmentation Techniques for...

引用

Electronics and Sustainable Communication Systems (ICESC), 2020 international conference on

作者： Swarajya Madhuri Rayavarapu Tammineni Shanmukha Prasanthi Sasibhushana Rao Gottapu Aruna Singam Department of Electronics and Communication Andhra University college of Engineering India

ISBN: (数字)9798350379945

ISBN: (纸本)9798350379952

A significant number of recent advancements in Deep Learning have significantly benefited from training sets that are both larger and more diversified. Nevertheless, the collection of huge datasets for medical imaging continues to be a challenge due to issues around privacy and the expenses associated with labelling. Through the use of data augmentation, it is feasible to significantly increase the quantity and variety of data that is accessible for training purposes without actually collecting additional samples. data augmentation techniques span from straightforward changes like cropping, padding, and flipping to more complicated generative models. These transformations are surprisingly powerful despite their apparent simplicity. Different data augmentation procedures are likely to function differently depending on the nature of the input and the visual task that is being performed. As a result of this, it is probable that medical imaging calls for particular augmentation algorithms that are capable of producing believable data samples and enabling the successful regularization of deep neural networks. This paper reviews different data augmentation techniques.

关键词： Training Deep learning data privacy Visualization Machine learning algorithms Reviews data augmentation data models Labeling Biomedical imaging

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：