检索结果-内蒙古大学图书馆

Automated File Labeling for Heterogeneous Files Organization Using Machine Learning

computers, Materials & Continua 2023年第2期74卷 3263-3278页

作者： Sagheer Abbas Syed Ali Raza MAKhan Muhammad Adnan Khan Atta-ur-Rahman Kiran Sultan Amir Mosavi School of Computer Science National College of Business Administration&EconomicsLahore54000Pakistan Department of Computer Science GC University LahorePakistan Riphah School of Computing&Innovation Faculty of ComputingRiphah International UniversityLahore CampusLahore54000Pakistan Department of Software Pattern Recognition and Machine Learning LabGachon UniversitySeongnam13120Korea Department of Computer Science College of Computer Science and Information Technology(CCSIT)Imam Abdulrahman Bin Faisal University(IAU)P.O.Box 1982Dammam31441Saudi Arabia Department of CIT The Applied CollegeKing Abdulaziz UniversityJeddah31261Saudi Arabia John von Neumann Faculty of Informatics Obuda UniversityBudapest1034Hungary Institute of Information Engineering Automation and MathematicsSlovak University of Technology in BratislavaBratislava81107Slovakia Faculty of Civil Engineering TU-DresdenDresden01062Germany

File labeling techniques have a long history in analyzing the anthological trends in computational *** situation becomes worse in the case of files downloaded into systems from the ***,most users either have to change file names manually or leave a meaningless name of the files,which increases the time to search required files and results in redundancy and duplications of user ***,no significant work is done on automated file labeling during the organization of heterogeneous user files.A few attempts have been made in topic ***,one major drawback of current topic modeling approaches is better *** rely on specific language types and domain similarity of the *** this research,machine learning approaches have been employed to analyze and extract the information from heterogeneous corpus.A different file labeling technique has also been used to get the meaningful and`cohesive topic of the *** results show that the proposed methodology can generate relevant and context-sensitive names for heterogeneous data files and provide additional insight into automated file labeling in operating systems.

关键词： Automated file labeling file organization machine learning topic modeling

来源：评论

学校读者我要写书评

暂无评论

Cardiac Arrhythmia Disease Classifier Model Based on a Fuzzy Fusion Approach

引用

computers, Materials & Continua 2023年第5期75卷 4485-4499页

作者： Fatma Taher Hamoud Alshammari Lobna Osman Mohamed Elhoseny Abdulaziz Shehab Eman Elayat College of Technological Innovation Zayed UniversityDubaiUAE Department of Information Systems College of Computer and Information SciencesJouf UniversitySakakaSaudi Arabia Department of Electronics and Communications Engineering Delta Higher Institute for Engineering&TechnologyMansouraEgypt College of Computing and Informatics University of SharjahSharjahUnited Arab Emirates Department of Information Systems Mansoura UniversityMansoura35516Egypt Department of Teacher Preparation Faculty of Specific EducationMansoura UniversityMansouraEgypt

Cardiac diseases are one of the greatest global health *** to the high annual mortality rates,cardiac diseases have attracted the attention of numerous researchers in recent *** article proposes a hybrid fuzzy fusion classification model for cardiac arrhythmia *** fusion model is utilized to optimally select the highest-ranked features generated by a variety of well-known feature-selection *** ensemble of classifiers is then applied to the fusion’s *** proposed model classifies the arrhythmia dataset from the University of California,Irvine into normal/abnormal classes as well as 16 classes of ***,at the preprocessing steps,for the miss-valued attributes,we used the average value in the linear attributes group by the same class and the most frequent value for nominal ***,in order to ensure the model optimality,we eliminated all attributes which have zero or constant values that might bias the results of utilized *** preprocessing step led to 161 out of 279 attributes(features).Thereafter,a fuzzy-based feature-selection fusion method is applied to fuse high-ranked features obtained from different heuristic feature-selection *** short,our study comprises three main blocks:(1)sensing data and preprocessing;(2)feature queuing,selection,and extraction;and(3)the predictive *** proposed method improves classification performance in terms of accuracy,F1measure,recall,and precision when compared to state-of-the-art *** achieves 98.5%accuracy for binary class mode and 98.9%accuracy for categorized class mode.

关键词： Cardiac arrhythmia preprocessing missing values classification model fusion

来源：评论

学校读者我要写书评

暂无评论

VILMA: A ZERO-SHOT BENCHMARK FOR LINGUISTIC AND TEMPORAL GROUNDING IN VIDEO-LANGUAGE MODELS 12

VILMA: A ZERO-SHOT BENCHMARK FOR LINGUISTIC AND TEMPORAL GRO...

引用

12th International Conference on Learning Representations, ICLR 2024

作者： Kesen, Ilker Pedrotti, Andrea Dogan, Mustafa Cafagna, Michele Acikgoz, Emre Can Parcalabescu, Letitia Calixto, Iacer Frank, Anette Gatt, Albert Erdem, Aykut Erdem, Erkut Koç University KUIS AI Center Turkey Koç University Department of Computer Engineering Turkey University of Pisa Department of Computer Science Italy Institute of Information Science and Technologies Italian National Council of Research Italy Hacettepe University Department of Computer Engineering Turkey Aselsan Research Turkey University of Malta Institute of Linguistics and Language Technology Malta Heidelberg University Department of Computational Linguistics Germany Amsterdam UMC University of Amsterdam Department of Medical Informatics Netherlands Amsterdam Public Health Methodology & Mental Health Amsterdam Netherlands Utrecht University Department of Information and Computing Sciences Netherlands

With the ever-increasing popularity of pretrained Video-Language Models (VidLMs), there is a pressing need to develop robust evaluation methodologies that delve deeper into their visio-linguistic capabilities. To address this challenge, we present VILMA), a task-agnostic benchmark that places the assessment of fine-grained capabilities of these models on a firm footing. Task-based evaluations, while valuable, fail to capture the complexities and specific temporal aspects of moving images that VidLMs need to process. Through carefully curated counterfactuals, VILMA offers a controlled evaluation suite that sheds light on the true potential of these models, as well as their performance gaps compared to human-level understanding. VILMA also includes proficiency tests, which assess basic capabilities deemed essential to solving the main counterfactual tests. We show that current VidLMs' grounding abilities are no better than those of vision-language models which use static images. This is especially striking once the performance on proficiency tests is factored in. Our benchmark serves as a catalyst for future research on VidLMs, helping to highlight areas that still need to be explored. © 2024 12th International Conference on Learning Representations, ICLR 2024. All rights reserved.

关键词： Computational linguistics

来源：评论

学校读者我要写书评

暂无评论

Using Machine Learning to Identify and Categorize Personally Identifiable Information and Payment Card Industry Data in Textual Content

Using Machine Learning to Identify and Categorize Personally...

引用

Advanced Research in Computing (ICARC), International Conference on

作者： Milinda Arambawela Achala Aponso Computer Science and Engineering University of Westminster London UK Department of Computing Informatics Institute of Technology Colombo Sri Lanka

The advent of the Internet has significantly stream-lined daily tasks through the rapid increase of online services. Everyday activities, such as purchasing goods and scheduling appointments with healthcare professionals, have become more speedy, efficient and user-friendly with the integration of the Internet. The continuous improvement of online services has led to many people moving towards digital activities. As a result, it has heightened the recording of personal and payment transaction data across various storage mediums, including databases and log files. The protection and regulation of this sensitive data are imperative, aligning with the guidelines outlined in GDPR and PCI-DSS compliances. Recognizing exposed personal data poses a considerable challenge. This research introduces a novel approach to identifying payment card industry data (PCI) and personally identifiable information (PII). The research project proposes a machine learning-based text classification model utilizing the Convolutional Neural Network (CNN) model to discern PII and PCI data within a given text. The CNN model has been constructed and compared against Naive Bayes, Gradient Boost, Random Forest, and Support Vector Machine (SVM) models. The CNN model achieved the highest accuracy at 0.96 (96%). Additionally, the F1 scores for each class were significant, with PII scoring 0.94, PCI scoring 0.95, and Normal scoring 0.99. Following the model's construction and training, it was employed with the saved tokenizer's word indexes and label encoders in the developed classification tool. This tool successfully delivered the promised results, identifying exposed PII and PCI data.

关键词：

来源：评论

学校读者我要写书评

暂无评论

KnowSOntoWSR: Web Service Recommendation System Using Semantically Driven QoS Ontology-Based Knowledge-Centred Paradigm 1

引用

7th International Conference on Internet of Things and Connected Technologies, ICIoTCT 2022

作者： Dhanvardini, R. Deepak, Gerard Priyadarshini, J. Sheeba Santhanavijayan, A. Healthcare Informatics Division Optum UnitedHealth Groups Hyderabad India Department of Computer Science Engineering National Institute of Technology Tiruchirappalli India Department of Data Science Manipal Institute of Technology Bengaluru Bengaluru India Department of Data Science Manipal Academy of Higher Education Manipal India Bangalore India

ISBN: (数字)9789811997198

ISBN: (纸本)9789811997181

Web services have significantly expanded and become a key enabling technology for online data, application and resource sharing. Designing new methods for efficient and reliable web service recommendation has been of tremendous importance with the growing usage and prominence of web services. It would be ideal for a system to suggest online services that are in line with consumers’ preferences without requesting specific query information from them. Quality of Service (QoS) is vital for characterising non-functional aspects of Web services as they become more prevalent and widely used on the World Wide Web. The KnowSOntoWSR framework, which is built on a knowledge-driven and semantically inclined model that adheres to QoS ontology, is proposed in this research. AWS and WebSphere are employed as knowledge tags, and the powerful machine learning classifier XGBoost is applied. The features and recommendations are computed using the Twitter semantic similarity. The proposed framework outperforms the baseline models’ estimates with an accuracy of 95.94% and average F-measure of 95.93%. © 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

关键词： Quality of service

来源：评论

学校读者我要写书评

暂无评论

What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding 41

What Improves the Generalization of Graph Transformers? A Th...

引用

41st International Conference on Machine Learning, ICML 2024

作者： Li, Hongkang Wang, Meng Ma, Tengfei Liu, Sijia Zhang, Zaixi Chen, Pin-Yu Department of Electrical Computer and System Engineering Rensselaer Polytechnic Institute TroyNY United States Department of Biomedical Informatics Stony Brook University Stony BrookNY United States Department of Computer Science and Engineering Michigan State University East LansingMI United States MIT-IBM Watson AI Lab IBM Research MA United States Department of Computer Science and Engineering University of Science and Technology of China Anhui Hefei China IBM Thomas J. Watson Research Center Yorktown HeightsNY United States

Graph Transformers, which incorporate self-attention and positional encoding, have recently emerged as a powerful architecture for various graph learning tasks. Despite their impressive performance, the complex non-convex interactions across layers and the recursive graph structure have made it challenging to establish a theoretical foundation for learning and generalization. This study introduces the first theoretical investigation of a shallow Graph Transformer for semi-supervised node classification, comprising a self-attention layer with relative positional encoding and a two-layer perceptron. Focusing on a graph data model with discriminative nodes that determine node labels and non-discriminative nodes that are class-irrelevant, we characterize the sample complexity required to achieve a desirable generalization error by training with stochastic gradient descent (SGD). This paper provides the quantitative characterization of the sample complexity and number of iterations for convergence dependent on the fraction of discriminative nodes, the dominant patterns, and the initial model errors. Furthermore, we demonstrate that self-attention and positional encoding enhance generalization by making the attention map sparse and promoting the core neighborhood during training, which explains the superior feature representation of Graph Transformers. Our theoretical results are supported by empirical experiments on synthetic and real-world benchmarks. Copyright 2024 by the author(s)

关键词： Stochastic systems

来源：评论

学校读者我要写书评

暂无评论

Robust Transmission Design in Multiobjective RIS-Aided SWIPT IoT Communications

引用

IEEE Internet of Things Journal 2024年第10期11卷 18605-18618页

作者： Sharma, Vaibhav Allu, Raviteja Singh, Sandeep Kumar Singh, Keshav Duong, Trung Q. Tsiftsis, Theodoros A. Institute of Communications Engineering National Sun Yat-sen University Kaohsiung804 Taiwan Motilal Nehru National Institute of Technology Allahabad Electronics and Communication Engineering Department Prayagraj211004 India Memorial University of Newfoundland Faculty of Engineering and Applied Science St. John'sNLA1B 3X5 Canada Queen's University Belfast School of Electronics Electrical Engineering and Computer Science BelfastBT7 1NN United Kingdom University of Thessaly Department of Informatics and Telecommunications Lamia35100 Greece

This work investigates the performance of simultaneous wireless information and power transfer (SWIPT) in a reconfigurable intelligent surface (RIS)-aided Internet of Things (IoT) communications under imperfect channel state information (CSI). We formulate a multiobjective optimization problem (MOOP) to design a transmit precoding vector (TPV) at the base station (BS) and phase shift matrix (PSM) at the RIS that jointly maximizes energy efficiency (EE) and harvested power (HP) under the norm-bounded CSI error model. Due to the conflicting objective functions and nonconvex nature of the above optimization problem, the MOOP is simplified using the ϵ-constraint method and subsequently adopting advanced optimization tools, such as the Dinkelbach method, S-procedure, general sign-definiteness, semidefinite programming and convex-concave procedure. Thereafter, we propose an alternating optimization-based algorithm which determines optimal TPV and PSM iteratively that jointly maximizes the EE and HP of the considered system. Through numerical simulations, we validate the robustness, optimality, convergence, accuracy, and effectiveness of our proposed algorithm. Furthermore, we assess the impact of several key parameters, such as the number of RIS elements, available transmit power at BS, and the minimum HP on the performance of the considered system. © 2014 IEEE.

关键词： Energy efficiency

来源：评论

学校读者我要写书评

暂无评论

Dynamic Disaster Management with Real-Time IoT Data Analysis and Response

Dynamic Disaster Management with Real-Time IoT Data Analysis...

引用

2024 IEEE International Conference on Automation and Computation, AUTOCOM 2024

作者： Dankan Gowda, V. Sharma, Avinash Prasad, K.D.V. Saxena, Rini Barua, Tarkeshwar Mohiuddin, Khalid Bms Institute of Technology and Management Department of Electronics and Communication Engineering Karnataka Bangalore India Chandigarh Engineering College Chandigarh Group of Colleges-Jhanjeri Department of Computer Science and Engineering Punjab Mohali140307 India Symbiosis Institute of Business Management Hyderabad India Pune India King Khalid University College of Business Department of Business Informatics PO Box-3247 Abha61471 Saudi Arabia

ISBN: (纸本)9798350382723

As natural and manmade disasters grew in number, as a result the problem of how to quickly and effectively respond to disaster has become fresh. This is precisely the purpose of this research: Using IoT technologies and powerful data analysis techniques, to integrate them into existing disaster management systems. The beginning of the article contains a broadcasted statement to the effect that traditional methods are insufficient, and real-time data and preemptive response systems are needed. With the application of Internet of Things, an integrated system is proposed in which countless types of information such as weather conditions and seismic activity are gathered by sensors and actuators. Advanced machine learning algorithms and predictive modeling are used to analyze the gathered data. This allows us to make real-time decisions. The design and construction of an IoT-based disaster management system is the methodology behind the research. In particular, we will evaluate how effective it is at reducing response times and increasing overall resilience to disasters. The results show a high efficiency in response, which reflects the feasibility of the method. Finally, the paper discusses the problems encountered in implementing IoT and advanced data analysis of disaster risk and suggests future research avenues. There is no doubt that they will change the present disaster management practice forever. © 2024 IEEE.

关键词： Remote sensing

来源：评论

学校读者我要写书评

暂无评论

Performing Aerobatic Maneuver with Imitation Learning 23rd

Performing Aerobatic Maneuver with Imitation Learning

引用

23rd International Conference on Computational Science, ICCS 2023

作者： Freitas, Henrique Camacho, Rui Silva, Daniel Castro Department of Informatics Engineering Faculty of Engineering University of Porto Rua Dr. Roberto Frias s/n. Porto4200-465 Portugal LIAAD/INESC TEC - Artificial Intelligence and Decision Support Laboratory/Institute for Systems and Computer Engineering Technology and Science Porto Portugal LIACC - Artificial Intelligence and Computer Science Laboratory Porto Portugal

ISBN: (纸本)9783031359941

The work reported in this article addresses the challenge of building models for non-trivial aerobatic aircraft maneuvers in an automated fashion. It is built using a Behavioural Cloning approach where human pilots provide a set of example maneuvers used by a Machine Learning algorithm to induce a control model for each maneuver. The best examples for each maneuver were selected using a set of objective evaluation metrics. Using those example sets, robust models were induced that could replicate (and in some cases outperform) the human pilots that provided the examples (the clean-up effect). Complete complex maneuvers were performed using a meta-controller capable of sequencing the basic ones learned by imitation. This endeavor was rewarded by the results that show several Machine Learning models capable of performing highly complex aircraft maneuvers. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.

关键词： Cloning

来源：评论

学校读者我要写书评

暂无评论

Improving bot detection of fake followers on Twitter via a hybrid B-HC optimisation algorithm

Improving bot detection of fake followers on Twitter via a h...

引用

International Conference on Electronics, Communication and Computing Technologies (ICECCT)

作者： Haider Alzeyadi Fecir Duran Department of Computer Science Informatics Institute Gazi University Ankara Turkey Department of Computer Engineering Engineering Faculty of Technology Gazi University Ankara Turkey

Sophisticated cyber threats are seen on Online Social Networks (OSNs) social media accounts automated to imitate human behaviours has an impactful effect on distorting public thoughts and opinions. OSNs are weaponized to diffuse deception, misinformation, and malicious activities, that forms a serious threat to society. The deceptive nature of imitating human behaviour has become a challenging and crucial task to detect automated accounts (socialbots). This research, however, proposes a hybrid metaheuristic optimisation algorithm for socialbot detection. Specifically, a hybrid B-Hill Climbing (B-HC) optimisation algorithm works in tandem with a k-NN nearest neighbour classifier to accurately select a relevant feature subset. It is applied to be tested for fake followers account on Twitter data. Experimental results showed that the proposed method is better than the traditional and the latest feature selection techniques as well as the rule-set methods. The B-HC alongside with k-NN method achieved promising results using only relevant feature subset.

关键词：

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：