检索结果-内蒙古大学图书馆

A survey of artificial intelligence models for wireless capsule endoscopy videos for superior automatic diagnosis: problems and solutions

引用

Multimedia Tools and Applications 2025年 1-35页

作者： El-Gammal, Eman M. El-Shafai, Walid Taha, Taha E. El-Fishawy, Adel S. Abd El-Samie, Fathi E. Department of Electronics and Electrical Communications Engineering Faculty of Electronic Engineering Menoufia University Menouf32952 Egypt Center of Advanced Software and Biomedical Clinical Engineering Consultations Faculty of Engineering Cairo University Giza Egypt Computer Science Department Prince Sultan University Riyadh11586 Saudi Arabia Department of Information Technology College of Computer and Information Sciences Princess Nourah Bint Abdulrahman University 84428 Riyadh11671 Saudi Arabia

Wireless Capsule Endoscopy (WCE) emerged as an innovative and patient-centric approach for non-invasive and painless examination of the gastrointestinal (GI) tract. It serves as a pivotal tool in helping medical practitioners in the early detection of anomalies within the intricate domain of the human GI tract. The automated identification of aberrations assumes paramount significance, contributing not only to temporal efficiency but also to timely diagnosis. Within the realm of academic literature, a plethora of Artificial Intelligence (AI) methodologies have been proposed to effectuate the automatic classification, segmentation, and synthesis of anomalies inherent in WCE images. This scholarly endeavor undertakes a comprehensive and meticulous survey, elucidating the spectrum of anomaly classification, summarization, and detection techniques harnessed for Computer-Aided Diagnosis (CAD) in the context of WCE images. The survey begins by delineating the methodologies underpinning WCE image classification and video processing. Subsequently, an exhaustive evaluation of techniques for analyzing WCE images is furnished, accompanied by an in-depth exploration of pertinent surveys, in addition to a judicious assessment of their merits and demerits. Furthermore, this study undertakes a rigorous evaluation of prevailing datasets utilized to benchmark the efficacy of various WCE techniques. In summary, this survey serves as a vanguard, delineating promising avenues for future investigations in the realm of WCE image analysis models. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2025.

关键词： Endoscopy

来源：评论

学校读者我要写书评

暂无评论

Gloss: Guiding Large Language Models to Answer Questions from System Logs 31

Gloss: Guiding Large Language Models to Answer Questions fro...

引用

31st IEEE International Conference on software Analysis, Evolution and Reengineering, SANER 2024

作者： Huang, Shaohan Liu, Yi Qi, Jiaxing Shang, Jing Xiao, Zhiwen Fung, Carol Wu, Zhihui Yang, Hailong Luan, Zhongzhi Qian, Depei Sino-German Joint Software Institute Beihang University Beijing China China Mobile Information Technology Center Beijing China Concordia Institute for Information Systems Engineering Concordia University Quebec Canada

ISBN: (纸本)9798350330663

System logs contain valuable information and they have emerged as one of the most crucial data sources for system monitoring aimed at enhancing service quality. IT support teams and system administrators are in dire need of an intelligent log-based QA system to help them quickly identify, diagnose, and resolve issues. In this paper, we propose a novel method for constructing log-based question-answering (QA) data using large language models, addressing challenges associated with limited dataset size and diversity in existing log-based QA systems. Our pipeline consists of three steps: generating questions, answering log questions, and refining question-answer pairs. The purpose of the generating questions is to create a diverse set of log-related queries that cover a wide range of potential issues. The second step, answering log questions, aims to extract relevant information from the logs to address the generated questions. This step ensures accurate and context-aware responses. Refining question-answer pairs is intended to improve the overall quality and consistency of the generated log-based QA data. We present a case study using ChatGPT to generate a new dataset, LogQuAD, containing over 28,000 question-answer pairs derived from more than 31,000 raw logs, representing a significant increase compared to existing datasets like LogQA. In our experimental setting, we sample half of the data as the training set and use memory-effect fine-tuning to fine-tune the model, named Gloss. Experimental results show that our method can generate high-quality log-based QA data, leading to improved performance of log-based QA models. Notably, our fine-tuned 7B model outperforms the LLaMA-65B model. This approach can potentially save valuable time for IT support teams and system administrators, enabling proactive problem resolution and optimal system performance. © 2024 IEEE.

关键词： Large datasets

来源：评论

学校读者我要写书评

暂无评论

Optimizing Existing SHM systems: Retasking as a Self-Healing Solution for Improved Fault Tolerance 7

Optimizing Existing SHM Systems: Retasking as a Self-Healing...

引用

7th IEEE International Conference on Emerging Smart Computing and Informatics, ESCI 2025

作者： Cherian, Aaron Mano Ajitha, D. Gouraha, Avanish Mandal, Dipanshu School of Computer Science and Engineering (SCOPE) Vellore Institute of Technology Tamilnadu Vellore India Department of Software Systems School of Computer Science and Engineering (SCOPE) Vellore Institute of Technology Tamilnadu Vellore India

ISBN: (纸本)9798331515683

This paper focuses on self-healing algorithms in structural health monitoring (SHM) systems centered around the enhancement of resilience and adaptability of the systems. In this study, imports from existing methods (clustering, Fault Tolerant Multiple Redundancy (FTMR) and reinforcement learning) are analyzed against the choice of creating a novel retasking algorithm designed for dynamic resource redistribution and optimal monitoring coverage. Unlike conventional methods, retasking will allow adapting the coverage in real time, whereby system down time will be reduced, with less computational load achieved through task redistribution through functional sensors. Findings showed that retasking improved reliability and scalability of the SHM systems drastically, providing a simple yet powerful resolution towards modern infrastructure monitoring. This study stresses the retasking capability to redefine self-healing in the SHM systems for future directions in infrastructure safety. © 2025 IEEE.

关键词： Clustering Fault Tolerant Multiple Redundancy (FTMR) Real-time monitoring Reinforcement learning Retasking algorithm Self-healing algorithms Structural health monitoring (SHM)

来源：评论

学校读者我要写书评

暂无评论

Optimizing beyond boundaries: empowering the salp swarm algorithm for global optimization and defective software module classification

引用

Neural Computing and Applications 2024年第30期36卷 18727-18759页

作者： Kassaymeh, Sofian Al-Betar, Mohammed Azmi Rjoubd, Gaith Fraihat, Salam Abdullah, Salwani Almasri, Ammar Software Engineering Department Faculty of Information Technology Aqaba University of Technology Aqaba Jordan College of Engineering and Information Technology Ajman University Ajman United Arab Emirates Department of Information Technology Al-Huson University College Al-Balqa Applied University Irbid Jordan Concordia Institute for Information Systems Engineering Concordia University Montreal Canada Artificial Intelligence Department Faculty of Information Technology Aqaba University of Technology Aqaba Jordan Data Mining and Optimization Research Group Center for Artificial Intelligence Technology Universiti Kebangsaan Malaysia Bangi Malaysia Management Information Systems Department Al-Balqa Applied University Amman Jordan

This work presents a new version of the salp swarm optimizer (SSA), called "mSSA," that uses complex mathematical expressions to dynamically manipulate the crucial control parameter (c1) during optimization. These expressions are carefully designed to modulate the shift in search strategy from exploratory to exploitative, improving the flexibility and speed of convergence of the algorithm. To evaluate the performance of the developed mSSA variants, a thorough examination is carried out on twenty-three benchmark test functions alongside their application to the complex task of software module classification. The process of classifying defective software modules involves developing a multilayer perceptron (MLP) classifier that is suited to the particular complexity and heterogeneity of the task. Selecting the best optimizer is made easier by systematically evaluating the different mSSA versions as MLP classifier trainers. Based on metrics like classification accuracy, convergence speed, and avoidance of local minima, a comparative analysis in opposition to six previously published metaheuristic optimizers shows that mSSA3, when combined with the developed MLP classifier, outperforms both other mSSA variations and state-of-the-art metaheuristic optimizers in terms of overall performance. The excellent classification accuracy, swift convergence, and ability to avoid local minima of mSSA3 highlight its superiority and establish it as a cutting-edge method in the application of metaheuristic algorithms. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.

关键词： Global optimization

来源：评论

学校读者我要写书评

暂无评论

Vision DualGNN: Semantic Graph Is Not Only You Need 27th

Vision DualGNN: Semantic Graph Is Not Only You Need

引用

27th International Conference on Pattern Recognition, ICPR 2024

作者： Zheng, Xiaolong Wang, Jianming Xiao, Zhitao Sun, Yukuan School of Software Tiangong University Tianjin China Tianjin Key Laboratory of Autonomous Intelligence Technology and Systems Tiangong University Tianjin China Tianjin Key Laboratory of Optoelectronic Detection Technology and Systems Tiangong University Tianjin China Center for Engineering Intership and Training Tiangong University Tianjin China

ISBN: (纸本)9783031781063

Graph Neural Networks (GNNs) have shown great potential in visual tasks, yet they face challenges in effectively constructing and processing graphs. Vision GNN (ViG) was developed to tackle these issues by segmenting images into patches treated as nodes, with edges formed by connecting the nearest semantic neighbors. However, relying solely on semantic information for graph construction confines itself to a dispersed distribution of object neighbors, leading to inadequate graph processing. To address this issue, we propose Vision DualGNN(VDG), a novel dual graph neural network architecture that leverages both spatial and semantic information to construct and process graph representation of images. We apply a node encoder that transforms image patches into expressive node features. Additionally, we implement a dual-stream GNN that operates on both a spatial graph and a semantic graph. The spatial graph serves as a constraint for the semantic graph, enhancing the node features with spatial awareness. To verify the validity of our architecture, we have conducted our experiments on the ImageNet and CIFAR-100 datasets. And achieved state-of-the-art performance compared to other baseline models. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

关键词： dualgnn graph convolution semantic graph spatial graph

来源：评论

学校读者我要写书评

暂无评论

Optimizing Clustering Approaches in Cloud Environments

International Journal of Interactive Mobile Technologies

引用

International Journal of Interactive Mobile Technologies 2023年第19期17卷 70-94页

作者： Al-Ghuwairi, Abdel-Rahman Al-Fraihat, Dimah Sharrab, Yousef Kreishan, Yazeed Alsarhan, Ayoub Idhaim, Hasan Qahmash, Ayman Department of Software Engineering Faculty of Prince Al-Hussein Bin Abdallah II for Information Technology The Hashemite University Zarqa Jordan Department of Software Engineering Faculty of Information Technology Isra University Amman Jordan Department of Data Science and Artificial Intelligence Faculty of Information Technology Isra University Amman Jordan Department of Information Technology Faculty of Prince Al-Hussein Bin Abdallah II for Information Technology The Hashemite University Zarqa Jordan Department of Information Systems Faculty of Prince Al-Hussein Bin Abdallah II for Information Technology The Hashemite University Zarqa Jordan Department of Information Systems Computer Science College King Khalid University Abha Saudi Arabia

This study focuses on the challenge of developing abstract models to differentiate various cloud resources. It explores the advancements in cloud products that offer specialized services to meet specific external needs. The study proposes a new approach to request processing in clusters, improving downtime, load distribution, and overall performance. A comparison of three clustering approaches is conducted: local single cluster, local multiple clusters, and multiple cloud clusters. Performance, scalability, fault tolerance, resource allocation, availability, and cost-effectiveness are evaluated through experiments with 50 requests. All three approaches achieve a 100% success rate, but processing times vary. The local single cluster has the longest duration, while the local multiple clusters and multiple cloud clusters perform better and offer faster processing, scalability, fault tolerance, and availability. From a cost perspective, the local single cluster and local multiple clusters incur capital and operational expenses, while the multiple cloud clusters follow a pay-as-you-go model. Overall, the local multiple clusters and multiple cloud clusters outperform the local single cluster in terms of performance, scalability, fault tolerance, resource allocation, availability, and cost-effectiveness. These findings provide valuable insights for selecting appropriate clustering strategies in cloud environments. © 2023 by the authors of this article. Published under CC-BY.

关键词： Fault tolerance

来源：评论

学校读者我要写书评

暂无评论

Enhancing Soft Skills in Autistic Children: The Next Generation of Mind Champ's Technological Approach

Enhancing Soft Skills in Autistic Children: The Next Generat...

引用

Advancements in Computing (ICAC), International Conference on

作者： Kalna Peiris Ruchira Nelaka Kalpani Manathunga Department of Computer Systems Engineering Department of Information Technology Department of Computer Science and Software Engineering

ISBN: (数字)9798331517878

ISBN: (纸本)9798331517885

Autism Spectrum Disorder (ASD) significantly impacts a child's ability to navigate social interactions, regulate emotions, and develop adaptive skills crucial for daily functioning. While various interventions exist to address cognitive and academic skills, the development of soft skills such as communication, emotional regulation, social interaction, and creativity remains an under explored area. This paper builds upon the foundational work of the Mind Champ platform, which originally targeted both learning and soft skills, by focusing exclusively on enhancing the soft skills development of autistic children. This enhanced version of the Mind Champ platform leverages advanced behavioral and emotional analysis techniques to offer a comprehensive technological solution. Through interactive activities centered on painting and music, the platform creates a structured, engaging, and supportive environment tailored to the unique needs of children with ASD. By prioritizing emotional engagement and creative expression, the platform empowers children to improve their social abilities, emotional regulation, and adaptability. The results from our continued research indicate that these technology-driven interventions contribute significantly to the holistic development of soft skills in autistic children, providing them with valuable tools to navigate social environments more effectively.

关键词： Autism Navigation Variable speed drives Motors Regulation Real-time systems Creativity Next generation networking Painting Resilience

来源：评论

学校读者我要写书评

暂无评论

Audio-Text Multimodal Speech Recognition via Dual-Tower Architecture for Mandarin Air Traffic Control Communications

引用

Computers, Materials & Continua 2024年第3期78卷 3215-3245页

作者： Shuting Ge Jin Ren Yihua Shi Yujun Zhang Shunzhi Yang Jinfeng Yang School of Computer Science and Software Engineering University of Science and Technology LiaoningAnshan114051China Institute of Applied Artificial Intelligence of the Guangdong-Hong Kong-Macao Greater Bay Area Shenzhen Polytechnic UniversityShenzhen518055China Shenzhen Institutes of Advanced Technology Chinese Academy of SciencesShenzhen518055China Industrial Training Centre Shenzhen Polytechnic UniversityShenzhen518055China

In air traffic control communications (ATCC), misunderstandings between pilots and controllers could result in fatal aviation accidents. Fortunately, advanced automatic speech recognition technology has emerged as a promising means of preventing miscommunications and enhancing aviation safety. However, most existing speech recognition methods merely incorporate external language models on the decoder side, leading to insufficient semantic alignment between speech and text modalities during the encoding phase. Furthermore, it is challenging to model acoustic context dependencies over long distances due to the longer speech sequences than text, especially for the extended ATCC data. To address these issues, we propose a speech-text multimodal dual-tower architecture for speech recognition. It employs cross-modal interactions to achieve close semantic alignment during the encoding stage and strengthen its capabilities in modeling auditory long-distance context dependencies. In addition, a two-stage training strategy is elaborately devised to derive semantics-aware acoustic representations effectively. The first stage focuses on pre-training the speech-text multimodal encoding module to enhance inter-modal semantic alignment and aural long-distance context dependencies. The second stage fine-tunes the entire network to bridge the input modality variation gap between the training and inference phases and boost generalization performance. Extensive experiments demonstrate the effectiveness of the proposed speech-text multimodal speech recognition method on the ATCC and AISHELL-1 datasets. It reduces the character error rate to 6.54% and 8.73%, respectively, and exhibits substantial performance gains of 28.76% and 23.82% compared with the best baseline model. The case studies indicate that the obtained semantics-aware acoustic representations aid in accurately recognizing terms with similar pronunciations but distinctive semantics. The research provides a novel modeling pa

关键词： Speech-text multimodal automatic speech recognition semantic alignment air traffic control communications dual-tower architecture

来源：评论

学校读者我要写书评

暂无评论

F-IKOS: An Abstract Interpretation-based Static Analyzer for Fortran Programs 12

F-IKOS: An Abstract Interpretation-based Static Analyzer for...

引用

12th International Workshop on Quantitative Approaches to software Quality, QuASoQ 2024

作者： Zou, Sheng Chen, Liqian Fan, Guangsheng Huang, Renjie Yin, Banghu College of Computer Science and Technology National University of Defense Technology Changsha410073 China State Key Laboratory of Complex & Critical Software Environment Changsha410073 China College of Systems Engineering National University of Defense Technology Changsha410073 China

The Fortran programming language is widely utilized in numerical computation and scientific computing. Fortran programs are prone to potential runtime errors related to numerical properties due to the large number of numerical operations. In this paper, we present F-IKOS, an abstract interpretation-based static analyzer for Fortran programs on top of IKOS, which soundly handles floating-point types in Fortran programs. Firstly, we translate Fortran programs to LLVM IR using compiler front-end Flang. After that, we extend IKOS to support sound floating-point analysis and then employ it to analyze the translated LLVM IR. Particularly, when analyzing floating-point types in programs, we first abstract floating-point expressions into real-number expressions with interval coefficients, and then linearize these expressions into real-number expressions with scalar coefficients. These linear expressions are subsequently handled by abstract domains originally designed for real-number types to produce sound analysis results. We have conducted experiments on representative Fortran programs to show the efficiency and effectiveness of F-IKOS. The experimental results are encouraging: F-IKOS soundly analyzes runtime errors in complex programs, outperforming other analyzers. © 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).

关键词： FORTRAN (programming language)

来源：评论

学校读者我要写书评

暂无评论

Bluetooth Vulnerabilities and Security 7

Bluetooth Vulnerabilities and Security

引用

7th IEEE International Conference on Emerging Smart Computing and Informatics, ESCI 2025

作者： Mareeswari, V. Vijayan, R. Shukla, Ashish Department of Software and Systems Engineering School of Computer Science Engineering and Information Systems (SCORE) Vellore Institute of Technology (VIT) Vellore India Department of Information Technology School of Computer Science Engineering and Information Systems (SCORE) Vellore Institute of Technology (VIT) Vellore India School of Computer Science Engineering and Information Systems (SCORE) Vellore Institute of Technology (VIT) Vellore India

ISBN: (纸本)9798331515683

Bluetooth technology, which facilitates wireless communication between Billions of devices including smartphones, tablets, laptops, and Internet of Thing (IoT) devices, is a cornerstone of modern connectivity. Its importance lies in its ability to enable seamless data exchange and interaction across a wide range of applications, from personal gadgets to complex industrial systems. Despite its widespread adoption, Bluetooth is not immune to critical security vulnerabilities. Issues like Bluetooth Low Energy (BLE) vulnerabilities and denial-of-service (DoS) attacks can overwhelm devices with excessive traffic, while BLE Forced Connection can lead to unauthorized access, and eavesdropping exposes sensitive data being exchanged. We have discussed these vulnerabilities in detail, examining their mechanisms, potential impacts, and various implementation aspects. Additionally, we have added demonstrations to illustrate how these attacks can be executed and the potential consequences. To prevent these threats, we explored essential measures, including regular firmware updates, secure pairing protocols, and careful management of Bluetooth settings. Adopting best practices like disabling Bluetooth when not in use and monitoring connected devices can further enhance security, ensuring the reliable and safe operation of the vast and growing ecosystem of Bluetooth-enabled devices. © 2025 IEEE.

关键词： Bluetooth Low Energy (BLE) Bluetooth Vulnerabilities Denial of Service (DoS) PIN Bypass Security Risks

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：