检索结果-内蒙古大学图书馆

COMPrompter: reconceptualized segment anything model with multiprompt network for camouflaged object detection

Science China(Information Sciences) 2025年第1期68卷 189-203页

作者： Xiaoqin ZHANG Zhenni YU Li ZHAO Deng-Ping FAN Guobao XIAO Zhejiang Province Key Laboratory of Intelligent Informatics for Safety and Emergency Wenzhou University Nankai International Advanced Research Institute (SHENZHEN FUTIAN) College of Computer Science Nankai University School of Computer Science and Technology Tongji University

We rethink the segment anything model(SAM) and propose a novel multiprompt network called COMPrompter for camouflaged object detection(COD). SAM has zero-shot generalization ability beyond other models and can provide an ideal framework for COD. Our network aims to enhance the single prompt strategy in SAM to a multiprompt strategy. To achieve this, we propose an edge gradient extraction module, which generates a mask containing gradient information regarding the boundaries of camouflaged objects. This gradient mask is then used as a novel boundary prompt, enhancing the segmentation process. Thereafter, we design a box-boundary mutual guidance module, which fosters more precise and comprehensive feature extraction via mutual guidance between a boundary prompt and a box prompt. This collaboration enhances the model's ability to accurately detect camouflaged objects. Moreover, we employ the discrete wavelet transform to extract high-frequency features from image embeddings. The high-frequency features serve as a supplementary component to the multiprompt ***, our COMPrompter guides the network to achieve enhanced segmentation results, thereby advancing the development of SAM in terms of COD. Experimental results across COD benchmarks demonstrate that COMPrompter achieves a cutting-edge performance, surpassing the current leading model by an average positive metric of 2.2% in COD10K. In the specific application of COD, the experimental results in polyp segmentation show that our model is superior to top-tier methods as well. The code will be made available at https://***/guobaoxiao/COMPrompter.

关键词： segment anything model camouflaged object detection boundary prompt

来源：评论

学校读者我要写书评

暂无评论

MalAware: A tabletop exercise for malware security awareness education and incident response training

引用

Internet of Things and Cyber-Physical Systems 2024年第1期4卷 280-292页

作者： Angafor, Giddeon Yevseyeva, Iryna Maglaras, Leandros School of Computer Science and Informatics De Montfort University Leicester United Kingdom School of Computing Edinburgh Napier University Edinburgh United Kingdom

Advancements in technology, including the Internet of Things (IoT) revolution, have enabled individuals and businesses to use systems and devices that connect, exchange data, and provide real-time information from far and near. Despite that, this interconnectivity and data sharing between systems and devices over the internet poses security and privacy risks as threat actors can intercept, steal, and use owners’ data for nefarious purposes. This paper discusses ’MalAware’, a ‘Malware Awareness Education’ and incident response (IR) scenario-based tabletop exercise and card game for malware threat mitigation training. It introduces the importance of incident management, highlights the dangers posed by malware for connected systems, and outlines the role of tabletop games and exercises in helping businesses mature their malware incident response capabilities. The study discusses the design of MalAware and summarises the results of 2 pilots undertaken to assess the concept, maintaining that the results highlighted the value of ‘MalAware’ as an essential tool to help students and staff master how to mitigate security threats caused by malware. It argues that MalAware can assist businesses in their IR preparedness endeavors, enabling incident management teams to review plans and processes to ensure they are fit for purpose. It enables staff to leverage scenario-based and simulated security breach examples, including role-play, to establish appropriate malware defences. MalAware's practical hands-on exercises can assist trainees in gaining essential malware and other threat mitigation skills, helping to protect the security and privacy of IoTs. © 2024 The Authors

关键词： Malware

来源：评论

学校读者我要写书评

暂无评论

IoT: Communication protocols and security threats

引用

Internet of Things and Cyber-Physical Systems 2023年第1期3卷 1-13页

作者： Gerodimos, Apostolos Maglaras, Leandros Ferrag, Mohamed Amine Ayres, Nick Kantzavelou, Ioanna School of Computer Science and Informatics University of Thessaly Lamia Greece School of Computing at Edinburgh Napier University Edinburgh United Kingdom Technology Innovation Institute Abu Dhabi United Arab Emirates School of Computer Science and Informatics De Montfort University Leicester United Kingdom School of Engineering Dept.of Informatics and Computer Engineering University of West Attica Athens Greece

In this study, we review the fundamentals of IoT architecture and we thoroughly present the communication protocols that have been invented especially for IoT technology. Moreover, we analyze security threats, and general implementation problems, presenting several sectors that can benefit the most from IoT development. Discussion over the findings of this review reveals open issues and challenges and specifies the next steps required to expand and support IoT systems in a secure framework. © 2023 The Authors

关键词： Network architecture

来源：评论

学校读者我要写书评

暂无评论

Few-Shot Transfer Learning for Deep Reinforcement Learning on Robotic Manipulation Tasks 25th

Few-Shot Transfer Learning for Deep Reinforcement Learning ...

引用

25th Annual Conference on Towards Autonomous Robotic Systems, TAROS 2024

作者： He, Yuanzhi Wallbridge, Christopher D. Hernndez, Juan D. Colombo, Gualtiero B. School of Computer Science and Informatics Cardiff University Cardiff United Kingdom

ISBN: (纸本)9783031720611

Robot manipulation with simulation has become a mainstream approach in the robotics field recently. It entails lower risk and cost compared to direct training a real robot. Various physics engines, such as MuJoCo, offer simulated environments tailored for robot manipulation tasks. As the robotics field rapidly grows, model complexity and training times increase exponentially to meet the demands of diverse tasks. Solving this is challenging as it requires complex models and long training times. Deep Reinforcement Learning (DRL) is the current best-performing way to solve robot manipulation problems. However, although certain algorithms utilized automated curriculum learning to tackle multi-task robot manipulation problems, the models were still too complex to be solved with one training from scratch with acceptable accuracy and reasonable training time. To address this, we introduce a novel few-shot Transfer Learning (TL) technique for DRL that applies both Forward Transfer (FT) and Reverse Transfer (RT). TL facilitates breaking down a complex problem into easier-to-solve sub-problems and transferring the acquired knowledge to more complex ones. Our TL method appears able to accelerate the training process for all the MuJoCo Fetch tasks, while even improving performance by 20% and accelerating 85% for the most complex FetchSlide environment. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

关键词： Deep reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

CID at RRG24: Attempting in a Conditionally Initiated Decoding of Radiology Report Generation with Clinical Entities 23

CID at RRG24: Attempting in a Conditionally Initiated Decodi...

引用

23rd Meeting of the ACL Special Interest Group on Biomedical Natural Language Processing, BioNLP 2024

作者： Liao, Yuxiang Liang, Yuanbang Qin, Yipeng Liu, Hantao Spasić, Irena School of Computer Science and Informatics Cardiff University United Kingdom

ISBN: (纸本)9798891761308

Radiology Report Generation (RRG) seeks to leverage deep learning techniques to automate the reporting process of radiologists. Current methods are typically modelling RRG as an image-to-text generation task that takes X-ray images as input and generates textual reports describing the corresponding clinical observations. However, the wording of the same clinical observation could have been influenced by the expression preference of radiologists. Nevertheless, such variability can be mitigated by normalizing textual reports into structured representations such as a graph structure. In this study, we attempt a novel paradigm for incorporating graph structural data into the RRG model. Our approach involves predicting graph labels based on visual features and subsequently initiating the decoding process through a template injection conditioned on the predicted labels. We trained and evaluated our model on the BioNLP 2024 Shared Task on Large-Scale Radiology Report Generation and submitted our results to the ViLMedic RRG leaderboard. Although our model showed a moderate ranking on the leaderboard, the results provide preliminary evidence for the feasibility of this new paradigm, warranting further exploration and refinement.. ©2024 Association for Computational Linguistics.

关键词： Medical imaging

来源：评论

学校读者我要写书评

暂无评论

The Score Reveal Problem: How do We Maximise Entertainment? 25th

The Score Reveal Problem: How do We Maximise Entertainment?

引用

25th International Conference on Principles and Practice of Multi-Agent Systems, PRIMA 2024

作者： Fowler, Aric Booth, Richard School of Computer Science and Informatics Cardiff University Cardiff United Kingdom

ISBN: (纸本)9783031773662

In many elections or competitions, a set of voters assign points to the candidates in a way that indicates their preferences, with the winning candidate being the candidate with the highest total score. When it comes to revealing the result after all votes have been cast, some competitions proceed by having a roll call where each voter announces their vote in turn. This is often done for entertainment purposes, leading to the introduction of the score reveal problem: Which ordering of the voters should be chosen to maximise the entertainment value of the roll call? We define several entertainment measures and consider their properties, motivated by considerations such as avoiding early resolution of the outcome, focusing attention on the leading candidates, and catering towards preferences for surprise or suspense. We compare several approaches for finding optimal solutions, comparing the hardness of doing so with different entertainment measures and voting formats. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

关键词： Combinatorial optimization

来源：评论

学校读者我要写书评

暂无评论

Do Large Language Models Understand Mansplaining? Well, actually... 30

Do Large Language Models Understand Mansplaining? Well, actu...

引用

Joint 30th International Conference on Computational Linguistics and 14th International Conference on Language Resources and Evaluation, LREC-COLING 2024

作者： Perez-Almendros, Carla Camacho-Collados, Jose School of Computer Science & Informatics Cardiff University United Kingdom

ISBN: (纸本)9782493814104

Gender bias has been widely studied by the NLP community. However, other more subtle variations of it, such as mansplaining, have yet received little attention. Mansplaining is a discriminatory behaviour that consists of a condescending treatment or discourse towards women. In this paper, we introduce and analyze Well, actually..., a corpus of 886 mansplaining stories experienced by women. We analyze the corpus in terms of features such as offensiveness, sentiment or misogyny, among others. We also explore to what extent Large Language Models (LLMs) can understand and identify mansplaining and other gender-related microaggressions. Specifically, we experiment with ChatGPT-3.5Turbo and LLaMA-2 (13b and 70b), with both targeted and open questions. Our findings suggest that, although they can identify mansplaining to some extent, LLMs still struggle to point out this attitude and will even reproduce some of the social patterns behind mansplaining situations, for instance by praising men for giving unsolicited advice to women. © 2024 ELRA Language Resource Association: CC BY-NC 4.0.

关键词： Social networking (online)

来源：评论

学校读者我要写书评

暂无评论

Perspectives from Unpaid Carers on Socially Assistive Robot Interactions in Older Adult Care 16th

Perspectives from Unpaid Carers on Socially Assistive Robo...

引用

16th International Conference on Social Robotics, ICSR + AI 2024

作者： Gul, Aisha D. Turner, Liam Fuentes, Carolina School of Computer Science & Informatics Cardiff University Cardiff United Kingdom

ISBN: (纸本)9789819635245

With the global population aging, there is a growing need for innovative assistive technologies to support unpaid carers in maintaining older adults’ quality of life. Socially Assistive Robots (SARs) offer a potential solution by assisting with daily tasks, providing companionship, and easing the unpaid carers’ burden. For successful integration, SARs must deliver personalised interactions that implicitly promote supporting the needs of both carers and older adults. We conducted a qualitative study with 15 unpaid carers who interacted with a Pepper robot to understand the perception of unpaid carers towards using SARs as an assistive tool for providing care to older adults. Thematic analysis revealed concerns about the lack of human touch, the role of SARs as assistants rather than replacements, and the potential of robots for companionship. Carers also expressed distrust in technology, lack of confidence in machine capabilities, and safety concerns. From these findings, we propose that future research studies consider the collective set of dyadic interactions as a triad between unpaid carers, older adults, and SARs in order to further investigate and design personalised care. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

关键词： Assistive technology

来源：评论

学校读者我要写书评

暂无评论

Developing phoneme-based lip-reading sentences system for silent speech recognition

引用

CAAI Transactions on Intelligence Technology 2023年第1期8卷 129-138页

作者： Randa El-Bialy Daqing Chen Souheil Fenghour Walid Hussein Perry Xiao Omar HKaram Bo Li School of Engineering London South Bank UniversityLondonUK Faculty of Informatics and Computer Science British University in EgyptCairoEgypt School of Electronics and Informatics Northwestern Polytechnical UniversityXi'anChina

Lip-reading is a process of interpreting speech by visually analysing lip *** research in this area has shifted from simple word recognition to lip-reading sentences in the *** paper attempts to use phonemes as a classification schema for lip-reading sentences to explore an alternative schema and to enhance system *** classification schemas have been investigated,including characterbased and visemes-based *** visual front-end model of the system consists of a Spatial-Temporal(3D)convolution followed by a 2D *** utilise multi-headed attention for phoneme recognition *** the language model,a Recurrent Neural Network is *** performance of the proposed system has been testified with the BBC Lip Reading Sentences 2(LRS2)benchmark *** with the state-of-the-art approaches in lip-reading sentences,the proposed system has demonstrated an improved performance by a 10%lower word error rate on average under varying illumination ratios.

关键词： deep learning deep neural networks lip-reading phoneme-based lip-reading spatial-temporal convolution,transformers

来源：评论

学校读者我要写书评

暂无评论

FilterGNN:Image feature matching with cascaded outlier filters and linearattention

引用

Computational Visual Media 2024年第5期10卷 873-884页

作者： Jun-Xiong Cai Tai-Jiang Mu Yu-Kun Lai Key Laboratory of Pervasive Computing Ministry of EducationDepartment of Computer Science and TechnologyTsinghua UniversityBeijing 100084China School of Computer Science and Informatics Cardiff UniversityWales CF244AGUK

The cross-view matching of local image features is a fundamental task in visual localization and 3D *** study proposes FilterGNN,a transformer-based graph neural network(GNN),aiming to improve the matching efficiency and accuracy of visual *** on high matching sparseness and coarse-to-fine covisible area detection,FilterGNN utilizes cascaded optimal graph-matching filter modules to dynamically reject outlier ***,we successfully adapted linear attention in FilterGNN with post-instance normalization support,which significantly reduces the complexity of complete graph learning from O(N2)to O(N).Experiments show that FilterGNN requires only 6%of the time cost and 33.3%of the memory cost compared with SuperGlue under a large-scale input size and achieves a competitive performance in various tasks,such as pose estimation,visual localization,and sparse 3D reconstruction.

关键词： image matching transformer linear attention visual localization sparse reconstruction

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：