Search Results - Inner Mongolia University Library

Dual modality prompt learning for visual question-grounded answering in robotic surgery

Visual Computing for Industry, Biomedicine, and Art, 2024, Vol. 7, No. 1, pp. 316-328

Authors: Yue Zhang; Wanshu Fan; Peixi Peng; Xin Yang; Dongsheng Zhou; Xiaopeng Wei. Affiliations: National and Local Joint Engineering Laboratory of Computer Aided Design, School of Software Engineering, Dalian University, Dalian 116622, Liaoning, China; School of Computer Science and Technology, Dalian University of Technology, Dalian 116081, Liaoning, China

With recent advancements in robotic surgery, notable strides have been made in visual question answering (VQA). Existing VQA systems typically generate textual answers to questions but fail to indicate the location of the relevant content within the *** limitation restricts the interpretative capacity of the VQA models and their ability to explore specific image *** address this issue, this study proposes a grounded VQA model for robotic surgery, capable of localizing a specific region during answer *** inspiration from prompt learning in language models, a dual-modality prompt model was developed to enhance precise multimodal information ***, two complementary prompters were introduced to effectively integrate visual and textual prompts into the encoding process of the model. A visual complementary prompter merges visual prompt knowledge with visual information features to guide accurate *** textual complementary prompter aligns visual information with textual prompt knowledge and textual information, guiding textual information towards a more accurate inference of the ***, a multiple iterative fusion strategy was adopted for comprehensive answer reasoning, to ensure high-quality generation of textual and grounded *** experimental results validate the effectiveness of the model, demonstrating its superiority over existing methods on the EndoVis-18 and EndoVis-17 datasets.

Keywords: Prompt learning; Visual prompt; Textual prompt; Grounding-answering; Visual question answering

FCA-based θ-iceberg core decomposition in graphs

Journal of Ambient Intelligence and Humanized Computing, 2024, Vol. 15, No. 2, pp. 1423-1428

Authors: Hao, Fei; Xinchang, Khamphaphone; Park, Doo-Soon. Affiliations: Key Laboratory of Modern Teaching Technology, Ministry of Education, Xi’an, China; School of Computer Science, Shaanxi Normal University, Xi’an 710119, China; Department of Computer Science and Engineering, Soonchunhyang University, Asan 31538, Korea, Republic of; Department of Computer Software Engineering, Soonchunhyang University, Asan 31538, Korea, Republic of

Complex networking analysis is a powerful technique for understanding both complex networks and big graphs in ubiquitous computing. In particular, several novel metrics, such as k-clique and k-core, have been proposed to study the relative importance of nodes in complex networks. Among those metrics, k-core analysis is an effective approach for simplifying graphical structure. However, the relation between k and the scale of networks is not explored in most existing literature. Toward this end, this paper formulates a new research problem, θ-Iceberg Core decomposition in graphs, which is able to incorporate a parameter θ (0 © Springer-Verlag GmbH Germany, part of Springer Nature 2017.
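As a rough illustration of the k-core metric mentioned in this abstract, the sketch below computes core numbers by repeatedly peeling the minimum-degree node. It shows only the standard k-core idea, not the paper's θ-iceberg formulation, whose definition is cut off in this record. The function names `core_numbers` and `k_core` and the toy edge list are assumptions made for the example.

```python
from collections import defaultdict


def core_numbers(edges):
    """Return {node: core number} for an undirected simple graph.

    Standard peeling: repeatedly remove a node of minimum remaining degree.
    A node's core number is the largest minimum degree seen up to its removal.
    """
    adj = defaultdict(set)
    for u, v in edges:
        if u != v:  # ignore self-loops
            adj[u].add(v)
            adj[v].add(u)

    degree = {node: len(nbrs) for node, nbrs in adj.items()}
    remaining = set(adj)
    core = {}
    k = 0
    while remaining:
        # Simple O(n) scan per step; fine for a small illustrative graph.
        node = min(remaining, key=degree.__getitem__)
        k = max(k, degree[node])
        core[node] = k
        remaining.remove(node)
        for nbr in adj[node]:
            if nbr in remaining:
                degree[nbr] -= 1
    return core


def k_core(edges, k):
    """Return the set of nodes whose core number is at least k."""
    return {node for node, c in core_numbers(edges).items() if c >= k}


# Toy graph (hypothetical): a triangle a-b-c with a pendant node d on c.
toy_edges = [("a", "b"), ("b", "c"), ("a", "c"), ("c", "d")]
print(core_numbers(toy_edges))
print(k_core(toy_edges, 2))
```

In the toy graph the pendant node is peeled first, so only the triangle survives in the 2-core.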

Keywords: Formal concept analysis

A Survey of Personalized Medicine Recommendation

International Journal of Crowd Science, 2024, Vol. 8, No. 2, pp. 77-82

Authors: Zhu, Fanglin; Cui, Lizhen; Xu, Yonghui; Qu, Zhe; Shen, Zhiqi. Affiliations: School of Software, Shandong University, Jinan 250101, China; Shandong University, Jinan 250101, China; School of Computer Science and Engineering, Nanyang Technological University, 639798, Singapore

Mining potential and valuable medical knowledge from massive medical data to support clinical decision-making has become an important research field. Personalized medicine recommendation is an important research direction in this field, aiming to recommend the most suitable medicines for each patient according to the health status of the patient. Personalized medicine recommendation can assist clinicians in making clinical decisions and help avoid medical abnormalities, so it has attracted wide attention from researchers. On this basis, this paper presents a comprehensive review of personalized medicine recommendation. Specifically, we first define the personalized medicine recommendation problem; then, starting from the key theories and technologies, the personalized medicine recommendation algorithms proposed in recent years are systematically classified (medicine recommendation based on multi-disease, medicine recommendation with combination pattern, medicine recommendation with additional knowledge, and medicine recommendation based on feedback) and analyzed in depth; the paper also introduces how to evaluate personalized medicine recommendation algorithms and some common evaluation indicators; finally, the challenges of the personalized medicine recommendation problem are put forward, and future research directions and development trends are discussed. © The author(s) 2024.

Keywords: Decision making

Suitable and Style-Consistent Multi-Texture Recommendation for Cartoon Illustrations

ACM Transactions on Multimedia Computing, Communications and Applications, 2024, Vol. 20, No. 7, pp. 1-26

Authors: Wu, Huisi; Wang, Zhaoze; Li, Yifan; Liu, Xueting; Lee, Tong-Yee. Affiliations: College of Computer Science and Software Engineering, Shenzhen University, No. 3688 Nanhai Road, Nanshan District, Guangdong, Shenzhen 518060, China; Dept. of Computer Science and Information Engineering, National Cheng-Kung University, No. 1 University Road, Tainan 70101, Taiwan

Texture plays an important role in cartoon illustrations to display object materials and enrich visual experiences. Unfortunately, manually designing and drawing an appropriate texture is not easy even for proficient artists, let alone novices or amateurs. While there are many textures on the Internet, it is not easy to pick an appropriate one using traditional text-based search engines. Although several texture pickers have been proposed, they still require the users to browse the textures by themselves, which is labor-intensive and time-consuming. In this article, an automatic texture recommendation system is proposed for recommending multiple textures to replace a set of user-specified regions in a cartoon illustration with a visually pleasing look. Two measurements, the suitability measurement and the style-consistency measurement, are proposed to make sure that the recommended textures are suitable for cartoon illustration and at the same time mutually consistent in style. The suitability is measured based on the synthesizability, cartoonity, and region fitness of textures. The style-consistency is predicted using a learning-based solution since it is subjective to judge whether two textures are consistent in style. An optimization problem is formulated and solved via the genetic algorithm. Our method is validated on various cartoon illustrations, and convincing results are obtained. © 2024 Copyright held by the owner/author(s). Publication rights licensed to ACM.

Keywords: Textures

Environment-Tolerant Trust Opportunity Routing Based on Reinforcement Learning for Internet of Underwater Things

IEEE Transactions on Mobile Computing, 2025, Vol. 24, No. 7, pp. 6348-6360

Authors: He, Yu; Han, Guangjie; Hou, Yun; Lin, Chuan. Affiliations: Hohai University, College of Information Science and Engineering, Changzhou 213200, China; Hohai University, Key Laboratory of Maritime Intelligent Network Information Technology, Ministry of Education, Changzhou 213200, China; Hohai University, College of Computer Science and Software Engineering, Nanjing 211100, China; Northeastern University, Software College, Shenyang 110819, China

The Internet of Underwater Things (IoUT) has garnered significant interest due to its potential applications in monitoring underwater environments. However, the unique characteristics of acoustic communication, such as long propagation delays and high attenuation, present considerable obstacles for achieving efficient and dependable data transmission. Opportunistic routing is a crucial technique for enhancing packet delivery ratios by selecting a set of forwarding nodes and utilizing their cooperative forwarding to boost network throughput. Nevertheless, choosing an excessive number of forwarding nodes can lead to wasteful energy usage and extended communication delays. Moreover, the overlooked trustworthiness of forwarded nodes in most research works can undermine the effectiveness of opportunistic routing. Therefore, this study presents a novel trust opportunistic routing scheme that employs reinforcement learning to achieve resilience in constantly changing underwater settings. The combination of reinforcement learning and trust management enables the proposed opportunistic routing scheme to adapt to the unstable underwater environment and unknown malicious attacks. Initially, a method is introduced for measuring environmental fitness by considering multiple trust factors, including communication success rate, data reliability, and location dynamics. The proposed scheme then uses reinforcement learning to develop a reliable opportunistic routing method based on quantified state information. This component employs the obtained state to formulate action strategies and obtains reward values from environmental inputs. The reward update equation integrates these qualities to optimize the deployment of superior action strategies, finally achieving trust opportunistic routing for underwater data collection. Fundamental experimental results demonstrate that the proposed protocol performs exceptionally well in demanding underwater conditions, outperforming existing method

Keywords: Reinforcement learning

YOLOCSP-PEST for Crops Pest Localization and Classification

Computers, Materials & Continua, 2025, Vol. 82, No. 2, pp. 2373-2388

Authors: Farooq Ali; Huma Qayyum; Kashif Saleem; Iftikhar Ahmad; Muhammad Javed Iqbal. Affiliations: Department of Software Engineering, University of Engineering and Technology, Taxila 47050, Pakistan; Department of Computer Science & Engineering, College of Applied Studies & Community Service, King Saud University, Riyadh 11362, Saudi Arabia; Department of Information Technology, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah 21589, Saudi Arabia; Department of Computer Science, University of Engineering and Technology, Taxila 47050, Pakistan

Preserving crops depends on early and accurate detection of pests, which cause several diseases that decrease crop production and quality. Several deep-learning techniques have been applied to pest detection on crops. We have developed the YOLOCSP-PEST model for pest localization and classification. With the Cross Stage Partial Network (CSPNET) backbone, the proposed model is a modified version of You Only Look Once Version 7 (YOLOv7) intended primarily for pest localization and classification. The proposed model gives exceptionally good results under conditions that are very challenging for comparable models, especially where the luminance and orientation of the images are problematic. It helps farmers working on crops in distant areas to detect infestations quickly and accurately, which benefits the quality and quantity of the production yield. The model has been trained and tested on two datasets, the IP102 dataset and a local crop dataset, and has shown exceptional results on both. It achieved a mean average precision (mAP) of 88.40%, a precision of 85.55%, and a recall of 84.25% on the IP102 dataset, and a mAP of 97.18%, a recall of 94.88%, and a precision of 97.50% on the local dataset. These findings demonstrate that the proposed model is very effective in real-life detection scenarios and can help crop production by improving yield quality and quantity at the same time.

Keywords: Deep learning; classification of pests; YOLOCSP-PEST; pest detection

Medicine and Disease Association Prediction via Attention-Based Medical Heterogeneous Information Network Representation Learning

IAENG International Journal of Computer Science, 2022, Vol. 49, No. 1, pp. 69-78

Authors: Zhang, Bin; Yang, Dan; Lin, Zhihuang. Affiliations: Master of Computer Science and Software Engineering Department, University of Science and Technology Liao Ning, Anshan, China; School of Computer Science and Software Engineering, University of Science and Technology Liao Ning, Anshan, China

Knowledge of medication and disease has accumulated rapidly, and an increasing number of researchers are paying attention to predicting medicine-disease associations with machine learning methods. The associations of entities in a medical heterogeneous information network involve different association types. In this paper, we propose MHIN-MD, a representation learning model for medical heterogeneous networks, to predict associations between medicines and diseases in a medical heterogeneous information network. Specifically, we first construct a medical heterogeneous information network containing diseases, medicines, diagnoses, etc. Next, we design a neural network model to learn feature vector representations of medical node information. We try to find the association between medicines and diseases through the structural characteristics of medicines and the interactions between medicines. Euclidean distance is used to calculate medicine similarity and disease similarity, generating a set of similar nodes between medicines and diseases from which associations between medicines and diseases are predicted. Finally, extensive experiments on the MIMIC-III and DrugBank datasets demonstrate that MHIN-MD can outperform baselines in association prediction. © 2022. All Rights Reserved.
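The abstract states that Euclidean distance between learned feature vectors is used to find similar medicines and diseases. The sketch below shows one minimal way such a nearest-node lookup could work. It is not the MHIN-MD implementation: the abstract does not say how distances are turned into similarity scores or candidate sets, and the function name `most_similar`, the node names, and the toy vectors are all assumptions.

```python
import math


def most_similar(query, embeddings, top_n=2):
    """Rank nodes by Euclidean distance to `query` (smaller means more similar).

    `embeddings` maps a node name to its feature vector. All vectors must have
    the same length as `query`, otherwise math.dist raises ValueError.
    """
    ranked = sorted(
        ((name, math.dist(query, vec)) for name, vec in embeddings.items()),
        key=lambda pair: pair[1],
    )
    return ranked[:top_n]


# Hypothetical 2-D embeddings, for illustration only.
toy_embeddings = {
    "medicine_1": [3.0, 4.0],
    "medicine_2": [1.0, 0.0],
    "medicine_3": [0.0, 2.0],
}
print(most_similar([0.0, 0.0], toy_embeddings))
```

Ranking by raw distance keeps the example close to what the abstract describes. A real system would likely normalize the vectors or convert distances to a bounded similarity score first.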

Keywords: Diagnosis

Pairwise tagging framework for end-to-end emotion-cause pair extraction

Frontiers of Computer Science, 2023, Vol. 17, No. 2, pp. 111-120

Authors: Zhen WU; Xinyu DAI; Rui XIA. Affiliations: National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, China; Collaborative Innovation Center of Novel Software Technology and Industrialization, Nanjing 210023, China; School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210023, China

Emotion-cause pair extraction (ECPE) aims to extract all the pairs of emotions and corresponding causes in a *** generally contains three subtasks, emotions extraction, causes extraction, and causal relations detection between emotions and *** works adopt pipelined approaches or multi-task learning to address the ECPE ***, the pipelined approaches easily suffer from error propagation in real-world *** multi-task learning cannot optimize all tasks globally and may lead to suboptimal extraction *** address these issues, we propose a novel framework, Pairwise Tagging Framework (PTF), tackling the complete emotion-cause pair extraction in one unified tagging *** prior works, PTF innovatively transforms all subtasks of ECPE, i.e., emotions extraction, causes extraction, and causal relations detection between emotions and causes, into one unified clause-pair tagging *** this unified tagging task, we can optimize the ECPE task globally and extract more accurate emotion-cause *** validate the feasibility and effectiveness of PTF, we design an end-to-end PTF-based neural network and conduct experiments on the ECPE benchmark *** experimental results show that our method outperforms pipelined approaches significantly and typical multi-task learning approaches.

Keywords: emotion-cause pair extraction; pairwise tagging framework; end-to-end neural network

ER-Net: Efficient Recalibration Network for Multi-View Multi-Person 3D Pose Estimation

Computer Modeling in Engineering & Sciences, 2023, Vol. 136, No. 8, pp. 2093-2109

Authors: Mi Zhou; Rui Liu; Pengfei Yi; Dongsheng Zhou. Affiliations: National and Local Joint Engineering Laboratory of Computer Aided Design, School of Software Engineering, Dalian University, Dalian 116622, China; School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, China

Multi-view multi-person 3D human pose estimation is a hot topic in the field of human pose estimation due to its wide range of application *** the introduction of end-to-end direct regression methods, the field has entered a new stage of ***, the regression results of joints that are more heavily influenced by external factors are not accurate enough even for the optimal *** this paper, we propose an effective feature recalibration module based on the channel attention mechanism and a relative optimal calibration strategy, which is applied to the multi-view multi-person 3D human pose estimation task to achieve improved detection accuracy for joints that are more severely affected by external ***, it achieves relative optimal weight adjustment of joint feature information through the recalibration module and strategy, which enables the model to learn the dependencies between joints and the dependencies between people and their corresponding *** call this method the Efficient Recalibration Network (ER-Net). Finally, experiments were conducted on two benchmark datasets for this task, Campus and Shelf, in which the PCP reached 97.3% and 98.3%, respectively.

Keywords: Multi-view multi-person pose estimation; attention mechanism; computer vision

MixSSC: Forward-Backward Mixture for Vision-based 3D Semantic Scene Completion

IEEE Transactions on Circuits and Systems for Video Technology, 2025, Vol. 35, No. 6, pp. 5684-5696

Authors: Wang, Meng; Ding, Yan; Liu, Yumeng; Qin, Yunchuan; Li, Ruihui; Tang, Zhuo. Affiliations: The College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China; Beijing Key Laboratory of Human-Computer Interaction, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China

Vision-based semantic scene completion task aims to predict dense geometric and semantic 3D scene representations from 2D images. However, 3D modeling from a single view is an ill-posed problem, limited by the field of view and occlusion problems caused by image input. Moreover, existing methods tend to produce erroneous scene hallucinations and overly smooth boundary segmentation due to a lack of information. To address this problem, we propose MixSSC, which mixes the sparsity of forward projection with the denseness of depth-prior backward projection. The aim is to use sparse features to fill information-poor regions and dense features to enhance visible regions. Specifically, we develop the forward-backward mixture module, which enables the generation of scene mixture voxel representation by leveraging the benefits of both forward and backward projection. Subsequently, we design the semantic-spatial fusion module, which utilizes a coarse-to-fine approach to process mixture voxel features at the semantic-spatial level. Extensive experimental results on the SemanticKITTI, SSCBench-KITTI-360 and nuScenes datasets demonstrate the superiority of MixSSC. © 1991-2012 IEEE.

Keywords: Semantic Segmentation
