Search results - Inner Mongolia University Library

NeOR: neural exploration with feature-based visual odometry and tracking-failure-reduction policy

Optoelectronics Letters, 2025, Vol. 21, No. 5, pp. 290-297

Authors: ZHU Ziheng, LIU Jialing, CHEN Kaiqi, TONG Qiyi, LIU Ruyu. Affiliations: College of Computer Science and Technology, College of Software, Zhejiang University of Technology; School of Information Science and Technology, Hangzhou Normal University

Embodied visual exploration is critical for building intelligent visual agents. This paper presents the neural exploration with feature-based visual odometry and tracking-failure-reduction policy (NeOR), a framework for embodied visual exploration that possesses the efficient exploration capabilities of deep reinforcement learning (DRL)-based exploration policies and leverages feature-based visual odometry (VO) for more accurate mapping and positioning results. An improved local policy is also proposed to reduce tracking failures of feature-based VO in weakly textured scenes through a refined multi-discrete action space, keyframe fusion, and an auxiliary task. The experimental results demonstrate that NeOR has better mapping and positioning accuracy compared to other entirely learning-based exploration frameworks and improves the robustness of feature-based VO by significantly reducing tracking failures in weakly textured scenes.

Keywords: A

Image detection method for multi-category lesions in wireless capsule endoscopy based on deep learning models

World Journal of Gastroenterology, 2024, Vol. 30, No. 48, pp. 5111-5129

Authors: Zhi-Guo Xiao, Xian-Qing Chen, Dong Zhang, Xin-Yuan Li, Wen-Xin Dai, Wen-Hui Liang. Affiliations: School of Computer Science Technology, Changchun University, Changchun 130022, Jilin Province, China; School of Computer Science Technology, Beijing Institute of Technology, Beijing 100811, China

BACKGROUND Wireless capsule endoscopy (WCE) has become an important noninvasive and portable tool for diagnosing digestive tract diseases and has been propelled by advancements in medical imaging ***, the complexity of the digestive tract structure, and the diversity of lesion types, results in different sites and types of lesions distinctly appearing in the images, posing a challenge for the accurate identification of digestive tract *** To propose a deep learning-based lesion detection model to automatically identify and accurately label digestive tract lesions, thereby improving the diagnostic efficiency of doctors, and creating significant clinical application *** In this paper, we propose a neural network model, WCE_Detection, for the accurate detection and classification of 23 classes of digestive tract lesion ***, since multicategory lesion images exhibit various shapes and scales, a multidetection head strategy is adopted in the object detection network to increase the model's robustness for multiscale lesion ***, a bidirectional feature pyramid network (BiFPN) is introduced, which effectively fuses shallow semantic features by adding skip connections, significantly reducing the detection error *** the basis of the above, we utilize the Swin Transformer with its unique self-attention mechanism and hierarchical structure in conjunction with the BiFPN feature fusion technique to enhance the feature representation of multicategory lesion *** The model constructed in this study achieved an mAP50 of 91.5% for detecting 23 *** than eleven single-category lesions achieved an mAP50 of over 99.4%, and more than twenty lesions had an mAP50 value of over 80%. These results indicate that the model outperforms other state-of-the-art models in the end-to-end integrated detection of human digestive tract lesion *** The deep learning-based object detection network detects multiple digestive tract lesi

Keywords: Human digestive tract; Artificial intelligence; Deep learning; Wireless capsule endoscopy; Object detection

Secure vehicular data communication in Named Data Networking

Digital Communications and Networks, 2023, Vol. 9, No. 1, pp. 203-210

Authors: Xiaonan Wang, Xilan Chen, Xingwei Wang. Affiliations: School of Computer Science and Engineering, Changshu Institute of Technology, Changshu 215500, China; School of Computer Science and Technology, Northeastern University, Shenyang 110169, China

Vehicular data misuse may lead to traffic accidents and even loss of life, so it is crucial to achieve secure vehicular data *** paper focuses on secure vehicular data communications in Named Data Networking (NDN). In NDN, names, provider IDs and data are transmitted in plaintext, which exposes vehicular data to security threats and leads to considerable data communication costs and failure *** paper proposes a Secure vehicular Data Communication (SDC) approach in NDN to suppress data communication costs and failure *** constructs a vehicular backbone to reduce the number of authenticated nodes involved in reverse *** the ciphertext of the name and data is included in the signed Interest and Data and transmitted along the backbone, so the secure data communications are *** is evaluated, and the data results demonstrate that SDC achieves the above objectives.

Keywords: Named data networking; Reverse path; Secure data communication

DNA sequence design model for multi-scene fusion

Neural Computing and Applications, 2025, Vol. 37, No. 7, pp. 5499-5520

Authors: Yao, Yao; Zheng, Yanfen; Cui, Shuang; Hou, Yaqing; Zhang, Qiang; Wei, Xiaopeng. Affiliation: School of Computer Science and Technology, Dalian University of Technology, Liaoning, Dalian 116024, China

Due to its unique properties and excellent sequence design methods, DNA finds wide applications in computing, information storage, molecular circuits, and biological diagnosis. Previous efforts to enhance the efficiency and precision of DNA sequence design have led to the proposal of various universal DNA sequence design methods. These methods optimize the arrangement of the four bases to reduce sequence similarity and meet specific criteria. However, prior investigations have predominantly focused on sequence design within single-scene frameworks, overlooking the complexities associated with designing for multi-scene fusion, such as ion-bridge mismatch, tri-base sequence design, and others. To address this gap, we fused four common scenes and introduced two novel constraint models to facilitate DNA sequence design for multi-scene fusion. Additionally, we developed a dynamic virus spread algorithm as the core for optimizing DNA sequences and evaluated it using 23 well-known benchmark functions. Furthermore, our algorithm outperformed eight popular swarm evolutionary algorithms in eight dominant results. Finally, we simulated the optimization of four distinct scenes, demonstrating that our sequences met expected performance levels in their respective areas. Thus, our work provides a practical tool for designing DNA sequences tailored to various specific applications. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2025.

Keywords: Integrated circuit design

Meter-YOLOv8n: A Lightweight and Efficient Algorithm for Word-Wheel Water Meter Reading Recognition

International Journal of Advanced Computer Science and Applications, 2025, Vol. 16, No. 4, pp. 209-221

Authors: Qiao, Shichao; Yuan, Yuying; Qi, Ruijie. Affiliation: School of Computer Science and Technology, Shandong University of Technology, Shandong, Zibo 255000, China

To address the issues of low efficiency and large parameters in the current word-wheel water meter reading recognition algorithms, this paper proposes a Meter-YOLOv8n algorithm based on YOLOv8n. Firstly, the C2f component of YOLOv8n is improved by introducing an enhanced inverted residual mobile block (iRMB). It enables the model to efficiently capture global features and fully extract the key information of the water meter characters. Secondly, the Slim-Neck feature fusion structure is employed in the neck network. By replacing the original convolutional kernels with GSConv, the model's ability to express the features of small object characters is enhanced, and the number of parameters in the model is reduced. Finally, Inner-EIoU is used to optimize the bounding box loss function. This simplifies the calculation process of the loss function and improves the model's ability to locate dense bounding boxes. The experimental results show that, compared with the original model, the precision, recall, mAP@0.5, and mAP@0.5:0.95 of the improved model have increased by 1.7%, 1.2%, 3.4%, and 3.3% respectively. Meanwhile, the parameters, FLOPs, and model size have decreased by 0.56M, 2.6G, and 0.7MB respectively. The improved model can better balance the relationship between detection performance and computational complexity. It is suitable for the task of recognizing word-wheel water meter readings and has practical application value. © (2025), (science and Information Organization). All Rights Reserved.

Keywords: NP-hard
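
The mAP@0.5 figure quoted in this abstract treats a predicted box as a match, in simplified form, when its intersection-over-union (IoU) with a ground-truth box is at least 0.5. The sketch below shows plain IoU only; it is not the Inner-EIoU loss the paper uses, and the names `iou` and `is_match_at_50` and the `(x1, y1, x2, y2)` box layout are assumptions made for illustration.

```python
def iou(box_a, box_b):
    """Plain intersection-over-union of two axis-aligned boxes.

    Boxes are (x1, y1, x2, y2) with x1 < x2 and y1 < y2 (an assumed layout).
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Width and height of the overlap rectangle, clamped at zero when disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


def is_match_at_50(pred_box, gt_box):
    # Simplified mAP@0.5 criterion: class labels and one-to-one assignment
    # between predictions and ground truth are ignored in this sketch.
    return iou(pred_box, gt_box) >= 0.5
```

A full mAP computation would additionally rank predictions by confidence and average precision over classes; that part is omitted here.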

How graph convolutions amplify popularity bias for recommendation?

Frontiers of Computer Science, 2024, Vol. 18, No. 5, pp. 121-132

Authors: Jiajia CHEN, Jiancan WU, Jiawei CHEN, Xin XIN, Yong LI, Xiangnan HE. Affiliations: School of Information Science and Technology, University of Science and Technology of China, Hefei 230026, China; School of Computer Science and Technology, Zhejiang University, Hangzhou 310058, China; School of Computer Science and Technology, Shandong University, Qingdao 250100, China; Department of Electronic Engineering, Tsinghua University, Beijing 100084, China

Graph convolutional networks (GCNs) have become prevalent in recommender systems (RS) due to their superiority in modeling collaborative *** improving the overall accuracy, GCNs unfortunately amplify popularity bias: tail items are less likely to be *** effect prevents the GCN-based RS from making precise and fair recommendations, decreasing the effectiveness of recommender systems in the long *** this paper, we investigate how graph convolutions amplify the popularity bias in *** theoretical analyses, we identify two fundamental factors: (1) with graph convolution (i.e., neighborhood aggregation), popular items exert larger influence than tail items on neighbor users, making the users move towards popular items in the representation space; (2) after multiple rounds of graph convolution, popular items would affect more high-order neighbors and become more *** two points make popular items get closer to almost all users and thus be recommended more *** rectify this, we propose to estimate the amplified effect of popular nodes on each node's representation, and intervene on the effect after each graph ***, we adopt clustering to discover highly-influential nodes and estimate the amplification effect of each node, then remove the effect from the node embeddings at each graph convolution *** method is simple and generic: it can be used in the inference stage to correct existing models rather than training a new model from scratch, and can be applied to various GCN *** demonstrate our method on two representative GCN backbones, LightGCN and UltraGCN, verifying its ability in improving the recommendations of tail items without sacrificing the performance of popular *** are open-sourced.

Keywords: recommendation; graph convolution networks; popularity bias
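
Factor (1) in this abstract, that popular items exert more influence on neighboring users under neighborhood aggregation, can be illustrated on a toy graph. The sketch below uses plain mean aggregation over a three-user, two-item graph; the aggregation rule, the function names, and the data are all assumptions chosen for illustration, and this is neither the paper's theoretical analysis nor the normalization used by LightGCN or UltraGCN.

```python
def aggregate_users(interactions, item_emb):
    """One mean-aggregation step: each user embedding becomes the mean of
    the embeddings of the items that user interacted with.

    Assumes every user has at least one interaction.
    """
    user_emb = {}
    for user, items in interactions.items():
        dim = len(item_emb[items[0]])
        total = [0.0] * dim
        for item in items:
            for k, value in enumerate(item_emb[item]):
                total[k] += value
        user_emb[user] = [t / len(items) for t in total]
    return user_emb


def item_influence(interactions):
    """Total weight each item receives across all users' mean aggregations."""
    influence = {}
    for items in interactions.values():
        for item in items:
            influence[item] = influence.get(item, 0.0) + 1.0 / len(items)
    return influence


# Toy data: "pop" is interacted with by every user, "tail" by only one.
interactions = {"u0": ["pop", "tail"], "u1": ["pop"], "u2": ["pop"]}
item_emb = {"pop": [1.0, 0.0], "tail": [0.0, 1.0]}

user_emb = aggregate_users(interactions, item_emb)
influence = item_influence(interactions)
```

In this toy setup two of the three users end up exactly at the popular item's embedding, and the popular item's summed aggregation weight is five times that of the tail item.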

Image Guidance Encoder-Decoder Model in Image Captioning and Its Application

IAENG International Journal of Computer Science, 2024, Vol. 51, No. 9, pp. 1385-1392

Authors: Yang, Zhen; Zhou, Ziwei; Wang, Chaoyang; Xu, Liang. Affiliations: School of Applied Technology, University of Science and Technology Liaoning, Anshan, China; School of Computer and Software Engineering, University of Science and Technology Liaoning, Anshan, China; School of Computer and Software Engineering, University of Science and Technology Liaoning, Anshan, China

This paper introduces a new network model - the Image Guidance Encoder-Decoder Model (IG-ED), designed to enhance the efficiency of image captioning and improve predictive accuracy. IG-ED, a fusion of the convolutional network VGGNet-16 and the long short-term memory network (LSTM), is designed based on the encoder-decoder structure. The image captioning performance sees significant enhancements when leveraging the IG-ED network model. The network training process unfolds in a series of steps. Initially, the input image undergoes convolution via the VGGNet-16 network, producing a 512-dimensional vector. Concurrently, each word in the image's caption is encoded to generate a corresponding 512-dimensional vector consistent with the image feature dimension. These two vectors form the input for the decoding process. Subsequently, the vectors are fed into the redesigned fusion LSTM (F-LSTM) network at different time steps to gradually train the parameters of the IG-ED framework. The training process is completed by utilizing a loss function for determining convergence. Evaluation of the IG-ED model's performance is conducted using CIDEr and seven other evaluation metrics on the MSCOCO 2014 dataset. The results exhibit substantial improvements over the "Adaptive Attention Mode" network and "Neural Talk" network. Additionally, the parameter count of the IG-ED architecture is significantly reduced compared to the "Adaptive Attention Mode" network, leading to decreased computational resource requirements and enabling edge computing on the neural network. © (2024), (International Association of Engineers). All Rights Reserved.

Keywords: Long short-term memory

A binary-domain recurrent-like architecture-based dynamic graph neural network

Autonomous Intelligent Systems, 2024, Vol. 4, No. 1, pp. 259-270

Authors: Zi-chao Chen, Sui Lin. Affiliation: School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, China

The integration of Dynamic Graph Neural Networks (DGNNs) with Smart Manufacturing is crucial as it enables real-time, adaptive analysis of complex data, leading to enhanced predictive accuracy and operational efficiency in industrial *** address the problem of poor combination effect and low prediction accuracy of current dynamic graph neural networks in spatial and temporal domains, and over-smoothing caused by traditional graph neural networks, a dynamic graph prediction method based on spatiotemporal binary-domain recurrent-like architecture is proposed: Binary Domain Graph Neural Network (BDGNN). The proposed model begins by utilizing a modified Graph Convolutional Network (GCN) without an activation function to extract meaningful graph topology information, ensuring non-redundant *** the temporal domain, Recurrent Neural Network (RNN) and residual systems are employed to facilitate the transfer of dynamic graph node information between learner weights, aiming to mitigate the impact of noise within the graph *** the spatial domain, the AdaBoost (Adaptive Boosting) algorithm is applied to replace the traditional approach of stacking layers in a graph neural *** allows for the utilization of multiple independent graph learners, enabling the extraction of higher-order neighborhood information and alleviating the issue of *** efficacy of BDGNN is evaluated through a series of experiments, with performance metrics including Mean Average Precision (MAP) and Mean Reciprocal Rank (MRR) for link prediction tasks, as well as metrics for traffic speed regression tasks across diverse test *** with other models, the better experimental results demonstrate that the BDGNN model can not only better integrate the connection between time and space information, but also extract higher-order neighbor information to alleviate the over-smoothing phenomenon of the original GCN.

Keywords: Dynamic graph neural network; Smart manufacturing; Over-smoothing; Link prediction; Traffic prediction
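
Mean Reciprocal Rank (MRR), one of the link-prediction metrics named in this abstract, averages the reciprocal of the rank at which the true candidate appears for each query. The sketch below is a generic illustration of the standard definition; the helper names and the tie-handling rule (ties do not push the true candidate down) are assumptions, not details taken from the paper's evaluation code.

```python
def rank_of_true(scores, true_index):
    """1-based rank of the true candidate among all scored candidates.

    Candidates with a strictly higher score are ranked ahead of it; ties are
    resolved in favour of the true candidate (an assumed convention).
    """
    true_score = scores[true_index]
    return 1 + sum(1 for s in scores if s > true_score)


def mean_reciprocal_rank(ranks):
    """Average of 1/rank over all queries; ranks are 1-based."""
    if not ranks:
        raise ValueError("ranks must not be empty")
    return sum(1.0 / r for r in ranks) / len(ranks)


# Three hypothetical link-prediction queries: candidate scores and the
# index of the true link in each list.
queries = [
    ([0.9, 0.1, 0.3], 0),       # true link ranked 1st
    ([0.2, 0.8, 0.5], 2),       # true link ranked 2nd
    ([0.7, 0.6, 0.5, 0.1], 3),  # true link ranked 4th
]
ranks = [rank_of_true(scores, idx) for scores, idx in queries]
mrr = mean_reciprocal_rank(ranks)
```

Here the ranks are 1, 2 and 4, so the MRR is (1 + 1/2 + 1/4) / 3.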

Semantic-specific multimodal relation learning for sentiment analysis

Neural Computing and Applications, 2024, Vol. 36, No. 18, pp. 10799-10809

Authors: Wu, Rui; Luo, YuanYi; Liu, JiaFeng; Tang, XiangLong. Affiliation: School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China

Multimodal sentiment analysis (MSA) seeks to understand human affection by leveraging signals from multiple modalities. A core challenge in MSA is the effective extraction of sentimental relations between these signals, as this can enhance a model’s consistency and accuracy. Existing studies typically use multimodal matching tasks to learn all semantic relations between modalities and then use a downstream task to obtain the specific semantics from the multimodal representation. However, there are multiple semantics between modalities, such as action semantics, scene semantics and sentiment semantics. Relying solely on specific tasks to filter these semantics often results in a surplus of redundant information in the multimodal representation, potentially degrading MSA accuracy. In addition, the unimodal semantic expression is also important. In this paper, we propose a semantic-specific multimodal relation learning method to correlate modalities with specific semantics. Specifically, with smaller computational resources, we enhance unimodal sentimental semantic expression while diminishing non-sentimental semantic information in the multimodal representation. We conducted experiments on the multimodal sentiment analysis datasets CMU-MOSI, CMU-MOSEI and CH-SIMS. The results show that our method outperforms the current state-of-the-art. Notably, on the Acc2 evaluation metric, our approach exhibits an average accuracy improvement of 0.75 compared to the best baseline. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.

Keywords: Semantics

MaliFuzz: Adversarial Malware Detection Model for Defending Against Fuzzing Attack

Journal of Beijing Institute of Technology, 2024, Vol. 33, No. 5, pp. 436-449

Authors: Xianwei Gao, Chun Shan, Changzhen Hu. Affiliation: School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China

With the prevalence of machine learning in malware defense, hackers have tried to attack machine learning models to evade *** is generally difficult to explore the details of malware detection models, hackers can adopt a fuzzing attack to manipulate the features of the malware closer to benign programs on the premise of retaining their *** this paper, attack and defense methods on malware detection models based on machine learning algorithms were ***, we designed a fuzzing attack method by randomly modifying features to evade *** fuzzing attack can effectively reduce the accuracy of a machine learning model with single *** an adversarial malware detection model MaliFuzz is proposed to defend fuzzing *** from the ordinary single feature detection model, the combined features by static and dynamic analysis to improve the defense ability are *** experimental results show that the adversarial malware detection model with combined features can deal with the *** methods designed in this paper have great significance in improving the security of malware detection models and have good application prospects.

Keywords: adversarial machine learning; fuzzing attack; malware detection
