检索结果-内蒙古大学图书馆

Object Detection Model for Remote Sensing Images Based on YOLOv9

学校读者我要写书评

暂无评论

IAENG International Journal of computer Science 2025年第3期52卷 840-847页

作者： Hou, Donghao Zhang, Yujun School of Computer and Software Engineering University of Science and Technology Liaoning Anshan114051 China

In the field of object detection for remote sensing images, especially in applications such as environmental monitoring and urban planning, significant progress has been made. This paper addresses the common challenges faced by traditional object detection methods in remote sensing images, such as the large number of targets and complex backgrounds, by proposing a novel network based on YOLOv9. The network innovatively introduces the C3_CD_CGA module, an enhanced module based on Cascaded Group Attention, designed to reduce computational redundancy and increase attention diversity, and enhances the processing capability of multi-scale information through the CD module. The C3 module employs deep asymmetric convolution to mitigate information loss and increase the receptive field. Additionally, the network integrates DSConv with the RepNCSPELAN4 module to adaptively focus on and precisely capture the features of elongated and curved local structures, such as vehicles. The introduction of the CARAFE module further improves the spatial resolution of the feature maps, significantly enhancing performance across various visual tasks. Experimental results show that the improved YOLOv9 achieves a mean average precision (mAP) of 88% on the SIMD dataset, which is an improvement of 1.6% compared to the baseline YOLOv9 model and 1.5% higher than the state-of-the-art YOLO-SE model. This model not only achieves more effective multi-target recognition in complex backgrounds but also strikes a good balance between accuracy and efficiency. © (2025), (International Association of Engineers). All rights reserved.

关键词： Urban planning

Detection of Small Underwater Organisms Based on Improved YOLOv8

学校读者我要写书评

暂无评论

IAENG International Journal of computer Science 2024年第8期51卷 1020-1026页

作者： Miao, Liheng Tian, Ying School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan114051 China School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan114051 China

The underwater environment is complex and diverse, making it challenging to locate aquatic organisms accurately. The precise identification of underwater animals is crucial for ecological research and fisheries management. Addressing the issue of inaccurate localization of small underwater targets, this study introduces a novel model, YOLOv8-2PCC, based on the YOLOv8 algorithm with improvements. First, to improve the efficiency of the YOLOv8 network, the C2F module in the original YOLOv8 network model was replaced with convolution to reduce the computational load of the model. Secondly, the up-sampling operator CARAFE is employed, which excels in capturing features at various scales. Finally, a small target detection layer has been incorporated to extract additional shallow features, effectively enhancing the model's ability to detect small targets. Utilizing the URPC dataset for training and testing, the results indicate that our proposed algorithm achieves a mean Average Precision (mAP) of 85.9%. Compared to YOLOv8n, there is a 4.4% improvement, effectively enhancing the accuracy of underwater organism detection in complex underwater environments. © (2024), (International Association of Engineers). All Rights Reserved.

关键词： Aquatic organisms

Steel Surface Defect Detection Algorithm Based on S-YOLOv8

学校读者我要写书评

暂无评论

IAENG International Journal of computer Science 2025年第3期52卷 644-652页

作者： Zhang, Xu Cui, Wenhua Tao, Ye Shi, Tianwei School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan China

Steel, being a widely utilized material in industrial production, holds a pivotal role in ensuring product safety and longevity. Hence, the exploration and implementation of steel surface defect detection technology carry significant importance. This paper introduces a steel surface defect detection algorithm based on S-YOLOv8. The algorithm, rooted in YOLOv8n as a benchmark model, initially incorporates a shift-wise shift operator in the backbone network. This introduction notably enhances accuracy compared to conventional CNN models while markedly reducing computational demands. Furthermore, the utilization of the SF-Neck framework, integrating the scale sequence feature fusion module (SSFF) and triple feature encoder module (TFE) in the head network, enriches the network’s multi-scale information extraction capabilities. Subsequently, the adoption of the WIoU loss function enhances the overall detector performance. Lastly, the integration of the SEAM occlusion attention module refines the detection head segment of the YOLOv8 algorithm, effectively addressing defect occlusion challenges. Experiments conducted on the NEU-DET dataset reveal that the mAP value of the S-YOLOv8 model reaches an impressive 84.2%. Comparative analysis with other mainstream algorithms demonstrates a substantial enhancement in detection accuracy, alongside a reduction in instances of leakage and misdetection. Consequently, this study charts a new technical trajectory for quality control within the steel manufacturing industry. © (2025), (International Association of Engineers). All rights reserved.

关键词： Benchmarking

FDM-RTDETR: A Multi-Scale Small Target Detection Algorithm

学校读者我要写书评

暂无评论

IEEE Access 2025年 13卷 88747-88761页

作者： Wang, Hongya Yu, Yongtao Tang, Zhaoxia Huaiyin Institute of Technology Faculty of Computer and Software Engineering Jiangsu Huaian223003 China

To address challenges in uncrewed aerial vehicles (UAV) object detection including complex backgrounds, severe occlusion, dense small objects, and varying lighting conditions, we propose FDM-DETR, a novel detection algorithm specifically designed for small objects in UAV imagery. This method effectively captures global image information by fusing multi-scale spatial features and performing feature extraction in the frequency domain within the backbone network. We design a Dynamic Feature Interaction (DIFI) module with position-based biases in the encoder, enhancing the model’s perception of local features for small objects. In the neck network, we introduce a Multi-Scale Feature Enhancement Pyramid (MSFEP) module to improve feature extraction capabilities for small object detection. Compared to RT-DETR, our improved model achieves performance gains of 2.5% and 2.7% in AP on the Vis-Drone2019 validation and test sets, respectively. While maintaining low computational complexity and parameter count, the method demonstrates significant improvements in detection performance. FDM-DETR exhibits robust practicality and reliability in UAV-based small object detection tasks. © 2013 IEEE.

关键词： Target drones

Underwater Biological Target Detection Algorithm and Research Based on YOLOv7 Algorithm

学校读者我要写书评

暂无评论

IAENG International Journal of computer Science 2024年第6期51卷 594-601页

作者： Zhuang, Hongwei Liu, Weisheng School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan China College of Computer Science and Software Engineering University of Science and Technology Liaoning CO Anshan114051 China

Underwater target detection is an important method for detecting marine organisms. However, due to the image occlusion of underwater targets, blurred water quality, poor lighting conditions, small targets, and complex backgrounds, the detection of underwater biological targets has posed significant challenges. In the intricate underwater environment, the conventional feature extraction method has a few drawbacks, including imprecise feature extraction, sluggish detection speed, and inadequate robustness. Consequently, an underwater target detection method based on the enhanced You Only Look Once 7 (YOLOv7) is proposed in this study. The network architecture is reconstructed, and the Deformable Convolutional Network (DCN) modules replace some 3×3 convolutional blocks in the ELAN structure to offset sampling points and reduce background interference. Skip connections and 1× 1 convolutional architecture are added to the DCN module to improve the model’s perception of image details. In addition, Contextual Transformer 3 (COT3) is also incorporated to improve visual performance. Finally, to improve the detection efficiency of small objects, the CIoU loss function is finally replaced by the Normalized Wasserstein Distance (NWD) algorithm. The mAP of DCCN-YOLOv7 on the URPC dataset is 80.4%, according to the experimental results, 2.8% higher than the YOLOv7 network model that is used as a baseline. Furthermore, in contrast to the original YOLOv7 algorithm, the detection speed and accuracy are higher, making it more appropriate for target recognition underwater. © (2024), (International Association of Engineers). All rights reserved.

关键词： Feature extraction

Impact of transfer learning compared to convolutional neural networks on fruit detection

学校读者我要写书评

暂无评论

Journal of Intelligent and Fuzzy Systems 2024年第4期46卷 7791-7803页

作者： Salem, Dina Ahmed Hassan, Nesma Abdelaziz Hamdy, Razan Mohamed Computer and Software Engineering Department Misr University for Science and Technology Giza Egypt

Smart farming, also known as precision agriculture or digital farming, is an innovative approach to agriculture that utilizes advanced technologies and data-driven techniques to optimize various aspects of farming operations. One smart farming activity, fruit classification, has broad applications and impacts across agriculture, food production, health, research, and environmental conservation. Accurate and reliable fruit classification benefits various stakeholders, from farmers and food producers to consumers and conservationists. In this study, we conduct a comprehensive comparative analysis to assess the performance of a Convolutional Neural Network (CNN) model in conjunction with four transfer learning models: VGG16, ResNet50, MobileNet-V2, and EfficientNet-B0. Models are trained once on a benchmark dataset called Fruits360 and another time on a reduced version of it to study the effect of data size and image processing on fruit classification performance. The original dataset reported accuracy scores of 95%, 93%, 99.8%, 65%, and 92.6% for these models, respectively. While accuracy increased when trained on the reduced dataset for three of the employed models. This study provides valuable insights into the performance of various deep learning models and dataset versions, offering guidance on model selection and data preprocessing strategies for image classification tasks. © 2024-IOS Press. All rights reserved.

关键词： Fruits

software Test Data Management Based on Knowledge Graph

学校读者我要写书评

暂无评论

Informatica (Slovenia) 2024年第16期48卷 27-36页

作者： Gao, Li Qiu, Junlin Chen, Guanhua Faculty of Computer and Software Engineering Huaiyin Institute of Technology Huai’an223003 China

As software development models and methods mature, large-scale software systems emerge. However, a critical challenge remains: the lack of a comprehensive software test data management model that integrates basic data management with advanced knowledge reasoning. To address this issue, we developed a software test data management model based on knowledge graphs, enabling intelligent management and reasoning of software test data. The model incorporates an entity extraction model based on a feed-forward neural network, a knowledge graph integration method based on graph databases, and a knowledge reasoning submodule based on deep learning. To validate the effectiveness of our model, we evaluated the performance of each component individually. Our deep learning-based entity extraction model achieved an accuracy of 0.92, a recall of 0.88, and an F1 score of 0.90, significantly outperforming traditional methods such as regular expressions and dictionary-based approaches. Utilizing Cypher for graph database querying, our system provides accurate answers with a response time of 0.12 seconds, outperforming SQL and SPARQL-based querying methods. Furthermore, our approach excels in knowledge-based reasoning with an accuracy of 0.89 and site coverage of 0.81, surpassing both ontology-based and graph-based reasoning methods. These results highlight the enhanced construction, querying, and reasoning capabilities of our knowledge graph-based approach for managing software testing data. © 2024 Slovene Society Informatika. All rights reserved.

关键词： Knowledge graph

Image Guidance Encoder-Decoder Model in Image Captioning and Its Application

学校读者我要写书评

暂无评论

IAENG International Journal of computer Science 2024年第9期51卷 1385-1392页

作者： Yang, Zhen Zhou, Ziwei Wang, Chaoyang Xu, Liang School of Applied Technology University of Science and Technology Liaoning Anshan China School of Computer and Software Engineering University of Science and Technology Liaoning Anshan China School of Computer and Software Engineering University of Science and Technology Liaoning Anshan China

This paper introduces a new network model - the Image Guidance Encoder-Decoder Model (IG-ED), designed to enhance the efficiency of image captioning and improve predictive accuracy. IG-ED, a fusion of the convolutional network VGGNet-16 and the long short-term memory network (LSTM), is designed based on the encoder-decoder structure. The image captioning performance sees significant enhancements when leveraging the IG-ED network model. The network training process unfolds in a series of steps. Initially, the input image undergoes convolution via the VGGNet-16 network, producing a 512-dimensional vector. Concurrently, each word in the image's caption is encoded to generate a corresponding 512-dimensional vector consistent with the image feature dimension. These two vectors form the input for the decoding process. Subsequently, the vectors are fed into the redesigned fusion LSTM (F-LSTM) network at different time steps to gradually train the parameters of the IG-ED framework. The training process is completed by utilizing a loss function for determining convergence. Evaluation of the IG-ED model's performance is conducted using CIDEr and seven other evaluation metrics on the MSCOCO 2014 dataset. The results exhibit substantial improvements over the "Adaptive Attention Mode" network and "Neural Talk" network. Additionally, the parameter count of the IG-ED architecture is significantly reduced compared to the "Adaptive Attention Mode" network, leading to decreased computational resource requirements and enabling edge computing on the neural network. © (2024), (International Association of Engineers). All Rights Reserved.

关键词： Long short-term memory

An Intelligent Privacy Protection Scheme for Efficient Edge Computation Offloading in IoV

学校读者我要写书评

暂无评论

Chinese Journal of Electronics 2024年第4期33卷 910-919页

作者： Liang YAO Xiaolong XU Wanchun DOU Muhammad Bilal School of Software Nanjing University of Information Science and Technology State Key Laboratory for Novel Software Technology Nanjing University Department of Computer and Electronics Systems Engineering Hankuk University of Foreign Studies

As a pivotal enabler of intelligent transportation system(ITS), Internet of vehicles(Io V) has aroused extensive attention from academia and industry. The exponential growth of computation-intensive, latency-sensitive,and privacy-aware vehicular applications in Io V result in the transformation from cloud computing to edge computing,which enables tasks to be offloaded to edge nodes(ENs) closer to vehicles for efficient execution. In ITS environment,however, due to dynamic and stochastic computation offloading requests, it is challenging to efficiently orchestrate offloading decisions for application requirements. How to accomplish complex computation offloading of vehicles while ensuring data privacy remains challenging. In this paper, we propose an intelligent computation offloading with privacy protection scheme, named COPP. In particular, an Advanced Encryption Standard-based encryption method is utilized to implement privacy protection. Furthermore, an online offloading scheme is proposed to find optimal offloading policies. Finally, experimental results demonstrate that COPP significantly outperforms benchmark schemes in the performance of both delay and energy consumption.

关键词： Industries Privacy Energy consumption Transportation Computational efficiency Encryption Protection