检索结果-内蒙古大学图书馆

Person Re-Identification Algorithm Based on Improved ResNet

学校读者我要写书评

暂无评论

IAENG International Journal of Applied Mathematics 2024年第5期54卷 894-901页

作者： Shen, Wenrui Wang, Zhifeng School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan114051 China School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan114051 China

Person Re-Identification falls within the scope of computer vision, acting a technique to ascertain the presence of a specified pedestrian within a video or image library. The related research is of great significance in real-world environments such as criminal investigation and statistical analysis of commercial foot traffic and has received extensive attention from the academic community. However, traditional methods such as manual extraction cannot adapt to large-scale data volumes, and deep learning-based methods at this stage suffer from interference in complex environments such as similar costumes, perspective changes, and occlusion. Therefore, in this paper, we investigate the above problems. Firstly, we expand the dataset by introducing random erasure-based preprocessing of pedestrian images to enhancing the robustness and generalization capability of neural networks. Secondly, a composite attention mechanism is introduced after the network residual layer to enhance the spatial information capability and feature expression. Finally, the union loss composed of Circle Loss, Ternary Loss, and Cross Entropy Loss was chosen for network training in the loss optimization phase. Findings from the experiments reveal that the improved method proposed in this experiment achieves 96.0%Rank-1 and 88.3%mAP in Market1501, which reflects the validity of the approach proposed in this manuscript, and provides valuable reference suggestions for Person Re-Identification related research. © (2024), (International Association of Engineers). All Rights Reserved.

关键词： Image enhancement

Small Object Detection in Aerial Drone Imagery based on YOLOv8

学校读者我要写书评

暂无评论

IAENG International Journal of computer Science 2024年第9期51卷 1346-1354页

作者： Pan, Junyu Zhang, Yujun School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan114051 China School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan114051 China

In recent years, the utilization of unmanned aerial vehicles (UAVs) for aerial target detection has gained significant attention due to their high-altitude perspective and maneuverability, which offer novel opportunities and tremendous potential in this field. However, detecting targets in UAV aerial images remains highly challenging due to the presence of numerous small targets with limited feature information, as well as issues like target occlusion and complex backgrounds that severely impact detection accuracy. To address these challenges, we propose a detection model called BDC-YOLOv8 that aims to enhance accuracy for small targets while minimizing computational complexity. Specifically, we augment the YOLOv8 architecture by incorporating a dedicated detection head tailored for small targets to improve performance when encountering such objects. Additionally, we restructure the neck network of the model to better extract and fuse feature information from targets with significant scale variations. Furthermore, we introduce the concept of DynamicHead to enhance the detection head by incorporating various attention mechanisms suitable for our task ahead of the original detection head, thereby enhancing the model’s capability to detect objects of different scales and complex backgrounds. Moreover, we introduce Convolutional Block Attention Module (CBAM) to identify regions of interest in densely populated areas. Extensive experiments conducted on the VisDrone2019 dataset yield promising results where our model achieves a mean Average Precision (mAP) score of 38% and an AP50 score of 59.6%. Compared to the original YOLOV8 model, improvements are observed with increases in mAP by 2.5% and AP50 by 3.7%, respectively. Notably, our model demonstrates a significant enhancement in detecting small targets with an increase in APs evaluation metric by 4.1%. © (2024), (International Association of Engineers). All Rights Reserved.

关键词： Aerial photography

Improved Infrared Road Object Detection Algorithm Based on Attention Mechanism in YOLOv8

学校读者我要写书评

暂无评论

IAENG International Journal of computer Science 2024年第6期51卷 673-680页

作者： Luo, Zilong Tian, Ying School of Computer Science and Software Engineering University of Science and Technology Liaoning Liaoning114051 China School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan114051 China

In Currently, research in the field of infrared road object detection is primarily focused on enhancing model performance and robustness to address the challenges posed by complex real-world driving scenarios. In response to these challenges, this paper proposes an infrared road object detection algorithm based on an attention mechanism. By incorporating the CPCA module, which utilizes attention mechanisms, into the YOLOv8s model, the algorithm enhances the model’s focus on unobstructed areas and highly illuminated sections, extracting crucial feature information to improve both accuracy and robustness. Additionally, the original model’s downsampling layer is replaced with the Context Grided Network Block Downsampling (CGBD) module, which not only preserves feature edge information but also effectively handles local and contextual features, thereby enhancing the overall feature capturing capabilities of the model. To address the issue of equal aspect ratios in the model’s original loss function, the proposed algorithm adopts the superior Weighted Intersection over Union (WIoU). This not only addresses the shortcomings of the original loss function (CIoU) but also demonstrates increased sensitivity in classification tasks. Experimental results show that the improved algorithm, compared to YOLOv8s, achieves a 1.4% increase in mean average precision (mAP), along with notable improvements in precision and recall. Furthermore, when compared to mainstream model algorithms, the enhanced model significantly outperforms in infrared road object detection tasks, providing validation of its effectiveness. © (2024), (International Association of Engineers). All rights reserved.

关键词： Object detection

A Multi-Resolution Feature Fusion Method for Pedestrian Re-identification

学校读者我要写书评

暂无评论

IAENG International Journal of computer Science 2024年第9期51卷 1393-1403页

作者： Qin, Haitian Xu, Yang Chen, Xupeng School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan114051 China School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan114051 China

Pedestrian re-identification technology enables accurate identification of individuals and is widely used in modern intelligent video surveillance systems to aid law enforcement, including criminal apprehension and locating missing persons. However, variations in lighting, background, resolution, and other imaging conditions captured by different cameras create significant challenges in pedestrian feature extraction, often leading to poor recognition accuracy. To overcome these challenges, this paper presents a Multi-resolution Feature Fusion (MRFF) method for pedestrian re-identification, based on the Pedestrian Re-identification Relational Network (RNFPR). This approach incorporates the Coordinate Attention (CA) module into the DenseNet161 network to enhance feature extraction capabilities. Improving the discriminative and recognition accuracy of features requires learning and fusing pedestrian features from multiple low-resolution images. This process enhances the expressive power of feature maps, ultimately improving pedestrian recognition performance. Additionally, this method introduces a multi-resolution feature fusion module that segments and integrates multi-resolution features from image data. This enables the model to effectively combine feature information from various resolution levels, resulting in a more comprehensive feature representation. Experimental results show that the MRFF method achieves a 1.3% increase in mean Average Precision (mAP) and a 1.0% improvement in Rank-1 accuracy on the Market1501 dataset. For the DukeMTMC-reID dataset, it provides a 0.2% increase in mAP and a 0.7% enhancement in Rank-1 accuracy. Consequently, the MRFF approach results in an overall mAP increase of 0.7% on the DukeMTMC-ReID dataset, significantly improving pedestrian gender re-identification accuracy. © (2024), (International Association of Engineers). All Rights Reserved.

关键词： Security systems

Research on Social Recommendation Algorithm Based on PSO_KFCM Clustering and CBAM Attention Mechanism of Graph Neural Networks

学校读者我要写书评

暂无评论

IAENG International Journal of computer Science 2024年第8期51卷 936-948页

作者： Teng, Yue Yang, Kai School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan114051 China School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan114051 China

In today's society, people increasingly need information acquisition due to the rapid development of science and technology and the consequent increase in available data. However, finding the information users need from this vast data has become challenging. To tackle this problem, recommending preferred information to users is becoming increasingly important. However, accurately recommending information by analyzing existing models such as GraphRec is still a challenging problem. A method called PSO_KFCM is proposed in this paper to solve this problem better. The technique combines Particle Swarm Optimization (PSO) with hybrid optimization and the kernel fuzzy C-means clustering technique to cluster similar recommendation data into one class. This way, the complexity and randomness of the recommendation data are reduced. It improves the speed and accuracy of the model prediction, which lays a solid foundation for the subsequent recommendation. Various factors will impact the recommendation process, and channel and spatial characteristics are essential. CBAM attention is added to the original attention mechanism to fully utilize these features in the recommendation data to enhance its performance. Furthermore, this paper proposes a social recommendation prediction method that combines CBAM attention and PSO_KFCM clustering and introduces a new social model called TTYGNN. The TTYGNN model optimizes the recommendation effect while maintaining the original advantages, enabling users to obtain the required information more quickly and accurately. To verify the effectiveness and practicality of the proposed model, extensive experimental comparisons were conducted on two widely used datasets. The results show that the TTYGNN model outperforms similar methods in all indicators, proving its superiority in information recommendation. © (2024), (International Association of Engineers). All Rights Reserved.

关键词： Fuzzy clustering

Large Kernel Disassembling Attention Mechanism for Remote Sensing Object Detection

学校读者我要写书评

暂无评论

IAENG International Journal of computer Science 2024年第9期51卷 1367-1373页

作者： Geng, Y.X. Wang, L. Wang, Y.G. School of Computer Science and Software Engineering University of Science and Technology LiaoNing Anshan114051 China School of Computer Science and Software Engineering University of Science and Technology LiaoNing Anshan114051 China

In recent years, remote sensing object detection has become a research hotspot in computer vision tasks. However, previous approaches for remote sensing object detection often overlook the rich contextual information in images, which is crucial for accurately detecting occluded or interconnected objects using convolutional neural networks. To capture this contextual information, we propose a method called the Large Kernel Disassembling (LKD) Attention Mechanism. LKD breaks down large convolutional kernels to provide a larger receptive field to the convolutional neural networks, enabling them to capture rich contextual information in remote sensing images and enhance their performance. We employ an adaptive channel submodule and a deep convolutional spatial submodule. The adaptive channel submodule helps the network learn relationships between different channels, while the deep convolutional spatial submodule aids in extracting rich spatial features. We evaluate the proposed attention mechanism on the DIOR dataset and compare it with several recent attention mechanisms on the SSDD dataset. Experimental results demonstrate the superiority of LKD in terms of performance over other methods, validating the effectiveness of the Large Kernel Disassembling attention mechanism in remote sensing object detection tasks. © (2024), (International Association of Engineers). All Rights Reserved.

关键词： Convolutional neural networks

Research on Steel Surface Defects Detection Algorithms by YOLOv8 Based on Attention Mechanism

学校读者我要写书评

暂无评论

IAENG International Journal of computer Science 2024年第9期51卷 1309-1315页

作者： Gao, ShiQuan Tian, Ying School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan114051 China School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan114051 China

As deep learning advances, neural network technologies are increasingly penetrating the field of steel surface defect detection. To tackle the challenges of low accuracy and inadequate quality, we introduce CMS-YOLOv8s, a defect detection algorithm engineered for superior precision and efficiency. Initially, integrate the CBAM to enhance the network's focus on relevant information, allowing it to leverage multi-dimensional data like spatial and channel dimensions fully. Next, introduce the SPPFCSPC module to fuse feature information from different scales, expanding the model's receptive field and enhancing its feature extraction abilities. Finally, incorporate a small object detection head to boost the model's capability in identifying tiny targets, strengthening its ability to detect targets across different scales. Experimental findings show that CMS-YOLOv8s attains a detection accuracy of 70.4%, registering a significant 3.3% enhancement compared to the original YOLOv8s. This advancement distinctly improves defect detection accuracy across all categories. © (2024), (International Association of Engineers). All Rights Reserved.

关键词： Surface defects

Surface Defect Detection Algorithm for Strip Steel Based on Improved YOLOv7 Model

学校读者我要写书评

暂无评论

IAENG International Journal of computer Science 2024年第3期51卷 308-316页

作者： Wang, Zhu Liu, Weisheng the College of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan China the College of Computer Science and Software Engineering the University of Science and Technology Liaoning CO Anshan114051 China

This research proposes a refined deep learning framework aimed at boosting the precision and efficacy of detecting surface imperfections in strip steel. This method integrates enhancement and simplification techniques inspired by the You Only Look Once version 7 (YOLOv7) detection method, resulting in significant enhancements in the model's accuracy, speed, and flexibility. The substitution of ELAN with Bottleneck Transformer 3 (BoT3) leads to improved accuracy and mean Average Precision (mAP) values, while also introducing a more lightweight network architecture. The incorporation of the Involution mechanism enhances the model's feature extraction capabilities, thereby improving its ability to recognize small targets through the utilization of local perceptual fields. The ASPP_CA architecture leverages a multi-scale feature fusion technique along with an attention mechanism to reduce model parameters and enhance inference speed. Furthermore, it extends the model's receptive field, allowing it to capture additional visual information. The enhanced algorithm, denoted as YOLOv7-IBA, demonstrates empirical results that underscore its superiority over the three current state-of-the-art detection techniques in identifying surface flaws on strip steel. The accuracy has been improved to 82.9%, representing a significant increase of 7.2% compared to the previous performance. Furthermore, the mean mAP value has experienced a 3.2% increase, reaching a total of 79.9%. Moreover, there has been a remarkable 8.8% improvement in efficiency. The adoption of this approach holds the potential to enhance both the precision and productivity of strip surface flaw detection, while also providing valuable methodological support for the advancement of other related disciplines. © (2024), (International Association of Engineers). All Rights Reserved.

关键词： Object detection

Underwater Target Detection Based on Improved YOLOv7

学校读者我要写书评

暂无评论

IAENG International Journal of computer Science 2024年第4期51卷 422-429页

作者： Fu, Junshang Tian, Ying School of Computer Science and Software Engineering University of Science and Technology Liaoning Liaoning 114051 China School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan 114051 China

Underwater target detection is an important part of marine exploration. However, in complex underwater environments due to factors like light absorption and scattering, as well as variations in water quality and clarity. These challenges result in inaccurate target feature extraction, sluggish detection speeds, and insufficient robustness in the detection methods. In order to address these issues, an enhanced YOLOv7 network (YOLOv7-SPNW-D) is proposed for underwater target detection in this study. The SPD-MP module structure replaces the MP module in the neck network to capture small targets and enhance detection accuracy. A novel NWD loss function is employed to facilitate smoother extraction of small target features. This enhances feature extraction and improves network inference speed. Additionally, incorporating a small target detection module enables the providing of more comprehensive small target information within a deep feature map. This, in turn, improves the capture of small target features in complex backgrounds, and avoids feature loss and enhancing model exactness. Through ablation experiments on the URPC dataset, it is shown that the improved YOLOv7-SPNW-D algorithm performs better than the original YOLOv7 algorithm, with the mAP50 value increased to 87.0%, proving the effectiveness of this method. In conclusion, the improved YOLOv7-SPNW-D model is more suitable for underwater marine organism target detection. © (2024) International Association of Engineers.

关键词： Water quality