Aircraft image classification has important application value in military, civil aviation and UAV surveillance. In recent years, YOLOv8 has performed well in many tasks as an advanced target detection model, but there...
详细信息
ISBN:
(纸本)9798400707032
Aircraft image classification has important application value in military, civil aviation and UAV surveillance. In recent years, YOLOv8 has performed well in many tasks as an advanced target detection model, but there is still room for improving its classification accuracy in specific tasks. In this paper, we enhance the YOLOv8 model for aircraft image classification by integrating the ConvNeXt module into its backbone network and incorporating the Coordinate Attention (CA) mechanism, thereby improving feature extraction and spatial perception capabilities. The ConvNeXt module significantly improves the efficiency of feature extraction through deep convolution operation and feature fusion strategy;The CA enhances the model's ability to capture spatial features by incorporating coordinate information effectively. In this paper, experiments are conducted on six types of aircraft image datasets, including in-flight UAVs, fighters, helicopters, missiles, airliners, and rockets. The experimental results show that the average classification accuracy of the improved model on the test set is improved by 3.5%. This indicates that the improvement strategy of introducing the ConvNeXt module and CA attention mechanism is effective in improving the classification performance of YOLOv8. The study in this paper not only verifies the effectiveness of the proposed method, but also provides new ideas and methods for future research on aircraft image classification.
作者:
Jahn, HDLR
Deutsch Zentrum Luft & Raumfahrt EV Inst Weltraumsensor & Planetenerkundung D-12489 Berlin Germany
A parallel-sequential unsupervised learning method for image smoothing is presented which can be implemented with a Multi Layer Neural Network. In contrast to older work of the author which has used 4-connectivity of ...
详细信息
ISBN:
(纸本)3540665994
A parallel-sequential unsupervised learning method for image smoothing is presented which can be implemented with a Multi Layer Neural Network. In contrast to older work of the author which has used 4-connectivity of processing elements (neurons) leading to a very big number of recursions now each neuron of network lever t+1 is connected with (2M+1)*(2M+1) neurons of layer t guaranteeing a significant reduction of network layers with the same good smoothing results.
Aiming at the problem that the structural complexity of the backbone architecture of denoising diffusion probabilistic model (DDPM) leads to the inefficiency of the training of the model, a lightweight image generatio...
详细信息
ISBN:
(纸本)9798400707032
Aiming at the problem that the structural complexity of the backbone architecture of denoising diffusion probabilistic model (DDPM) leads to the inefficiency of the training of the model, a lightweight image generation method (MobileDiT) based on MobileViT is proposed, which takes MobileViT block as the backbone architecture of DDPM, and improves the computational efficiency while keeping the image quality by combining lightweight convolution with Transformer. The conditional mechanism for introducing conditional information into the model is also improved by replacing the traditional layer normalization in the Transformer block with adaptive layer normalization and initialising each block as a constant function, allowing the model to process conditional information more efficiently. The experiment results show that the proposed model reduces the FID-50K to 2.15 and improves the IS value compared with models such as style Generative Adversarial Network (styleGAN) and Ablative Diffusion Model (ADM). This shows that the proposed model can not only improve the computational efficiency, but also enhance the quality of the generated images.
Regarding the issues that traditional water area extraction algorithms for remote sensing images require preset thresholds, are vulnerable to interference from mountain or vegetation shadows, and have poor robustness ...
详细信息
ISBN:
(纸本)9798400707032
Regarding the issues that traditional water area extraction algorithms for remote sensing images require preset thresholds, are vulnerable to interference from mountain or vegetation shadows, and have poor robustness at different spatiotemporal scales, a model based on the Res2Net-UNet is proposed to extract water area from remote sensing images. The model adopts the UNet's encoder-decoder architecture, deepens the model's level with the ResNet network, and performs multi-scale feature extraction and fine-grained feature expression by combining Res2Net. Moreover, the model integrates an attention mechanism to increase the weight of key features and fully utilize the spatial and channel information in the images. After that, the model adds a skip connection to better fuse the encoder and decoder to preserve the detailed features of the image. Experimental findings on the GF-1 dataset demonstrate that the Res2Net-UNet model significantly enhances the accuracy of water extraction in remote sensing images.
machinelearning and patternrecognition recently become a hot topic in computing world. This is due to the fast-growing of resources as well as techniques that make it easier to solve machinelearning and pattern rec...
详细信息
ISBN:
(纸本)9781728129303
machinelearning and patternrecognition recently become a hot topic in computing world. This is due to the fast-growing of resources as well as techniques that make it easier to solve machinelearning and patternrecognition problems. Problems that require machinelearning solutions may be very simple for humans but actually can be very complex for machines to solve them. Face recognition is amongst those problems. Almost all human can easily recognize others without require specific knowledge to do it, different from machines which require its. This paper discussed face recognition task using machinelearningstrategies which involved Kernel Principal Component Analysis (KPCA) and Support Vector machine (SVM) to identify person. KPCA extracted features from 2D image input and produced the important features of an image input. The extracted face features are recognized by SVM by classifying human face according to their stored identity in a database. SVM, which was basically a binary classifier worked by using one-against-one strategy to compare the face feature vector of a single testimage to the stored face image in a face image database. Experiment results on grayscale images with size 92x112 pixels gave 96.25% of accuracy rate. Hence, KPCA and SVM for face recognition is a robustmachinelearning method.
The automatic detection of person's identity is a very interesting issue both in social and industrial environments. In this paper a system for automatic identity recognition from face images, is presented. The pr...
详细信息
The automatic detection of person's identity is a very interesting issue both in social and industrial environments. In this paper a system for automatic identity recognition from face images, is presented. The proposed approach is based on an hybrid iconic approach, where a firstrecognition score is obtained by matching a person's face against an eigen-space obtained from an image ensemble of known individuals. The hypothesis is then verified by computing the correlation of the gray level histogram of the new face image with the histograms of the subjects in the database,A selective attentional mechanism is applied to reduce the amount of information needed to describe a database of human faces. This is accomplished both at the task level, by performing planned fixations, and at the sensor level, by adopting a space-variant sampling of the images, By using a space-variant image geometry, the size of the database is considerably reduced and consequently also the processing time for recognition. (C) 1997 Elsevier Science B.V.
An analog implementation of a deep machine-learning system for efficient feature extraction is presented in this work. It features online unsupervised trainability and non-volatile floating-gate analog storage. It uti...
详细信息
An analog implementation of a deep machine-learning system for efficient feature extraction is presented in this work. It features online unsupervised trainability and non-volatile floating-gate analog storage. It utilizes a massively parallel reconfigurable current-mode analog architecture to realize efficient computation, and leverages algorithm-level feedback to provide robustness to circuit imperfections in analog signal processing. A 3-layer, 7-node analog deep machine-learning engine was fabricated in a 0.13 mu m standard CMOS process, occupying 0.36 mm(2) active area. At a processing speed of 8300 input vectors per second, it consumes 11.4 mu W from the 3 V supply, achieving 1x10(12) operation per second per Watt of peak energy efficiency. Measurement demonstrates real-time cluster analysis, and feature extraction for patternrecognition with 8-fold dimension reduction with an accuracy comparable to the floating-point software simulation baseline.
Formal verification is playing an increasingly important role in the development of safety-critical systems, such as signal process systems. Through formal modeling and verification, the behavior and decision-making p...
详细信息
ISBN:
(纸本)9798400707032
Formal verification is playing an increasingly important role in the development of safety-critical systems, such as signal process systems. Through formal modeling and verification, the behavior and decision-making processes of safety-critical systems can be described more precisely, thereby ensuring the safety of the system. By utilizing formal models, comprehensive verification of these models can be conducted to identify and fix potential errors and defects in the early phase. However, as the scale of systems increases, the complexity of formal verification models correspondingly increases, potentially resulting in verification failures. To address this issue, this paper proposes a task-based requirements modeling and compositional verification approach for signal process systems. It reduces the complexity of formal verification models by only considering task related requirements. This approach not only effectively identifies task related errors but also ensures a certain level of verification efficiency. We also conduct a practical case study on an automated processing system to demonstrate the validity of our approach.
In previous studies on edge extraction, researchers have consistently sought to balance efficiency, quality, and stability. To address this, we propose an edge detection algorithm that leverages the self-attention mec...
详细信息
ISBN:
(纸本)9798400707032
In previous studies on edge extraction, researchers have consistently sought to balance efficiency, quality, and stability. To address this, we propose an edge detection algorithm that leverages the self-attention mechanism and post-processing methods to enhance the quality and stability of edge extraction in imageprocessing. In recent years, self-attention mechanisms have emerged as a core component in many cutting-edge models, particularly in the fields of Natural Language processing (NLP) and Computer Vision. They are capable of capturing long-range dependencies within the data. Consequently, we introduce the self-attention mechanism to enhance the capability of feature representation. By employing channel compression and scaling factors to stabilize the computation of attention scores, we ensure the reliability of the training process. Next, we combine Gaussian blurring with adaptive thresholding in the post-processingstage to reduce noise and detail interference, thereby improving edge detection accuracy. Experimental results on extensive benchmark datasets demonstrate that the proposed algorithm significantly outperforms existing methods in terms of edge detection accuracy and stability, particularly in the presence of complex backgrounds and noise. Additionally, the method exhibits strong computational efficiency, making it suitable for practical application. These advancements offer new insights and approaches for the development of edge detection technology.
In this paper, we have discussed different data pre-processing techniques and different machinelearning and deep learning models which are used for sentiment analysis. The dataset used was 'Restaurant Reviews'...
详细信息
暂无评论