With the continuous improvement of the application of artificial intelligence theory and intelligent hardware processing capabilities, the application of machinevision and unmanned technology is becoming more and mor...
详细信息
In order to improve the special training effect of sports movements, this paper combines machinevisionimage coarse-grained technology to build a special training system to improve the special training effect of spor...
详细信息
Computer vision is to measure and judge by machine instead of man, and convert the captured target scene into image signal through camera device. Transmitting it to the imageprocessing system and converting it into a...
详细信息
With the rapid progress and innovation of computer vision, imageprocessing and other related disciplines and theoretical methods, machinevision has been more and more widely used. In the process of welding productio...
详细信息
The limitations of a machine learning model can often be traced back to the existence of under-represented regions in the feature space of the training data. Data augmentation is a common technique that has been used ...
详细信息
ISBN:
(纸本)9781728198354
The limitations of a machine learning model can often be traced back to the existence of under-represented regions in the feature space of the training data. Data augmentation is a common technique that has been used to inflate training datasets with new samples to improve the model performance. However, these techniques usually focus on expanding the data in size and do not necessarily aim to cover the under-represented regions of the feature space. In this paper, we propose an Attention-guided Data Augmentation technique for vision Transformers (ADA-ViT). Our framework exploits the attention mechanism in vision transformers to extract visual concepts related to misclassified samples. The retrieved concepts describe under-represented regions in the training dataset that contributed to the misclassifications. We leverage this information to guide our data augmentation process by identifying new samples and using them to augment the training data. We hypothesize that this focused data augmentation populates under-represented regions and improves the model's accuracy. We evaluate our framework on the CUB dataset and CUB-Families. Our experiments show that ADA-ViT outperforms state-of-the-art data augmentation strategies, and can improve the accuracy of a transformer by an average margin of 2.5% on the CUB dataset and 3.3% on CUB-Families.
Zero-shot learning is a popular strategy for low-light image enhancement, as it allows convolutional neural networks (CNNs) to be trained without paired data. However, existing zero-shot learning methods often lead to...
详细信息
Fluidized bed granulation is a unit operation widely used in the pharmaceutical, chemical and food processing industries. It is a manufacturing technology that by suspending lose powders using hot air and transforms t...
详细信息
When the color of moving object is close to the background, the accuracy of moving object recognition is affected. So the method of moving object recognition based on machinevision is designed. In order to reduce the...
详细信息
ISBN:
(纸本)9783031288661;9783031288678
When the color of moving object is close to the background, the accuracy of moving object recognition is affected. So the method of moving object recognition based on machinevision is designed. In order to reduce the distortion of image edge position, the moving object is calibrated and corrected by vision. In order to reduce the influence of noise to a controllable range, the full information mobile monitoring image is enhanced to preserve the image details. The edge features obtained from view and template are calculated by moment, and the similarity is obtained. Then the contour feature of moving monitoring target is extracted based on machinevision. Segmentation of the background region, according to the moving object trajectory center point information such as speed, direction and so on to determine whether the trajectory is abnormal events. The proposed method is tested on INRIA dataset and Vehicle Reld dataset, and the results show that the proposed method can improve the accuracy and recall rate and has good detection performance.
Human Action Recognition is one of the most applied research directions in the field of Computer vision, which is widely used in human-computer interaction, Augmented Reality (AR) technology, security monitoring, and ...
详细信息
ISBN:
(纸本)9798350350920
Human Action Recognition is one of the most applied research directions in the field of Computer vision, which is widely used in human-computer interaction, Augmented Reality (AR) technology, security monitoring, and other scenarios. However, due to the complexity of human action gestures, existing Human Action Recognition methods have certain deficiencies in dealing with variable human gestures and action information, and the accuracy needs to be improved. To improve the accuracy, We propose a multi-dimensional network model based on SC-LSTM(Skip-Connection + LSTM). First, a Temporal Feature Extraction Module is designed based on SC-LSTM, and a Spatial Feature Extraction Module is designed based on CNN and Multi-Attention Mechanism to extract potential human action features from both temporal and spatial dimensions, respectively. Then, a separate SC-LSTM classification network is utilized to process these spatio-temporal features to obtain the final HAR results. The experimental results show that compared to other algorithms, the present model can more fully utilize the information in the temporal dimension, and thus performs better in terms of HAR accuracy.
Simplifying the development method of embedded vision platform can improve the application value of embedded machinevision platform, but there is a problem that the development effect is not satisfactory. Code progra...
详细信息
暂无评论