In various fields such as medical imaging, object detection, and video surveillance, multi view natural language query systems utilize image data to provide a more comprehensive perspective, allowing users to intuitiv...
In various fields such as medical imaging, object detection, and video surveillance, multi view natural language query systems utilize image data to provide a more comprehensive perspective, allowing users to intuitively query and obtain information. Due to the lack of a deep understanding of natural language in the hard coded matching rule method, the query results do not match the user's intentions and are difficult to meet practical application needs. Therefore, this article introduces machinevision algorithms for optimization and improvement. This article first discusses the system architecture of four modules: data input and preprocessing, visual feature extraction, natural language understanding and matching, and result generation and feedback. Then, the application of machinevision technology in the system was analyzed using two calculation formulas: grayscale conversion and binarization, and natural language processing technology was briefly discussed. Subsequently, a context understanding module was added to construct a multi view natural language query system based on machinevision. Finally, two sets of simulation experiments were conducted to draw the following conclusion: compared with traditional methods, the overall average improvement in image recognition accuracy indicators is about 14.3%, while the overall average improvement in response speed indicators is about 26.5%. This research system can effectively process images from different perspectives and match them with natural language queries.
The integration of human-robot interaction (HRI) technologies with industrial automation has become increasingly essential for enhancing productivity and safety in manufacturing environments. In this paper, we propose...
详细信息
ISBN:
(数字)9798331506520
ISBN:
(纸本)9798331506537
The integration of human-robot interaction (HRI) technologies with industrial automation has become increasingly essential for enhancing productivity and safety in manufacturing environments. In this paper, we propose a novel approach to address these challenges by using stereo vision and gesture control in cooperative robotic cells. Our system enables seamless authentication of operators and real-time verification of task execution, ensuring compliance with established protocols and safety *** features of our system include its gesture-based operation with gesture recognition algorithms, allowing operators to interact with robotic systems intuitively and efficiently. By leveraging stereo vision, our system accurately tracks the operators’ movement within the workspace, facilitating precise task execution and object *** present a detailed description of our system architecture, experimental configuration, and real-world performance assessment. Our results demonstrate the effectiveness and feasibility of our approach in enhancing operational efficiency, ensuring quality, and improving the overall user experience in industrial automation.
In recent years, with the development of science and technology and its application in agricultural production, China's agricultural science and technology have made great progress, the concept of "agricultur...
详细信息
ISBN:
(数字)9781665454575
ISBN:
(纸本)9781665454575
In recent years, with the development of science and technology and its application in agricultural production, China's agricultural science and technology have made great progress, the concept of "agricultural processing"has been mentioned, and the research on agricultural processing has also achieved fruitful results. The combination of intelligent and automated machine learning algorithms with traditional industries can promote productivity improvement on the one hand, and realize industrial upgrading and transformation on the other hand. However, in practical production applications, machine learning algorithms are restricted by factors such as high cost, and the research and application of machine learning algorithms are greatly limited. With the development of virtual simulation technology in the field of machine learning algorithm research, it provides a new way for machine learning algorithm technology to be applied to agricultural product processing. Therefore, the research on machine learning algorithms has become a trend. The development of machine learning algorithms will drive the development of modern agriculture. It is very necessary for the research of machine learning algorithms to learn algorithms. The image is converted into a data matrix, and a computer used to replace the human brain is used to analyze the image, while completing a vision related task. China's agricultural development is facing severe challenges such as rising costs, continuous deterioration of the ecological environment and high tension of resource conditions. With the deepening of machine learning algorithm research and the rapid development of machine learning algorithm technology, machine learning algorithm simulation technology, as a safe and economic experimental tool in the application of machine learning algorithm technology, plays a more and more important role. In order to make full use of the latest research results abroad and narrow the gap with the advanced level ab
With wide applications of machine learning algorithms, machine learning security has become a significant issue. The vulnerability to adversarial perturbations exists in most machine learning algorithms, including cut...
详细信息
ISBN:
(纸本)9783031064272;9783031064265
With wide applications of machine learning algorithms, machine learning security has become a significant issue. The vulnerability to adversarial perturbations exists in most machine learning algorithms, including cutting-edge deep neural networks. The standard adversarial perturbation defence techniques with adversarial training need to generate adversarial examples during the training process, which require high computational costs. This paper proposed a novel defence method using self-adaptive logit balancing and Gaussian noise boost training. This method can improve the robustness of deep neural networks without high computational cost and achieve competitive results compared with the adversarial training methods. Meanwhile, this defence method enables deep learning systems to have proactive and reactive defence during the operation. A sub-classifier is trained to determine whether the system is under attack and detect attack algorithms via the patterns of the Log-Softmax values. It can achieve high accuracy for detecting clean inputs and adversarial examples created by seven attack methods.
U-Net and its extensions have achieved significant success in medical image segmentation but face limitations in capturing global context and transferring cross-scale features. To address these challenges, we propose ...
详细信息
ISBN:
(数字)9798350355413
ISBN:
(纸本)9798350355420
U-Net and its extensions have achieved significant success in medical image segmentation but face limitations in capturing global context and transferring cross-scale features. To address these challenges, we propose a lightweight network, PCASNet, which integrates the Polarized Cross-scale Attention Module (PCAS Module) and the Dynamic Multi-scale Attention Gate (DMAG). The PCAS Module enhances global context modeling by fusing features from distant spatial positions, while the DMAG improves segmentation performance by filtering redundant features and emphasizing critical information, thereby strengthening global information modeling and feature selection capabilities. Experiments on breast and thyroid ultrasound datasets demonstrate that PCASNet outperforms traditional image segmentation algorithms in both accuracy and efficiency, highlighting its potential for applications in ultrasound medical imaging.
Pneumonia, an infectious lung condition caused by bacteria, viruses, or other microorganisms, significantly impacts both pediatric and geriatric populations. The COVID-19 pandemic has underscored the necessity for swi...
详细信息
ISBN:
(数字)9798350355413
ISBN:
(纸本)9798350355420
Pneumonia, an infectious lung condition caused by bacteria, viruses, or other microorganisms, significantly impacts both pediatric and geriatric populations. The COVID-19 pandemic has underscored the necessity for swift and accurate diagnostic tools, with over 760 million infections reported. This study investigates the role of artificial intelligence in lung X-ray image classification by comparing the performance of three Convolutional Neural Networks, two Transformer models, and the vision Mamba model using a standardized dataset. The study’s objectives were to (1) identify the most effective models, (2) examine the impact of initializing models using transfer learning, and (3) evaluate the vision Mamba model’s performance relative to conventional models. Using a Kaggle dataset containing four X-ray image categories—COVID-19, normal, lung opacity, and viral pneumonia—six models were trained and tested with and without transfer learning. Results indicated that the Swin Transformer model outperformed others, and transfer learning significantly enhanced model accuracy. Although the vision Mamba model exhibited lower accuracy, its computational efficiency highlights its potential utility. This study provides critical insights into artificial intelligence applications in lung disease diagnosis, supporting clinicians with accurate diagnostic tools.
In narrow overlap welding, serious defects in the weld will lead to band breakage accident, and the whole hot dip galvanizing unit will be shut down. Laser vision inspection hardware is used to collect real-time image...
详细信息
In narrow overlap welding, serious defects in the weld will lead to band breakage accident, and the whole hot dip galvanizing unit will be shut down. Laser vision inspection hardware is used to collect real-time image of weld surface, and image defect recognition and evaluation system is developed to real-time detect quality. Firstly, region division is implemented. In view of the characteristics of gray image such as large information, low contrast and blurred edge, an improved image segmentation algorithm is proposed by using image convolution to enhance edge features and combining with integral image, which can quickly and accurately extract the weld edge and divide the region, and the processing time can meet the real-time requirements. Then comparing the effect of traditional method and convolution neural network in identifying defects, VGG16 is used to identify weld defects. In order to ensure real-time performance, a two-stage weld defect recognition is proposed. First, the large defective area is identified, and then, the defect is accurately identified in the area. This method can quickly extract defect areas and complete weld defect classification. Experiments show that the accuracy can reach 97% and average running time takes 3.2 s, meeting the online detection requirements.
image segmentation plays a critical role in unlocking the mysteries of the universe, providing astronomers with a clearer perspective on celestial objects within complex astronomical images and data cubes. Manual segm...
详细信息
image segmentation plays a critical role in unlocking the mysteries of the universe, providing astronomers with a clearer perspective on celestial objects within complex astronomical images and data cubes. Manual segmentation, while traditional, is not only time-consuming but also susceptible to biases introduced by human intervention. As a result, automated segmentation methods have become essential for achieving robust and consistent results in astronomical studies. This review begins by summarizing traditional and classical segmentation methods widely used in astronomical tasks. Despite the significant improvements these methods have brought to segmentation outcomes, they fail to meet astronomers' expectations, requiring additional human correction, further intensifying the labor-intensive nature of the segmentation process. The review then focuses on the transformative impact of machine learning, particularly deep learning, on segmentation tasks in astronomy. It introduces state-of-the-art machine learning approaches, highlighting their applications and the remarkable advancements they bring to segmentation accuracy in both astronomical images and data cubes. As the field of machine learning continues to evolve rapidly, it is anticipated that astronomers will increasingly leverage these sophisticated techniques to enhance segmentation tasks in their research projects. In essence, this review serves as a comprehensive guide to the evolution of segmentation methods in astronomy, emphasizing the transition from classical approaches to cutting -edge machine learning methodologies. We encourage astronomers to embrace these advancements, fostering a more streamlined and accurate segmentation process that aligns with the ever-expanding frontiers of astronomical exploration.
Aiming at the problem of low recognition accuracy of digital display meter readings when the inspection robot performs inspection tasks, a YOLOv5-based digital display meter detection and recognition algorithm for dis...
详细信息
In the field of computer vision, 6D object detection and pose estimation are critical for applications such as robotics, augmented reality, and autonomous driving. Traditional methods often struggle with achieving hig...
详细信息
ISBN:
(数字)9798350355413
ISBN:
(纸本)9798350355420
In the field of computer vision, 6D object detection and pose estimation are critical for applications such as robotics, augmented reality, and autonomous driving. Traditional methods often struggle with achieving high accuracy in both object detection and precise pose estimation simultaneously. This study proposes an improved 6D object detection and pose estimation pipeline based on the existing 6D-VNet framework, enhanced by integrating a Hybrid Task Cascade (HTC) and a High-Resolution Network (HRNet) backbone. By leveraging the strengths of HTC’s multi-stage refinement process and HRNet’s ability to maintain high-resolution representations, our approach significantly improves detection accuracy and pose estimation precision. Furthermore, we introduce advanced post-processing techniques and a novel model integration strategy that collectively contribute to superior performance on public and private benchmarks. Our method demonstrates substantial improvements over state-of-the-art models, making it a valuable contribution to the domain of 6D object detection and pose estimation.
暂无评论