To address the issue of low accuracy and poor robustness of perceptual learning in complex scenarios, a new method integrating computer vision and machine learning is adopted, that is, by applying deep neural networks...
详细信息
Deep learning (DL) algorithms are swiftly finding applications in computer vision and natural language processing. Nonetheless, they can also be employed for creating convincing deepfakes, which are challenging to dis...
详细信息
Our skin is the hefty organ that envelops and shields body. It prevents us from numerous fatal and non fatal diseases. It is observed that due to bacteria or other causes of infection, skin faces certain minor or life...
详细信息
Photograph registration is aligning or more excellent snapshots of the same bodily object or scene;it's widely used in many regions of imageprocessing and pc imaginative and prescient. Evolutionary algorithms are...
详细信息
Diabetic Macular Edema (DME) is a disease of the eye's retina and it's a major factor of causing vision problems and leads towards blindness if it is undiagnosed. Early detection of DME can prevent vision loss...
详细信息
In the current era, machinevision systems are being implemented widely in varied fields due to its key features, such as rapid processing, non-contact-based technology and in-situ measurements. This technology also p...
详细信息
In the current era, machinevision systems are being implemented widely in varied fields due to its key features, such as rapid processing, non-contact-based technology and in-situ measurements. This technology also possesses wide applications in the manufacturing sector. The surface texture properties of any machined component vary based on the manufacturing process, machining parameters, tool and machine conditions etc. As the surface texture of the machined components greatly influences the functional performance, it is vital to examine the surface characteristics. The surface texture of the machine component can be assessed by implementing a series of imageprocessing techniques on its speckle images. Speckle image refers to the randomly distributed granular pattern which is obtained when a rough or textured surface is illuminated using a laser beam. This paper focuses on estimating the orientation of the workpiece and examining the surface characteristics based on the post-processing of the speckle images. The hardened steel workpieces used in this investigation were ground by varying the process parameters and speckle images were obtained at 0°, 30°, 60° and 90° orientations. The shifted power spectral density of the ground sample images contains high-energy coefficients which mimic a line and its orientation varies based on the sample orientation. The Hough transform technique was applied to the binary image of shifted PSD to efficiently determine the orientation. Furthermore, correlations have been established between several surface texture characteristics and GLCM parameters with the surface roughness of ground samples.
Automatic detection of pineapples in complex agricultural environments poses several challenges. During harvesting, pineapples that are suitable for collection exhibit intricate scaly surface textures and a wide range...
详细信息
Automatic detection of pineapples in complex agricultural environments poses several challenges. During harvesting, pineapples that are suitable for collection exhibit intricate scaly surface textures and a wide range of colors. Moreover, occlusion by leaves and fluctuating lighting conditions further complicate the detection of pineapples. In this paper, we propose a high-precision lightweight detection network based on the improved You Only Look Once version 7-tiny (Pineapple-YOLO) for the robot vision system to realize realtime and accurate detection of pineapple. The Convolutional Block Attention Module (CBAM) is embedded into the backbone network to enhance the feature extraction capability, and the Content-Aware Reassembly of Features (CARAFE) is introduced to perform up-sampling operations and expand the receptive field. The Scylla Intersection over Union (SIoU) loss function is used instead of the Complete Intersection over Union (CIoU) loss function to consider the vector angles and redefine the penalty criteria. Finally, the K-means++ clustering algorithm is used to re-cluster the labels of the pineapple dataset and update the size of the anchor. The experimental results show that Pineapple-YOLO achieves a mAP@0.5 of 89.7%, which is a 6.15% improvement over the original YOLOv7-tiny, demonstrating its superiority over other mainstream target detection models. Furthermore, in diverse natural environments where the agricultural robot operates, the Pineapple-YOLO algorithm sustains a commendable 92% success rate in fruit picking, achieved within an average time of 12 s. This demonstrates the efficiency of the visual module in practical engineering applications.
object detection based on event vision has been a dynamically growing field in computer vision for the last 16 years. In this work, we create multiple channels from a single event camera and propose an event fusion me...
详细信息
ISBN:
(数字)9798331506520
ISBN:
(纸本)9798331506537
object detection based on event vision has been a dynamically growing field in computer vision for the last 16 years. In this work, we create multiple channels from a single event camera and propose an event fusion method (EFM) to enhance object detection in event-based vision systems. Each channel uses a different accumulation buffer to collect events from the event camera. We implement YOLOv7 for object detection, followed by a fusion algorithm. Our multichannel approach outperforms single-channel-based object detection by 0.7% in mean Average Precision (mAP) for detection overlapping ground truth with IOU = 0.5.
Graph Neural Networks (GNNs) are neural models that use message transmission between graph nodes to represent the dependency of graphs. Variants of Graph Neural Networks (GNNs), such as graph recurrent networks (GRN),...
详细信息
Graph Neural Networks (GNNs) are neural models that use message transmission between graph nodes to represent the dependency of graphs. Variants of Graph Neural Networks (GNNs), such as graph recurrent networks (GRN), graph attention networks (GAT), and graph convolutional networks (GCN), have shown remarkable results on a variety of deep learning tasks in recent years. In this study, we offer a generic design pipeline for GNN models, go over the variations of each part, classify the applications in an organized manner, and suggest four outstanding research issues. Dealing with graph data, which provides extensive connection information among pieces, is necessary for many learning tasks. A model that learns from graph inputs is required for modelling physics systems, learning molecular fingerprints, predicting protein interfaces, and identifying illnesses. Reasoning on extracted structures (such as the dependency trees of sentences and the scene graphs of photos) is an important research issue that also requires graph reasoning models in other domains, such as learning from non-structural data like texts and images. Graph Neural Networks (GNNs) are primarily designed for dealing with graph-structured data, where relationships between entities are modeled as edges in a graph. While GNNs are not traditionally applied to image classification problems, researchers have explored ways to leverage graph-based structures to enhance the performance of Convolutional Neural Networks (CNNs) in certain scenario. GNN have been increasingly applied to Natural Language processing (NLP) tasks, leveraging their ability to model structured data and capture relationships between elements in a graph. GNN are also applied for traffic related problems particularly in modeling and optimizing traffic flow, analyzing transportation networks, and addressing congestion issues. GNN can be used for traffic flow prediction, dynamic routing & navigation, Anomaly detection, public transport network
The IP005, 005 Special Research Committee on imageprocessing for NDI of the Japanese Society of Non-Destructive Inspection (JSNDI) was established in 1979 as the fifth traversal research infrastructure among X-ray, u...
详细信息
The IP005, 005 Special Research Committee on imageprocessing for NDI of the Japanese Society of Non-Destructive Inspection (JSNDI) was established in 1979 as the fifth traversal research infrastructure among X-ray, ultrasonic and other NDI technology groups. This committee was chaired by M Onoe, E Yamamoto, M Takagi and H Yamada. It became more active in the early summer of 1996 under renewed organisation and includes more than 70 members led by the authors. In this article, the current activities of IP005 are introduced and some of the objectives for new machinevisionapplications especially in NDT are discussed. Topics presented in recent meetings are introduced in section I, while sections 2 and 3 deal with special working groups for automatic weld inspection and welding via imageprocessing and for new NDI applications such as food inspection.
暂无评论