Printed circuit boards (PCBs) are an essential component of electronic products, and detecting solder joint defects is critical in the PCB production process. machinevision technology allows detection with high effic...
详细信息
ISBN:
(纸本)9781510666313;9781510666320
Printed circuit boards (PCBs) are an essential component of electronic products, and detecting solder joint defects is critical in the PCB production process. machinevision technology allows detection with high efficiency and cost-effectiveness. Therefore, this paper summarizes the basic principles of imageprocessing-based and machine learning-based methods for defect detection and compares the advantages and disadvantages of both methods with relevant performance evaluation indicators. Finally, this paper contains a summary and an outlook.
作者:
Sahoo, Santosh KumarGodi, Rakesh Kumar
Department of Electronics and Instrumentation Engineering Hyderabad India
Department of Information Technology Hyderabad India
The research work dedicated for the object identification difficulty resolved by the techniques of principal component analysis (PCA) and linear discriminant analysis (LDA) along with robotic machinevision system. Th...
详细信息
machinevision is a technology and method used to provide automated image-driven analysis in applications such as inspection, process control, and guidance, and is very popular in industries nowadays. Computer/machine...
详细信息
machinevision is a technology and method used to provide automated image-driven analysis in applications such as inspection, process control, and guidance, and is very popular in industries nowadays. Computer/machinevision has been extensively developed and used in production to achieve precise automatic control. This paper presented an imageprocessing approach, a subset of machinevision, for the visual inspection system of the Clutch Friction Disc (CFD) produced for 2 wheelers. imageprocessing is used to inspect different parts of the CFD. After previous operations of production, a part enters the inspection system, where the geometry and size of the part are inspected, and then imageprocessing technology is used to decide to accept or reject the product. This paper presented the work constructed using a python program with OpenCV which aims to identify the major defects in clutch friction plates, by using different imageprocessing techniques. With the proposed approach decision can be made automatically that whether the processed part will be accepted or rejected and then will be identified as "Ok tested" and "Faulty" pieces. Copyright (C) 2022 Elsevier Ltd. All rights reserved.
image classification is one of the most fundamental capabilities of machinevision intelligence. In this work, we revisit the image classification task using visually-grounded language models (VLMs) such as GPT-4V and...
In this paper, an approach is presented for enhancing medical images using conditional generative adversarial networks. The generator employs the Nested U-Net Architecture (UNet++) in conjunction with the Global Atten...
详细信息
Circle extraction is a fundamental task in computer vision, which is widely applied in automated inspection and assembly. However, existing circle detection algorithms often exhibit limitations in terms of noise resis...
详细信息
vision-language models (VLMs) are impactful in part because they can be applied to a variety of visual understanding tasks in a zero-shot fashion, without any fine-tuning. We study generative VLMs that are trained for...
详细信息
vision-language models (VLMs) are impactful in part because they can be applied to a variety of visual understanding tasks in a zero-shot fashion, without any fine-tuning. We study generative VLMs that are trained for next-word generation given an image. We explore their zero-shot performance on the illustrative task of image-text retrieval across nine popular vision-language benchmarks. Our first observation is that they can be repurposed for discriminative tasks (such as image-text retrieval) by simply computing the match score of generating a particular text string given an image. We call this probabilistic score the Visual Generative Pre-Training Score (VisualGPTScore). While the VisualGPTScore produces near-perfect accuracy on some retrieval benchmarks, it yields poor accuracy on others. We analyze this behavior through a probabilistic lens, pointing out that some benchmarks inadvertently capture unnatural language distributions by creating adversarial but unlikely text captions. In fact, we demonstrate that even a "blind" language model that ignores any image evidence can sometimes outperform all prior art, reminiscent of similar challenges faced by the visual-question answering (VQA) community many years ago. We derive a probabilistic post-processing scheme that controls for the amount of linguistic bias in generative VLMs at test time without having to retrain or fine-tune the model. We show that the VisualGPTScore, when appropriately debiased, is a strong zero-shot baseline for vision-language understanding, oftentimes producing state-of-the-art accuracy. Copyright 2024 by the author(s)
Latterly, we have seen an immense increase in the amount of research, development and investment dedicated to the field of Autonomous Vehicle Technology or in common terms - Self-Driving Vehicles. While modern autonom...
详细信息
Sensing 3D objects is critical when 2D object recognition is not accessible. A robot pre-trained on a large point-cloud dataset will encounter unseen classes of 3D objects after deploying it. Therefore, the robot shou...
详细信息
ISBN:
(纸本)9798350318920;9798350318937
Sensing 3D objects is critical when 2D object recognition is not accessible. A robot pre-trained on a large point-cloud dataset will encounter unseen classes of 3D objects after deploying it. Therefore, the robot should be able to learn continuously in real-world scenarios. Few-shot class-incremental learning (FSCIL) requires the model to learn from few-shot new examples continually and not forget past classes. However, there is an implicit but strong assumption in the FSCIL that the distribution of the base and incremental classes is the same. In this paper, we focus on cross-domain FSCIL for point-cloud recognition. We decompose the catastrophic forgetting into base class forgetting and incremental class forgetting and alleviate them separately. We utilize the base model to discriminate base samples and new samples by treating base samples as in-distribution samples, and new objects as out-of-distribution samples. We retain the base model to avoid catastrophic forgetting of base classes and train an extra domain-specific module for all new samples to adapt to new classes. At inference, we first discriminate whether the sample belongs to the base class or the new class. Once classified at the model level, test samples are then passed to the corresponding model for class-level classification. To better mitigate the forgetting of new classes, we adopt the soft label and hard label replay together. Extensive experiments on synthetic-to-real incremental 3D datasets show that our proposed method can balance the performance between the base and new objects and outperforms the previous state-of-the-art methods.
Segmentation of nuclei plays a crucial role in the field of medical imageprocessing, particularly in the identification and examination of diseasesAlthough the U-Net ++ has shown good results in this task, there are ...
详细信息
暂无评论