In this paper, on the basis of a brief description of deep learning theory, we study the use of image information recognition technology to detect image data, construct training models, and improve the accuracy of ima...
详细信息
In this article, a new approach is suggested in remote-sensing images registration. In the suggested approach, first, the features extraction process is done based on proposed redundant keypoint elimination method syn...
详细信息
As a key factor in the milling process, the wear status of the milling cutter has a significant impact on the machining quality of the workpiece. To detect wear on a milling machine efficiently and precisely, this pap...
详细信息
ISBN:
(纸本)9798350363272;9798350363265
As a key factor in the milling process, the wear status of the milling cutter has a significant impact on the machining quality of the workpiece. To detect wear on a milling machine efficiently and precisely, this paper presents the development of a milling machine wear detection system based on machinevision and digital imageprocessing. The system including link mechanisms and industrial camera is designed for auxiliary localization and collection of on-machineimages of milling cutter status. The image preprocessing method based on automatic threshold segmentation and Canny edge detection operator is proposed to identify the edge of cutter wear. The Maximum connected domains algorithm is used to screen the wear area of the milling cutter and the amount of wear is obtained based on a calibrated scaling method. Experimental results show that the proposed system is suitable for industrial use due to its rapid detection speed and strong recognition accuracy, which are desirable for engineering applications.
Industry 4.0, the digitalization of manufacturing promises to lead to lowered cost, efficient processes and even discovery of new business models. However, many of the enterprises have huge investments in legacy machi...
详细信息
ISBN:
(纸本)9783031702587;9783031702594
Industry 4.0, the digitalization of manufacturing promises to lead to lowered cost, efficient processes and even discovery of new business models. However, many of the enterprises have huge investments in legacy machines which are not 'smart'. In this study, we thus designed a cost-efficient solution to retrofit a legacy conveyor belt-based cutlery washing machine with a commodity web camera. We then applied computer vision (using both traditional imageprocessing and deep learning techniques) to infer the speed and utilization of the machine. We detailed the algorithms that we designed for computing both speed and utilization. With the existing operational constraints of our client, frequent re-training of the deep learning model for object detection is not feasible. Thus, we compared the generalizability of the two techniques across 'unseen' cutleries and found traditional imageprocessing to be generalizable across 'unseen' images. Our proposed final solution uses traditional imageprocessing for computation of utilization but a hybrid of traditional imageprocessing and deep learning model for speed computation as it is more reliable. Our client has implemented our proposed solution for one conveyor belt-based cutlery washing machine and will be planning to scale this to multiple conveyor belt-based cutlery washing machines.
With the advance of deep learning in the BigData era, image/video coding for machines (VCM) as called for proposals by the moving picture experts group (MPEG) now becomes the pivotal technique for extensive intelligen...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
With the advance of deep learning in the BigData era, image/video coding for machines (VCM) as called for proposals by the moving picture experts group (MPEG) now becomes the pivotal technique for extensive intelligent vision tasks. However, existing VCM methods typically focus on compressing features independently at each scale, ignoring the redundancy of features across multiple scales. This paper thus introduces a simple yet effective architecture called hybrid single input and multiple output (H-SIMO) for VCM, which can significantly reduce the redundancy across scales of features. More specifically, as the pyramid structure is commonly employed for localising multi-scale objects, our HSIMO method proposes to compress all features by inputting a single-scale feature while retaining the ability to decompress all the features. Moreover, an entropy model is seamlessly integrated into the training process to efficiently reduce the statistical redundancy of features. During the testing phase, the hybrid coding method, in conjunction with the versatile video coding (VVC), is employed to compress the features from both images and videos. We comprehensively evaluate the performance of our H-SIMO method in two standard machinevision tasks: object detection and instance segmentation, in which the experimental results verify the superior performances of our H-SIMO method.
During the development of machinevision setups for industrial applications, namely automatic visual inspection systems, the type of required lighting and correct illumination position and intensity are often not know...
详细信息
ISBN:
(纸本)9783031662706;9783031662713
During the development of machinevision setups for industrial applications, namely automatic visual inspection systems, the type of required lighting and correct illumination position and intensity are often not known in advance. In order to solve a specific machinevision task, an engineer needs to propose and implement a suitable imageprocessing algorithm and design the accompanying lighting system. Testing different illumination concepts, especially "in the field", using off-the-shelf lighting systems is time consuming. The paper presents a custom-made, cost effective, transportable and flexible system which may be used for testing different illumination techniques and simultaneously control multiple lighting modules. In addition to the hardware part, the developed software used for control the system is presented in the paper. The proposed device usefulness is demonstrated by solving a suitable multi-light digital imageprocessing task.
machinevision is widely used in the engineering field, especially in the intelligent monitoring, safety warning and information recognition scenarios. In this study, the novel civil engineering construction site safe...
详细信息
Today, image and video data is not only viewed by humans, but also automatically analyzed by computer vision algorithms. However, current coding standards are optimized for human perception. Emerging from this, resear...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
Today, image and video data is not only viewed by humans, but also automatically analyzed by computer vision algorithms. However, current coding standards are optimized for human perception. Emerging from this, research on video coding for machines tries to develop coding methods designed for machines as information sink. Since many of these algorithms are based on neural networks, most proposals for video coding for machines build upon neural compression. So far, optimizing the compression by applying the task loss of the analysis network, for which ground truth data is needed, is achieving the best coding performance. But ground truth data is difficult to obtain and thus an optimization without ground truth is preferred. In this paper, we present an annotation-free optimization strategy for video coding for machines. We measure the distortion by calculating the task loss of the analysis network. Therefore, the predictions on the compressed image are compared with the predictions on the original image, instead of the ground truth data. Our results show that this strategy can even outperform training with ground truth data with rate savings of up to 7.5 %. By using the non-annotated training data, the rate gains can be further increased up to 8.2 %.
With the development and widespread application of machinevision technology, quality inspection based on machinevision has also become a popular research area in the field of statistical quality control. However, re...
详细信息
Frequency response analysis (FRA) is one of the most efficient methods that can diagnose the mechanical faults of power transformers. Digital imageprocessing of the FRA polar plot characteristics has been recently pr...
详细信息
暂无评论