To achieve efficient compression for both human vision and machine perception, scalable coding methods have been proposed in recent years. However, existing methods do not fully eliminate the redundancy between featur...
详细信息
ISBN:
(纸本)9798331529543;9798331529550
To achieve efficient compression for both human vision and machine perception, scalable coding methods have been proposed in recent years. However, existing methods do not fully eliminate the redundancy between features corresponding to different tasks, resulting in suboptimal coding performance. In this paper, we propose a frequency-aware hierarchical image compression framework designed for humans and machines. Specifically, we investigate task relationships from a frequency perspective, utilizing only HF information for machinevision tasks and leveraging both HF and LF features for image reconstruction. Besides, the residual block embedded octave convolution module is designed to enhance the information interaction between HF features and LF features. Additionally, a dual-frequency channel-wise entropy model is applied to reasonably exploit the correlation between different tasks, thereby improving multi-task performance. The experiments show that the proposed method offers -69.3%similar to-75.3% coding gains on machinevision tasks compared to the relevant benchmarks, and -19.1% gains over state-of-the-art scalable image codec in terms of image reconstruction quality.
In the field of Video Coding for machines (VCM), scalable feature compression has attracted attention for its potential to support a variety of machinevision tasks. However, the existing scalable feature compression ...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
In the field of Video Coding for machines (VCM), scalable feature compression has attracted attention for its potential to support a variety of machinevision tasks. However, the existing scalable feature compression methods exhibit limited performance. To address this problem, we propose an end-to-end learned scalable multilayer feature compression method in this paper. First, we propose to leverage an end-to-end feature compression method, which can efficiently exploit redundancy among features through a learning approach, to improve compression efficiency. Second, we introduce a novel strategy involving the use of the transformed latent of the base layer as the conditional information for the enhancement layer. Given the learnable nature of our compression method, we propose to optimize the base layer and the enhancement layer jointly. The joint optimization encourages the base layer to produce more suitable conditional information for the enhancement layer. Comparative experiments against existing feature compression and image compression methods verify our approach's remarkable performance improvements.
machinevision and computer imageprocessing technologies are widely used in the metallurgical industry, especially in recognizing and analyzing defects in glass. High surfaces of planer surface and quality in the gla...
详细信息
This study introduces a machinevision system integrated into cyber-physical systems (CPS) for enhanced industrial control. The system employs a specialized script for real-time imageprocessing and edge detection, wi...
详细信息
ISBN:
(纸本)9798350372977;9798350372984
This study introduces a machinevision system integrated into cyber-physical systems (CPS) for enhanced industrial control. The system employs a specialized script for real-time imageprocessing and edge detection, with a focus on precision and speed. Results showcase the system's rapid processing capabilities and high-accuracy feature detection, facilitated by machine learning algorithms that enable adaptability and iterative improvement. The system distinguishes itself by not only providing rapid and accurate feature recognition but also by outputting precise coordinates, crucial for micron-level manufacturing precision. An intuitive human-machine interface ensures seamless operation within industrial workflows. This integration significantly improves automated quality control and operational efficiency, demonstrating the system's potential to advance smart manufacturing in line with Industry 4.0 standards.
Following the great success of curriculum learning in the area of machine learning, a novel deep curriculum learning method proposed in this paper, entitled DCL, particularly for the classification of fully polarimetr...
详细信息
Sparse representation based on dictionary learning has been widely used in many applications over the past decade. In this article, a new method is proposed for removing noise from video images using sparse representa...
详细信息
This paper presents an integrated approach to bottle label detection in industrial processes, combining machinevision techniques with Programmable Logic Controller (PLC) commanding for an efficient and reliable quali...
详细信息
ISBN:
(纸本)9798350373172;9798350373189
This paper presents an integrated approach to bottle label detection in industrial processes, combining machinevision techniques with Programmable Logic Controller (PLC) commanding for an efficient and reliable quality control system. The logic of the label detection station was developed using ladder diagrams to simulate the process and determine whether to retain or remove a bottle on the conveyor based on label presence. The imageprocessing component, executed in Python, manipulated the main variable indicating the label's presence on the bottle. CODESYS was employed to visually represent the process, and in the simulation phase, the bottle traversed the conveyor, halting when the proximity sensor activated. Python code provided the labeling status, and the corresponding image was displayed on a simulation screen. To bridge the gap between imageprocessing and logic control, the functionality of imageprocessing in Python was linked with the CODESYS logic. An OPC UA server in CODESYS facilitated external connections, enabling Python to access and configure CODESYS variables. Challenges, such as scanning rate disparities between Python and CODESYS, were addressed through the introduction of sessions synchronized by a counter. Technical steps involved using an OPC UA client program to monitor server availability and access CODESYS program variables. The installation of the Security plugin in CODESYS ensured secure external connections. The project's key realization was the seamless linkage between imageprocessing and PLC logic, demonstrating an effective integration of machinevision into industrial processes. For bottle label detection, the paper employed OpenCV for object recognition. The image segmentation method, utilizing adaptive thresholding, distinguished the bottle from its background, optimizing the separation process. Contours were identified using findContours(), and a thorough cleanup using arcLength() and approxPolyDP() functions ensured only relev
HyperSpectral image (HSI) processing has many applications in agriculture, Earth Change Monitoring, etc. Classification of HSIs is an important stage in most applications. Recently transformers have shown outstanding ...
详细信息
ISBN:
(纸本)9798350376357;9798350376340
HyperSpectral image (HSI) processing has many applications in agriculture, Earth Change Monitoring, etc. Classification of HSIs is an important stage in most applications. Recently transformers have shown outstanding performance in computer vision tasks and some transformer-based methods have been introduced for HSI classification. In this paper, we propose a novel classification algorithm using the convolutional mixer for HSI classification. Convolutional mixer is similar to transformers and is used to mix spatial and spectral information which are gathered separately by using depth wise convolution followed by a point wise convolution. We have used 3D convolutions in mixer block due to the 3D shape of hyperspectral cuboids to extract spectral and spatial features simultaneously, which have not been done before, to the best of our knowledge. Experimental results show that our method is superior to similar state of the art works in terms of overall accuracy and Kappa Coefficient.
Underwater images usually have low contrast, blurring, and extreme color distortion because the light is refracted, scattered, and absorbed as it passes through the water. These features can lead to challenges in imag...
详细信息
The narrow internal space, inadequate lighting, and complex geometry of internally threaded pipes pose significant challenges for accurate detection. This paper presents a computer vision-based solution that enhances ...
详细信息
ISBN:
(纸本)9798400707032
The narrow internal space, inadequate lighting, and complex geometry of internally threaded pipes pose significant challenges for accurate detection. This paper presents a computer vision-based solution that enhances image acquisition efficiency through innovative hardware design. By utilizing cylindrical stitching algorithms, we achieved panoramic image stitching of internal threads, and automated defect detection was implemented using a YOLO-based object recognition algorithm. This research addresses specific industrial needs for internal thread detection in oil pipes. Compared to existing technologies, this method significantly increases detection speed, reduces the risk of human error, and improves production efficiency.
暂无评论