It is significant that the field-of-view mechanism plays a role in the visual system of the primate, such as reducing information overload and focusing on important information, which can ensure optimal information pr...
详细信息
The rapid development of artificial intelligence facilitates the improvement of imageprocessing algorithms. For an intelligent inspection robot, the ability to analyze the environment through image collection plays a...
详细信息
the article analyzes, parameterizes, and experimentally verifies the known "Hand-Eye" approach of calibration methods between a 2D camera system and a robotic system. Thanks to our new sensing devices (in co...
详细信息
ISBN:
(纸本)9798350347630
the article analyzes, parameterizes, and experimentally verifies the known "Hand-Eye" approach of calibration methods between a 2D camera system and a robotic system. Thanks to our new sensing devices (in cooperation with suitable software), we can quickly and easily set up a new application. We emphasize determining, defining, and comparing their geometric quantities and field of view by machinevision at the scene. The tested methodology uses a calibration tool as a reference. It verifies the observed approach on a practical demonstration example of the assembly to recognize the learned model object for sensing (cube). Realized experiments demonstrate the presented approach's usefulness, effectiveness, and reliability (measurement success rate 98%). The research results are based on using an external evaluation unit of the 2D camera system, its available built-in functions, and communication based on Ethernet - Non-Procedural.
image communication increasingly involves machine-to-machine delivery. For example, images acquired by an autonomous drone can be compressed and sent to an edge server over a wireless network for resource-intensive pr...
详细信息
ISBN:
(纸本)9798350303582;9798350303599
image communication increasingly involves machine-to-machine delivery. For example, images acquired by an autonomous drone can be compressed and sent to an edge server over a wireless network for resource-intensive processing. Traditional compression techniques involving transform, quantization, and entropy coding reach high compression efficiency, but channel conditions worse than expected may lead to a sharp decrease in the decoded image quality. As an alternative, Linear Coding and Transmission (LCT) systems have been proposed to avoid this digital cliff problem: The reconstructed image quality decreases gradually as channel conditions degrade. This paper presents a comprehensive evaluation of computer vision tasks with input images processed and transmitted using LCT. It also analyses the benefits of network retraining, accounting for impairments due to LCT and noisy channel. Considering object detection and semantic segmentation over images transmitted and received by LCT systems, we show that the task accuracy degrades smoothly when the channel quality decreases, avoiding the cliff effect. Retraining with noisy images processed by LCT restores detection mAP degradation from 23.8% to 4.4% and segmentation mIoU degradation from 43.2% to 8.1% when the channel signal-to-noise ratio is 10 dB.
To obtain higher imaging quality, the dark level generated during the operation of CMOS image sensors (CIS) needs to be corrected. In this paper, a dark level correction circuit is designed based on a 4 T active pixel...
详细信息
In the past years, machine learning (ML) and deep learning (DL) have led to the advancement of several applications, including computer vision, natural language processing, and audio processing. These complex tasks re...
详细信息
ISBN:
(纸本)9798400716164
In the past years, machine learning (ML) and deep learning (DL) have led to the advancement of several applications, including computer vision, natural language processing, and audio processing. These complex tasks require large models, which is a challenge to deploy in devices with limited resources. These resource-constrained devices have limited computation power and memory. Hence, the neural networks must be optimized through network acceleration and compression techniques. This paper proposes a novel method to compress and accelerate neural networks from a small set of spatial convolution kernels. Firstly, a novel pruning algorithm is proposed based on the density-based clustering method that identifies and removes redundancy in CNNs while maintaining the accuracy and throughput tradeoff. Secondly, a novel pruning algorithm based on the grid-based clustering method is proposed to identify and remove redundancy in CNNs. The performance of the three pruning algorithms (density-based, grid-based, and partitional-based clustering algorithms) is evaluated against each other. The experiments were conducted using the deep CNN compression technique on the VGG-16 and ResNet models to achieve higher accuracy on image classification than the original model at a higher compression ratio and speedup.
Factors such as insufficient lighting, backlighting, and failed focusing can lead to low brightness, poor contrast, and noise in the collected images. These issues seriously affect the further analysis and processing ...
详细信息
Unmanned aerial vehicles (UAVs) are widely utilized in various fields. However, during mission execution, the occurrence of mechanical failures or subsystem malfunctions, including fuel shortages, may result in the UA...
详细信息
In the contemporary world, captions are one of the most essential tools. There are also integrated programmes that use deep neural network models to create and provide captions for particular photographs. This article...
详细信息
A fast-expanding topic is the study of palmprint biometric identification in contactless scenario, which uses techniques from computer vision and machine learning to identify and authenticate people. In this study, we...
详细信息
ISBN:
(纸本)9789819752119;9789819752126
A fast-expanding topic is the study of palmprint biometric identification in contactless scenario, which uses techniques from computer vision and machine learning to identify and authenticate people. In this study, we utilized a handcrafted video dataset with 60 distinct classes, each labelled as either a left or right hand, to investigate palmprint detection and matching tasks. The dataset showcases various variations in palmprint patterns, like distance from the sensor, orientation, finger positioning, and deformation, making it an ideal candidate for the development of robust and accurate palmprint recognition models. The major goal of the study is to identify palmprints in the video collection and match them with the right class or pattern. To accomplish this task, different machine learning (ML) and deep learning (DL) models were trained and evaluated. To find the best method for palmprint identification in a contactless manner, the accuracy of each model was tested. In conclusion, our study adds to the expanding body of knowledge on biometric palmprint identification and introduces a fresh handmade video dataset that can be used to compare the effectiveness of various models.
暂无评论