This study proposes a Convolutional Neural Network (CNN) based system to aid visually impaired individuals by classifying road surfaces. Leveraging image analysis and processing techniques, the model uses laser beams ...
详细信息
The proceedings contain 39 papers. The topics discussed include: performance analysis of several CNN based models for brain MRI in tumor classification;MRI-based lumbar sagittal alignment classification system;3D mapp...
ISBN:
(纸本)9798350352368
The proceedings contain 39 papers. The topics discussed include: performance analysis of several CNN based models for brain MRI in tumor classification;MRI-based lumbar sagittal alignment classification system;3D mapping and landing zone identification in complex terrains using DSM and photogrammetry;vision language models for oil palm fresh fruit bunch ripeness classification;towards no shadow: region-based shadow compensation on low-altitude urban aerial images;comparative analysis of deep learning architectures for blood cancer classification;exploration of group and shuffle module for semantic segmentation of sea ice concentration;on handcrafted machine learning features for art authentication;and acoustic signature modelling of marine vessels in various environmental and operational conditions.
The field of Obstacle Detection in computer vision is evolving rapidly, with many researchers contributing innovative solutions to address different challenges. However, the current landscape presents several barriers...
详细信息
The Dynamic vision System (DVS) is a novel image acquisition system that works only when there is a brightness change in a pixel, resulting in a stream of events including timestamps, spatial coordinates and the sign ...
详细信息
ISBN:
(纸本)9781728198354
The Dynamic vision System (DVS) is a novel image acquisition system that works only when there is a brightness change in a pixel, resulting in a stream of events including timestamps, spatial coordinates and the sign of the brightness change (increase or decrease). Although DVS's output data size is much smaller than conventional image systems, it still requires further compression, as the main applications of DVS are embedded systems with limited transmission and storage resources. In this paper, we propose a new method for lossless compression of event data streams based on point cloud representations. The event data stream is organized into a 3D point cloud to which a compression algorithm is applied. In addition, different generation strategies are devised in order to compare the compression performance of the proposed approach. Experimental results show an improved compression ratio of about 22% under lossless conditions.
Traditional vision-based navigation methods for mobile robot can only be applied to some specific simple scenes due to the limitations of complex road conditions and light variations. However, the research achievement...
详细信息
At present, the measurement of irregular workpiece is mainly carried out by coordinate measuring machine, but this kind of equipment is expensive and inconvenient to operate, and can’t be applied to soft objects. Thi...
详细信息
This paper presents a study on the evaluation of moisture content in in vitro chewed food boluses using image processing techniques. This integration offers a novel, non-destructive method that enhances the accuracy a...
详细信息
According to visual restaurant service robot application environment under the condition of uneven light and changeful, light and shade and occlusion, make the RGB image noise influence is very big which was collected...
详细信息
Computer vision is to measure and judge by machine instead of man, and convert the captured target scene into image signal through camera device. Transmitting it to the image processing system and converting it into a...
详细信息
Unsupervised image retrieval aims to learn the important visual characteristics without any given level to retrieve the similar images for a given query image. The Convolutional Neural Network (CNN)-based approaches h...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
Unsupervised image retrieval aims to learn the important visual characteristics without any given level to retrieve the similar images for a given query image. The Convolutional Neural Network (CNN)-based approaches have been extensively exploited with self-supervised contrastive learning for image hashing. However, the existing approaches suffer due to lack of effective utilization of global features by CNNs and biased-ness created by false negative pairs in the contrastive learning. In this paper, we propose a TransClippedCLR model by encoding the global context of an image using Transformer having local context through patch based processing, by generating the hash codes through product quantization and by avoiding the potential false negative pairs through clipped contrastive learning. The proposed model is tested with superior performance for unsupervised image retrieval on benchmark datasets, including CIFAR10, NUS-Wide and Flickr25K, as compared to the recent state-of-the-art deep models. The results using the proposed clipped contrastive learning are greatly improved on all datasets as compared to same backbone network with vanilla contrastive learning.
暂无评论