In the technique of recognizing objects based on camera images, Raindrops that are adsorbed to camera lenses and refracting the incoming light are the main factors of lowering the recognition rate. Existing methods fo...
详细信息
ISBN:
(纸本)9781728132327
In the technique of recognizing objects based on camera images, Raindrops that are adsorbed to camera lenses and refracting the incoming light are the main factors of lowering the recognition rate. Existing methods for detecting these droplets mainly define ideal droplet forms and detect areas similar to those defined in the image to determine whether or not to absorb Raindrops in the image. However, since the shape of the droplet varies from lens to lens, it is difficult to determine whether the droplet is absorbed by various cameras by conventional methods. Therefore, this paper presents a Deep learning method of detecting Raindrops of various shapes, which were difficult to detect by conventional signal processing methods.
Computer vision systems are increasingly used for the early detection of skin cancers. Recognizing the first sign of melanoma is very important because if melanoma is found and treated in its primary stage the chances...
详细信息
ISBN:
(纸本)9781450369886
Computer vision systems are increasingly used for the early detection of skin cancers. Recognizing the first sign of melanoma is very important because if melanoma is found and treated in its primary stage the chances for long-term survival are excellent. On the contrary, as it progresses its treatment becomes increasingly harder and it has worse outcome. The various proposals of computer vision systems are characterized by some fundamental common phases: image acquisition, pre-processing, segmentation, features extraction and finally classification. Feature extraction aims at extracting the features from the lesion image in order to characterize the melanoma and feed the classifier. The recent research provided many different feature extraction algorithms for melanoma diagnosis from dermoscopy images from the simplest to the most sophisticated. Features are typically extracted using digital imageprocessing methods (i.e., segmentation, edge detection and color and structure processing), and an open discussion about the meaning of these features and the objective ways of measuring them is ongoing. This paper is a contribution to the feature extraction phase as it describes the most frequently used features in the elaboration of computer vision systems and reports a description of recent works for feature extraction and classification.
Style transfer is an increasingly popular field that can capture the styles of a particular artwork and use them to synthesize a new image with specific content. Previous NST algorithms have the limitation to transfer...
详细信息
The sense of discomfort when watching stereoscopic display caused by visual fatigue has hindered the widespread applications of 3D display. In this work, we explore the relationship between visual fatigue and parallax...
详细信息
Real-time object detection plays a significant role in the field of computer vision. Advanced object detection networks combine with the distribution characteristics in the image, while exposing detecting small target...
详细信息
Knowledge distillation has shown great success in classification, however, it is still challenging for detection. In a typical image for detection, representations from different locations may have different contribut...
ISBN:
(纸本)9781713845393
Knowledge distillation has shown great success in classification, however, it is still challenging for detection. In a typical image for detection, representations from different locations may have different contributions to detection targets, making the distillation hard to balance. In this paper, we propose a conditional distillation framework to distill the desired knowledge, namely knowledge that is beneficial in terms of both classification and localization for every instance. The framework introduces a learnable conditional decoding module, which retrieves information given each target instance as query. Specifically, we encode the condition information as query and use the teacher's representations as key. The attention between query and key is used to measure the contribution of different features, guided by a localization-recognition-sensitive auxiliary task. Extensive experiments demonstrate the efficacy of our method: we observe impressive improvements under various settings. Notably, we boost RetinaNet with ResNet-50 backbone from 37.4 to 40.7 mAP (+3.3) under 1× schedule, that even surpasses the teacher (40.4 mAP) with ResNet-101 backbone under 3× schedule.
imageprocessing (IP) applications have become popular with the advent of efficient algorithms and low-cost CMOS cameras with high resolution. However, IP applications are compute-intensive, consume a lot of energy an...
详细信息
ISBN:
(纸本)9781728117393
imageprocessing (IP) applications have become popular with the advent of efficient algorithms and low-cost CMOS cameras with high resolution. However, IP applications are compute-intensive, consume a lot of energy and have long processing times. image approximation has been proposed by recent works for an energy-efficient design of these applications. It also reduces the impact of long processing times. The challenge here is that the IP applications often work as a part of bigger closed-loop control systems, e.g. advanced driver assistance system (ADAS). The impact of image approximations that tolerate certain error on these image-based control (IBC) systems is very important. However, there is a lack of tool support to evaluate the performance of such closed-loop IBC systems when the IP is approximated. We propose a framework - for both software-in-the-loop (SiL) and hardware-in-the-loop (HiL) simulation - for performance evaluation of image approximation on a closed-loop automotive IBC system (IMACS). Both simulation setups model the 3D environment in 3ds Max, and simulate the system dynamics, camera position and environment in V-REP. Our SiL setup simulates the system software in C++ or Matlab. Here, V-REP runs as a server and the software as a client in synchronous mode. Our HiL simulation setup runs the system software in the NVIDIA Drive PX2 platform and communicates to V-REP using application programming interfaces (APIs) for synchronous execution. We show the effectiveness of our framework using a vision-based lateral control example.
This paper demonstrates a real-time user tracking and handover mechanism for indoor ultrahigh-speed beam-steered optical-wireless systems implementing a low-cost camera. This allows us to tackle LoS blocking by switch...
详细信息
ISBN:
(数字)9781943580712
ISBN:
(纸本)9781728167626
This paper demonstrates a real-time user tracking and handover mechanism for indoor ultrahigh-speed beam-steered optical-wireless systems implementing a low-cost camera. This allows us to tackle LoS blocking by switching to a secondary beam-steering device automatically.
Super resolution reconstruction of human face is a cost effective way to obtain high resolution images from its corresponding low resolution face. It is also known as face illusion. In order to obtain clearer texture ...
详细信息
ISBN:
(数字)9781728151694
ISBN:
(纸本)9781728151700
Super resolution reconstruction of human face is a cost effective way to obtain high resolution images from its corresponding low resolution face. It is also known as face illusion. In order to obtain clearer texture details, this paper proposes a densely connected super-resolution algorithm based on attention mechanism which consists of feature extraction and image reconstruction. By integrating channel and spatial domain information of the feature map, the Multi Attention Domain Module (MADM) is proposed: Features are weighted and recombined by analyzing the relationship between channels and spatial information of feature maps. The features of different layers are fused using dense connections. Experimental results show that the proposed algorithm can improve by up to 0.5dB in PSNR and the reconstructed face image has clearer texture details compared to existing algorithms.
After many years of research, optical flow algorithm has achieved good results in detecting moving objects in simple scenes, but the detection effect in some complex scenes is not ideal, for example, in scenes with ch...
详细信息
暂无评论