When taking images against strong light sources, the resulting images often contain heterogeneous flare artifacts. These artifacts can importantly affect image visual quality and downstream computervision tasks. Whil...
ISBN:
(纸本)9798350307184
When taking images against strong light sources, the resulting images often contain heterogeneous flare artifacts. These artifacts can importantly affect image visual quality and downstream computervision tasks. While collecting real data pairs of flare-corrupted/flare-free images for training flare removal models is challenging, current methods utilize the direct-add approach to synthesize data. However, these methods do not consider automatic exposure and tone mapping in image signal processing pipeline (ISP), leading to the limited generalization capability of deep models training using such data. Besides, existing methods struggle to handle multiple light sources due to the different sizes, shapes and illuminance of various light sources. In this paper, we propose a solution to improve the performance of lens flare removal by revisiting the ISP and remodeling the principle of automatic exposure in the synthesis pipeline and design a more reliable light sources recovery strategy. The new pipeline approaches realistic imaging by discriminating the local and global illumination through convex combination, avoiding global illumination shifting and local over-saturation. Our strategy for recovering multiple light sources convexly averages the input and output of the neural network based on illuminance levels, thereby avoiding the need for a hard threshold in identifying light sources. We also contribute a new flare removal testing dataset containing the flare-corrupted images captured by ten types of consumer electronics. The dataset facilitates the verification of the generalization capability of flare removal methods. Extensive experiments show that our solution can effectively improve the performance of lens flare removal and push the frontier toward more general situations.
Deep learning constitutes a fundamental pillar in the field of image recognition within autonomous vehicles (AVs), facilitating precise predictions based on unprocessed data. However, unlike human cognition, deep lear...
详细信息
To solve the problem of EEG signal on feature extraction and low recognition rate, we analyze the characteristics of EEG signal and propose an EEG signal analysis method based on common spatial pattern (CSP) and suppo...
详细信息
Remote photoplethysmography (rPPG) is a promising non-contact method for measuring heart rate (HR) and physiological signals. However, current deep learning approaches in this field primarily focus on extracting subtl...
详细信息
computervision, the cornerstone of modern artificial intelligence. Moving forward with the development of new tools and techniques, OpenCV (Open Source computervision Library) coupled with pandas' powerful data ...
详细信息
Learning low-dimensional latent state space dynamics models has proven powerful for enabling vision-based planning and learning for control. We introduce a latent dynamics learning framework that is uniquely designed ...
详细信息
ISBN:
(纸本)9781665445092
Learning low-dimensional latent state space dynamics models has proven powerful for enabling vision-based planning and learning for control. We introduce a latent dynamics learning framework that is uniquely designed to induce proportional controlability in the latent space, thus enabling the use of simple and well-known PID controllers. We show that our learned dynamics model enables proportional control from pixels, dramatically simplifies and accelerates behavioural cloning of vision-based controllers, and provides interpretable goal discovery when applied to imitation learning of switching controllers from demonstration. Notably, such proportional controlability also allows for robust path following from visual demonstrations using Dynamic Movement Primitives in the learned latent space.
Image-based insect species identification is a comprehensive application of computervision technology, image processing technology and patternrecognition technology to realize insect species identification. It is of...
详细信息
Live demonstration setup. (Left) The setup consists of a DAVIS346B event camera connected to a standard consumer laptop and undergoes some motion. (Right) The motion estimates are plotted in red and, for rotation-like...
详细信息
ISBN:
(纸本)9781665448994
Live demonstration setup. (Left) The setup consists of a DAVIS346B event camera connected to a standard consumer laptop and undergoes some motion. (Right) The motion estimates are plotted in red and, for rotation-like motions, the angular velocities provided by the camera IMU are also plotted in blue. This plot exemplifies an event camera undergoing large rotational motions (up to ~ 1000 deg/s) around the (a) x-axis, (b) y-axis and (c) z-axis. Overall, the incremental motion estimation method follows the IMU measurements. Optionally, the resultant global optical flow can also be shown, as well as the corresponding generated events by accumulating them onto the image plane (bottom left corner).
The process of recognizing human emotion is called emotion recognition. The degree to which people can accurately identify the emotions of others varies greatly. The field of study on using technology to assist indivi...
详细信息
ISBN:
(纸本)9798350360806;9798350360790
The process of recognizing human emotion is called emotion recognition. The degree to which people can accurately identify the emotions of others varies greatly. The field of study on using technology to assist individuals in recognizing emotions is still in its infancy. The significance of facial emotion recognition lies in its ability to assist computers in deciphering human emotions from our faces. Its relevance can be summed up as follows: it can aid with mental health, make technology friendlier and more helpful, and even enhance products like video games and manufacturing processes. This approach analyzes facial expressions and characteristics using computervision techniques to determine emotions. Commonly accepted basic emotion categories include happiness, anger, fear, disgust, and surprise. Ensuring optimal functionality of the technology for users from diverse backgrounds.
The existence of pests and diseases threatens the agricultural production and brings serious economic losses to countries whose economy is based on agriculture. Traditional agricultural production methods require a lo...
详细信息
ISBN:
(纸本)9798350362770;9798350362763
The existence of pests and diseases threatens the agricultural production and brings serious economic losses to countries whose economy is based on agriculture. Traditional agricultural production methods require a lot of manpower to identify and detect pests and diseases, which will increase the cost of agricultural production. Now, with the development of deep learning technology, computervision based approaches can help reduce costs and improve efficiency. Through the deep learning model based on CNN, insects and pests can be accurately detected and recognized. This paper proposes an improved recognition model, MFnet, based on the lightweight deep neural network MobileNetV3. A new activation function, MLU6, is introduced in the proposed MFnet model. At the same time, we use the PolyLoss function to replace the cross-entropy loss function when training the proposed model. We also use transfer learning to improve the efficiency of network optimization and prevent overfitting through data augmentation and regularization. Ablation and comparative experiments are carried out and the IP102 pest dataset is used in the experiments. The experimental result shows that MFnet outperforms the original Mobilenetv3 and the other classic models in terms of classification accuracy, at the cost of increased number of parameters compared to the original Mobilenetv3. However, the parameter number, training time and classification time of the proposed model are still much less than those of the other classic models.
暂无评论