A hazy image is one where atmospheric effects degrade the contrast and visibility of the image. It is often caused by the dispersion of light into the moisture particles present, smoke etc. This results in lower perfo...
详细信息
ISBN:
(纸本)9798350320565
A hazy image is one where atmospheric effects degrade the contrast and visibility of the image. It is often caused by the dispersion of light into the moisture particles present, smoke etc. This results in lower performance in high level vision tasks such as object detection, free space detection, scene understanding, etc. Hence the images have to be dehazed before applying other high level algorithms. Dehazing is the process of reconstructing the original colour and contrast of the image if taken in normal conditions. image dehazing is a non-trivial task as it is hard to collect haze free ground truth images. Further, achieving dehazed images when variable haze is present is a significantly harder challenge. In this research, we propose the Non Homogeneous RESIDE dataset (NH-RESIDE) that contains images created synthetically using the principles of randomness and representativeness. Experimental results show that the model trained on our dataset produces visually more pleasing images with a much better dehazing effect on real world images. The model implemented in this paper also outperforms the state-of-the-art models by a huge margin on the NH-Haze dataset proposed by the NTIRE Non Homogeneous Dehazing Challenge at CVPR, achieving an average PSNR of 25.69 and an average SSIM of 0.80. It also achieves much better processingtimes when compared to other models, thereby facilitating real-time performance.
This paper presents a new hybrid pupil detection algorithm based on both AI and classical imageprocessing techniques. The problem addressed by the AI technique of neural networks is the detection of the binarization ...
详细信息
ISBN:
(纸本)9783031625220;9783031625237
This paper presents a new hybrid pupil detection algorithm based on both AI and classical imageprocessing techniques. The problem addressed by the AI technique of neural networks is the detection of the binarization threshold. Thus, using the percentile function calculated in a sufficiently large number of points, a convolutional neural network with a reduced number of layers is used to determine the appropriate threshold necessary to obtain the binarized eye image. Furthermore, classic imageprocessing solutions (morphological operations, Laplacian based edge detection, convexity correction, ellipse fitting) are used to detect the center of the pupil. Due to a good immunity to variable and non-uniform lighting conditions, the proposed algorithm shows a high detection rate when tested on three representative databases.
As the main carrier of underwater information, underwater video images play a vital role in human exploration and development of the ocean. However, due to the optical properties of water and the complex and variable ...
详细信息
In technology 4.0, all data is put on the Cloud for transmission and processing. images are one of the most important data and are quite large because they are created every day. Therefore, the increased demand for im...
详细信息
The video object segmentation (VOS) task involves the segmentation of an object over time based on a single initial mask. Current state-of-the-art approaches use a memory of previously processed frames and rely on mat...
详细信息
ISBN:
(纸本)9781713899921
The video object segmentation (VOS) task involves the segmentation of an object over time based on a single initial mask. Current state-of-the-art approaches use a memory of previously processed frames and rely on matching to estimate segmentation masks of subsequent frames. Lacking any adaptation mechanism, such methods are prone to test-time distribution shifts. This work focuses on matching-based VOS under distribution shifts such as video corruptions, stylization, and sim-to-real transfer. We explore test-time training strategies that are agnostic to the specific task as well as strategies that are designed specifically for VOS. This includes a variant based on mask cycle consistency tailored to matching-based VOS methods. The experimental results on common benchmarks demonstrate that the proposed test-time training yields significant improvements in performance. In particular for the sim-to-real scenario and despite using only a single test video, our approach manages to recover a substantial portion of the performance gain achieved through training on real videos. Additionally, we introduce DAVIS-C, an augmented version of the popular DAVIS test set, featuring extreme distribution shifts like image-/video-level corruptions and stylizations. Our results illustrate that test-time training enhances performance even in these challenging cases. Project page: https://***/test-time-training-vos/
The motive of underwater image enhancement is to improve and enhance the quality of photographs taken underwater. Various traditonal imageprocessing techniques and deep learning models have been developed to work on ...
详细信息
The integration of artificial intelligence (AI) methodology in the flexible electronics, BioMEMs and Dielectrophoresis (DEP) system provides an accurate, comprehensive, simple and cost-effective approach especially in...
详细信息
ISBN:
(纸本)9798350378313
The integration of artificial intelligence (AI) methodology in the flexible electronics, BioMEMs and Dielectrophoresis (DEP) system provides an accurate, comprehensive, simple and cost-effective approach especially in the realtime cell sorting, target identification, monitoring, biological entities separation, analysis and postulation. Here, a novel approach integrating AI algorithms with DEP technology for BioMEMs system has been demonstrated in real-time bacteria identification via detecting, identifying simultaneously, lock-in monitoring bacterial movement trajectories, analyzing, measuring and calculating bacteria's velocities of different frequencies. The AI-based bacterial recognition system creates real-time, fast, accurate, unlimited bacteria lock-in monitoring with high accuracy. Furthermore, the bacteria can be tracked, monitored, and analyzed using big data analysis and Deep Learning (DL) in real-time via a visual control interface system, as part of the Internet of Things (IoT) and intelligent information sharing between BioMEMS sensors and IoT networks.
The synthetic aperture radar (SAR) images provide all-time and all-weather observation for the earth, which has become one of important techniques for remote sensing (RS). Recently, the large-scale vision transformers...
详细信息
Plant disease outbreaks have been more common in recent years, which has put food security and agricultural output at serious risk. timely intervention and the avoidance of significant losses are contingent upon the s...
详细信息
Underwater imaging presents numerous challenges due to refraction, light absorption, and scattering, resulting in color degradation, low contrast, and blurriness. Enhancing underwater images is crucial for high-level ...
详细信息
ISBN:
(纸本)9798350318920;9798350318937
Underwater imaging presents numerous challenges due to refraction, light absorption, and scattering, resulting in color degradation, low contrast, and blurriness. Enhancing underwater images is crucial for high-level computer vision tasks, but existing methods either neglect the physics-based image formation process or require expensive computations. In this paper, we propose an effective framework that combines a physics-based Underwater image Formation Model (UIFM) with a deep image enhancement approach based on the retinex model. Firstly, we remove backscatter by estimating attenuation coefficients using depth information. Then, we employ a retinex model-based deep image enhancement module to enhance the images. To ensure adherence to the UIFM, we introduce a novel Wideband Attenuation prior. The proposed PhISH-Net framework achieves real-timeprocessing of high-resolution underwater images using a lightweight neural network and a bilateral-grid-based upsampler. Extensive experiments on two underwater image datasets demonstrate the superior performance of our method compared to state-of-the-art techniques. Additionally, qualitative evaluation on a cross-dataset scenario confirms its generalization capability. Our contributions lie in combining the physics-based UIFM with deep image enhancement methods, introducing the wideband attenuation prior, and achieving superior performance and efficiency.
暂无评论