Despite the fairly good performance of Convolutional Neural Networks (CNNs) in image classification tasks, existing CNNs do not perform well when handling datasets with Gaussian noise. This results in the instability ...
详细信息
Exploiting the infrared area of the spectrum for classification problems is getting increasingly popular, because many materials have characteristic absorption bands in this area. However, sensors in the short wave in...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
Exploiting the infrared area of the spectrum for classification problems is getting increasingly popular, because many materials have characteristic absorption bands in this area. However, sensors in the short wave infrared (SWIR) area and even higher wavelengths have a very low spatial resolution in comparison to classical cameras that operate in the visible wavelength area. Thus, in this paper an upsampling method for SWIR images guided by a visible image is presented. For that, the proposed guided upsampling network (GUNet) uses a graph-regularized optimization problem based on learned affinities is presented. The evaluation is based on a novel synthetic near-field visible-SWIR stereo database. Different guided upsampling methods are evaluated, which shows an improvement of nearly 1 dB on this database for the proposed upsampling method in comparison to the second best guided upsampling network. Furthermore, a visual example of an upsampled SWIR image of a real-world scene is depicted for showing real-world applicability.
The image obtained using an image sensor with limited dynamic range cannot perfectly represent the various lighting conditions of the real world. Various HDR methods have been studied for expanding the dynamic range i...
详细信息
ISBN:
(纸本)9781728198354
The image obtained using an image sensor with limited dynamic range cannot perfectly represent the various lighting conditions of the real world. Various HDR methods have been studied for expanding the dynamic range in a single image. However, it is difficult to avoid ghosting artifacts caused by the movement of the subject over time and the corresponding texture loss. To solve these problems, we present a novel HDR image acquisition method via dynamic range transformer (DrT) that learns enhanced log-perceptual information using Swin-Fourier convolutional neural network as a backbone. When training the DrT with Swin-Fourier network, it estimates the attention map to obtain an HDR image by minimizing the enhanced log-perceptual (ELP) loss. The Swin-Fourier network considers both local and global contexts simultaneously, which reduces ghosting and texture loss. By learning ELP, it also minimizes color distortion and restores fine details of the dynamic range. Experimental results demonstrate that the HDR results obtained using DrT show reduced color distortion, significantly decreased ghosting artifacts, and texture loss compared to conventional methods. We provide implementation code of our proposed methods in https://***/HeunSeungLim/DrT
image segmentation is a crucial step in imageprocessing having various applications in biomedical image analysis. Segmentation of the magnetic resonance images of the brain is one such key area in biomedical image an...
详细信息
ISBN:
(纸本)9783031585340;9783031585357
image segmentation is a crucial step in imageprocessing having various applications in biomedical image analysis. Segmentation of the magnetic resonance images of the brain is one such key area in biomedical image analysis that segments various tissues in the brain and detects tumor regions. In this paper, an unsupervised rough spatial ensemble kernelized fuzzy clustering segmentation algorithm is presented for automated segmentation of magnetic resonance images of the brain. The proposed algorithm is an integration of Rough Fuzzy C Means clustering and the kernel method with a novel ensemble kernel being a combination of spherical kernel, Gaussian, and Cauchy kernels, which improves the performance of the segmentation algorithm. The proposed algorithm performs better than the existing clustering algorithms across a wide range of magnetic resonance images of the brain along with visual indications obtained from the results.
The proceedings contain 593 papers. The topics discussed include: MDBFUSION: a visible and infrared image fusion framework capable for motion deblurring;prune channel and distill: discriminative knowledge distillation...
ISBN:
(纸本)9798350349399
The proceedings contain 593 papers. The topics discussed include: MDBFUSION: a visible and infrared image fusion framework capable for motion deblurring;prune channel and distill: discriminative knowledge distillation for semantic segmentation;imbalanced data robust online continual learning based on evolving class aware memory selection and built-in contrastive representation learning;privacy-preserving visual cues communication for hearing-impaired people using deep learning;transformer-based clipped contrastive quantization learning for unsupervised image retrieval;attention enhancement with parallel groups for remote sensing object detection;cross-domain few-shot in-context learning for enhancing traffic sign recognition;and recurrent 3-D multi-level visual transformer for joint classification of heterogeneous 2-D and 3-D radiographic data.
A series of infrared and visible image fusion (IVIF) methods have emerged to improve the performance of segmentation task. However, existing perception-focused IVIF methods take visual effects and semantic information...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
A series of infrared and visible image fusion (IVIF) methods have emerged to improve the performance of segmentation task. However, existing perception-focused IVIF methods take visual effects and semantic information as a unified goal for training, ignoring the task conflicts. Moreover, these methods often involve manually designed modules, which are laborious and suboptimal. To solve the problems, we propose a collaborative feature learning framework based on neural architecture search (NAS). Specifically, we extract shared features of fusion and segmentation tasks into a unified space and separately process task objectives through a dual decoder. In light of the essential role that semantic information plays in the segmentation task, we construct a hybrid search space with transformers incorporated to enhance context dependence handling. Our method undergoes extensive experiments, showcasing exceptional visual effects and significant enhancements in segmentation tasks compared to other state-of-the-art methods.
The aggressive action greatly jeopardizes individual safety and general well-being. Various alternative tactics have been used to reduce violent behavior, including the installation and maintenance of surveillance sys...
详细信息
Automation improves the quality of fruits through quick and accurate detection of pest and disease infections, thus contributing to the country's economic growth and productivity. Although humans can identify the ...
详细信息
Automation improves the quality of fruits through quick and accurate detection of pest and disease infections, thus contributing to the country's economic growth and productivity. Although humans can identify the fruit damage caused by pests and diseases, the methods used are inconsistent, time-consuming, and variable. The surface features of fruits typically observed by consumers who seek their health benefits affect their market value. The issue of pest and disease infections further deteriorates fruits' quality, becoming a mounting stressor on farmers since they reduce the potential revenue from fruit production, processing, and export. This article reviews various studies on detecting and classifying damages in fruits. Specifically, we review articles where state-of-the-art approaches under segmentation, imageprocessing, machine learning, and deep learning have proved effective in developing automated systems that address hurdles associated with manual methods of assessing damage using visual experiences. This survey reviews thirty-two journal and conference papers from the past thirteen years that were found electronically through Google Scholar, Scopus, ieee, ScienceDirect, and standard online searches. This survey further presents a detailed discussion of previous research done in the past while emphasizing their strengths and limitations as well as outlining potential future research topics. It also reveals that much as the use of automated detection and classification of fruit damage has yielded promising results in the horticulture industry, more research is still needed with systems required to fully automate the detection and classification processes, especially those that are mobile phone-based towards addressing occlusion challenges.
The existing cross-modal retrieval methods trend toward the conventional multi-modal alignment while ignoring the localization bias caused by visual hallucination, including color pollution and appearance-like occlusi...
详细信息
The optimization method of visual communication in imageprocessing technology is an important aspect of modern digital media. It involves using various technologies to improve the quality and clarity of images, video...
详细信息
暂无评论