This paper reports on the NTIRE 2022 challenge on perceptual image quality assessment (IQA), held in conjunction with the New Trends in image Restoration and Enhancement workshop (NTIRE) workshop at CVPR 2022. This ch...
详细信息
ISBN:
(数字)9781665487399
ISBN:
(纸本)9781665487399
This paper reports on the NTIRE 2022 challenge on perceptual image quality assessment (IQA), held in conjunction with the New Trends in image Restoration and Enhancement workshop (NTIRE) workshop at CVPR 2022. This challenge is held to address the emerging challenge of IQA by perceptual imageprocessing algorithms. The output images of these algorithms have completely different characteristics from traditional distortions and are included in the PIPAL dataset used in this challenge. This challenge is divided into two tracks, a full-reference IQA track similar to the previous NTIRE IQA challenge and a new track that focuses on the no-reference IQA methods. The challenge has 192 and 179 registered participants for two tracks. In the final testing stage, 7 and 8 participating teams submitted their models and fact sheets. Almost all of them have achieved better results than existing IQA methods, and the winning method can demonstrate state-of-the-art performance.
Existing image inpainting methods used traditional and deep learning methods to restore a large missing region in the damaged image. This often leads to color discrepancy and blurriness. Pre-processing of prior line d...
详细信息
ISBN:
(纸本)9798350396386
Existing image inpainting methods used traditional and deep learning methods to restore a large missing region in the damaged image. This often leads to color discrepancy and blurriness. Pre-processing of prior line detection by user assistance is usually employed to reduce the blurry of center region by segmenting the large region into more minor. However, it operates manually, which is time-consuming. This paper introduces a technique to generate two-line types: penetrator and interactor in constructing auxiliary lines as guidance. These lines assist structure propagation established automatically, while the remaining small regions are filled by texture propagation. Experiments on large regular masks demonstrate that our proposed approach generates higher-quality results than other methods.
image enhancement is a process to improve the visual standard of image so as to extract spatial features of image. Histogram Equalization is method by which image can be improved for better perception and interpretati...
详细信息
ISBN:
(纸本)9781665454025
image enhancement is a process to improve the visual standard of image so as to extract spatial features of image. Histogram Equalization is method by which image can be improved for better perception and interpretation by different automated imageprocessing system and human beings. Although, for low-contrast images (with lower dynamic range) traditional histogram equalization deteriorate quality of the output image by introducing washed-out form caused by too much brightness change. This paper presents histogram equalization based image enhancement method which is optimized using cuckoo search algorithm. Foremost purpose of the proposed method is to control over and under enhancement by calculating clip limits of the image histogram before performing histogram equalization. This method uses cuckoo search algorithm to automatically control the clip limit according to the image nature using improved fitness function. Effectiveness of the proposed method in comparison to other methods is analyzed using well known image quality metrics.
Reconstructing the damaged images with perspective views has an extensive range in the field of image inpainting. However, most existing methods generated inadequately realistic restored images. Accomplishing this pro...
详细信息
ISBN:
(纸本)9798350396386
Reconstructing the damaged images with perspective views has an extensive range in the field of image inpainting. However, most existing methods generated inadequately realistic restored images. Accomplishing this problem, we propose an edge-enhanced image generation model considering viewpoints. Our method applies edge map information to guide image generation based on the perspective views of an image using vanishing points detection. Texture synthesis will be presented as post-processing to complete the remaining missing regions. Experiment shows that our approach can generate perspective images with convincing details, such as indoor and outdoor facades.
In the last few years, deep-learning models are becoming crucial for numerous scientific and industrial applications. Due to the growth and complexity of deep neural networks, researchers have been investigating techn...
详细信息
Risk, suppression, and damage assessment are essential parts of forest fire management. The involved data is distributed and heterogeneous, concerning meteorology, topomorphology, and socio-economy. In addition, it ha...
Risk, suppression, and damage assessment are essential parts of forest fire management. The involved data is distributed and heterogeneous, concerning meteorology, topomorphology, and socio-economy. In addition, it has a multiscale spatiotemporal analysis aspect. These reinforce the need to use the appropriate methods and technologies to enhance decision-making. This research investigates the potential of adopting spatial data warehousing techniques to build a SOLAP-based DSS that supports the whole decisional chain. (i) source collection and integration; (ii) data repository design and population; (iii) SOLAP analytics; and finally, (iv) data visualization in numerous formats, including tabular, graphic, and dynamic maps.
This paper presents the design, development, and implementation of Kulla, a virtual container-centric construction model that mixes loosely coupled structures with a parallel programming model for building infrastruct...
详细信息
This paper presents the design, development, and implementation of Kulla, a virtual container-centric construction model that mixes loosely coupled structures with a parallel programming model for building infrastructure-agnostic distributed and parallel applications. In Kulla, applications, dependencies and environment settings, are mapped with construction units called Kulla-Blocks. A parallel programming model enables developers to couple those interoperable structures for creating constructive structures named Kulla-Bricks. In these structures, continuous dataflow and parallel patterns can be created without modifying the code of applications. methods such as Divide&Containerize (data parallelism), Pipe&Blocks (streaming), and Manager/Block (task parallelism) were developed to create Kulla-Bricks. Recursive combinations of Kulla instances can be grouped in deployment structures called Kulla-Boxes, which are encapsulated into VCs to create infrastructure-agnostic parallel and/or distributed applications. Deployment strategies were created for Kulla-Boxes to improve the IT resource profitability. To show the feasibility and flexibility of this model, solutions combining real-world applications were implemented by using Kulla instances to compose parallel and/or distributed system deployed on different IT infrastructures. An experimental evaluation based on use cases solving satellite and medical imageprocessing problems revealed the efficiency of Kulla model in comparison with some traditional state-of-the-art solutions. (C) 2020 Published by Elsevier Inc.
In computer vision deep learning (DL) tasks, most of the input image datasets are stored in the JPEG format. These JPEG datasets need to be decoded before DL tasks are performed on them. We observe two problems in the...
详细信息
ISBN:
(纸本)9781728190747
In computer vision deep learning (DL) tasks, most of the input image datasets are stored in the JPEG format. These JPEG datasets need to be decoded before DL tasks are performed on them. We observe two problems in the current JPEG decoding procedures for DL tasks: (1) the decoding of image entropy data in the decoder is performed sequentially, and this sequential decoding repeats with the DL iterations, which takes significant time;(2) Current parallel decoding methods under-utilize the massive hardware threads on GPUs. To reduce the image decoding time, we introduce a pre-scan mechanism to avoid the repeated image scanning in DL tasks. Our pre-scan generates boundary markers for entropy data so that the decoding can be performed in parallel. To cooperate with the existing dataset storage and caching, systems, we propose two modes of the pre-scan mechanism: a compatible mode and a fist mode. The compatible mode does not change the image file structure so pre-scanned files can be stored back to disk for subsequent DL tasks. In comparison, the fast mode crafts a JPEG image into a binary format suitable for parallel decoding, which can be processed directly on the GPU. Since the GPU has thousands of hardware threads, we propose a fine-grained parallel decoding method on the pre-scanned dataset. The fine-grained parallelism utilizes the GPU effectively, and achieves speedups of around 1.5x over existing GPU-assisted image decoding libraries on real-world DL tasks.
The proceedings contain 16 papers. The special focus in this conference is on Signal and imageprocessing. The topics include: Deep Convolutional Neural Network-Based Diagnosis of Invasive Ductal Carcinoma;speaker Ide...
ISBN:
(纸本)9789813369658
The proceedings contain 16 papers. The special focus in this conference is on Signal and imageprocessing. The topics include: Deep Convolutional Neural Network-Based Diagnosis of Invasive Ductal Carcinoma;speaker Identification in Spoken Language Mismatch Condition: An Experimental Study;Ultrasound image Classification Using ACGAN with Small Training Dataset;preface;Chaotic Ions Motion Optimization (CIMO) for Biological Sequences Local Alignment: COVID-19 as a Case Study;assessment of Eyeball Movement and Head Movement Detection Based on Reading;using Hadoop Ecosystem and Python to Explore Climate Change;a Brief Review of Intelligent Rule Extraction Techniques;the Effect of Different Feature Selection methods for Classification of Melanoma;intelligent Hybrid Technique to Secure Bluetooth Communications;parallel Algorithm to find Integer k where a given Well-distributed Graph is k-Metric Dimensional;a Fog-Based Retrieval of Real-Time Data for Health Applications;differential Evolution-Based Shot Boundary Detection Algorithm for Content-Based Video Retrieval;qutrit-Based Genetic Algorithm for Hyperspectral image Thresholding.
B-mode ultrasound tongue imaging is a non-invasive and real-time method for visualizing vocal tract deformation. However, accurately extracting the tongue’s surface contour remains a significant challenge due to the ...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
B-mode ultrasound tongue imaging is a non-invasive and real-time method for visualizing vocal tract deformation. However, accurately extracting the tongue’s surface contour remains a significant challenge due to the low signal-to-noise ratio (SNR) and prevalent speckle noise in ultrasound images. Traditional supervised learning models often require large labeled datasets, which are labor-intensive to produce and susceptible to noise interference. To address these limitations, we present a novel Counterfactual Ultrasound Anti-Interference Self-Supervised Network (CUAI-SSN), which integrates self-supervised learning (SSL) with counterfactual data augmentation, progressively disentangles confounding factors, ensuring that the model generalizes well across varied ultrasound conditions. Our approach leverages causal reasoning to decouple noise from relevant features, enabling the model to learn robust representations that focus on essential tongue structures. By generating counterfactual image-label pairs, our method introduces alternative, noise-independent scenarios that enhance model training. Furthermore, we introduce attention mechanisms to enhance the network’s ability to capture fine-grained details even in noisy conditions. Extensive experiments on real ultrasound tongue images demonstrate that CUAI-SSN outperforms existing methods, setting a new benchmark for automated contour extraction in ultrasound tongue imaging. Our code is publicly available at https://***/inexhaustible419/CounterfactualultrasoundAI.
暂无评论