Robust real-world image enhancement from multi-exposure low dynamic range (LDR) images is a challenging task due to the unexpected inconsistency among the input images, such as the large motion or various exposures. I...
详细信息
ISBN:
(纸本)9781665493468
Robust real-world image enhancement from multi-exposure low dynamic range (LDR) images is a challenging task due to the unexpected inconsistency among the input images, such as the large motion or various exposures. In this paper, we propose a novel end-to-end image enhancement network to solve this problem. After extracting contextual information from the LDR images, we design a novel matching volume to align them by considering the motion and exposure differences among the input images. A stacked hourglass with dilated convolution is further utilized to aggregate the matched feature maps to the final enhanced image. In addition, we design a weakly-supervised pairwise loss function to evaluate the color consistency in the enhanced image, which further boosts the performance. We show the effectiveness of our methods on high dynamic ranging imaging (HDR) and End-to-End image signal processing (E2E-ISP) tasks. Experimental results demonstrate that our model achieves state-of-the-art enhancement performance.
In the pneumatic industry, quality control is an essential step in assessing tire compliance. Artificial neural networks are increasingly used to accomplish this task. Their training requires a large number of images ...
详细信息
ISBN:
(纸本)9781728198354
In the pneumatic industry, quality control is an essential step in assessing tire compliance. Artificial neural networks are increasingly used to accomplish this task. Their training requires a large number of images of the controlled products. However, at the launch production of a new tire, the lack of images causes a performance loss for the network. To solve this problem, we propose to translate perfect tires computer-based images into ad hoc manufacturing context-realistic ones as pre-manufacturing step to improve robustness and ensure production quality. The challenging work is to extract features in real images and apply them to computer-based images while maintaining the original geometry. In the paper, we propose Prefab-GEN, a novel architecture based on Cycle-GAN. In the generator part, an Inception U-Net architecture is developed to enforce geometrical structure conversion and extract more detailed features. The qualitative and quantitative evaluation on tire dataset shows improvements compared with state-of-art.
image restoration aims to reconstruct a sharp image from its degraded counterpart, which plays an important role in many fields. Recently, Transformer models have achieved promising performance on various image restor...
ISBN:
(纸本)9798350307184
image restoration aims to reconstruct a sharp image from its degraded counterpart, which plays an important role in many fields. Recently, Transformer models have achieved promising performance on various image restoration tasks. However, their quadratic complexity remains an intractable issue for practical applications. The aim of this study is to develop an efficient and effective framework for image restoration. Inspired by the fact that different regions in a corrupted image always undergo degradations in various degrees, we propose to focus more on the important areas for reconstruction. To this end, we introduce a dual-domain selection mechanism to emphasize crucial information for restoration, such as edge signals and hard regions. In addition, we split high-resolution features to insert multiscale receptive fields into the network, which improves both efficiency and performance. Finally, the proposed network, dubbed FocalNet, is built by incorporating these designs into a U-shaped backbone. Extensive experiments demonstrate that our model achieves state-of-the-art performance on ten datasets for three tasks, including single-image defocus deblurring, image dehazing, and image desnowing. Our code is available at https://***/c-yn/FocalNet.
Deep learning has revolutionized imageprocessing and reached state-of-the-art performance in a variety of medical image segmentation applications. Among them, Generative Adversarial Networks (GANs) have carved out a ...
详细信息
With the development of information technology, patents or long sentences of science and technology with huge vocabulary and complex structure have widely appeared in people's daily work. This kind of long sentenc...
With the development of information technology, patents or long sentences of science and technology with huge vocabulary and complex structure have widely appeared in people's daily work. This kind of long sentence often contains several clauses and non-predicate verb phrases, and these clauses and phrases often restrict and depend on each other, thus forming a complex language phenomenon in which there are phrases in clauses and clauses in phrases. This kind of long sentence plays a great role in the logic and rigor of English itself, but it brings considerable difficulties to machine translation. In order to solve this problem, this paper proposes a multilingual and general algorithm for complex long sentence translation based on multi-strategy analysis. The algorithm uses a combination of case-based pattern matching and rule analysis to comprehensively utilize a variety of relevant language features in the source language sentences, including grammatical semantic features, sentence length, punctuation marks Functional words and contextual conditions are used to simplify the segmentation of complex long sentences and compound the translation.
Accurately determining salient regions of an image is challenging when labeled data is scarce. DINO-based self-supervised approaches have recently leveraged meaningful image semantics captured by patch-wise features f...
ISBN:
(纸本)9798350307184
Accurately determining salient regions of an image is challenging when labeled data is scarce. DINO-based self-supervised approaches have recently leveraged meaningful image semantics captured by patch-wise features for locating foreground objects. Recent methods have also incorporated intuitive priors and demonstrated value in unsupervised methods for object partitioning. In this paper, we propose SEMPART, which jointly infers coarse and fine bi-partitions over an image's DINO-based semantic graph. Furthermore, SEMPART preserves fine boundary details using graph-driven regularization and successfully distills the coarse mask semantics into the fine mask. Our salient object detection and single object localization findings suggest that SEMPART produces high-quality masks rapidly without additional post-processing and benefits from co-optimizing the coarse and fine branches.
Cellular microscopy is enhanced by computational paradigms such as imageprocessing, computer vision, and machine learning. image segmentation is vital for quantifying cell images, enabling tracking and subsequent app...
详细信息
Few-shot learning serves as a viable solution for addressing data scarcity, thus exhibiting significant potential in the domain of medical image segmentation. In this work, we propose a simple and efficient framework ...
详细信息
ISBN:
(纸本)9781728198354
Few-shot learning serves as a viable solution for addressing data scarcity, thus exhibiting significant potential in the domain of medical image segmentation. In this work, we propose a simple and efficient framework for few-shot medical image segmentation, termed SRPNet, which leverages self-reinforcement between foreground and background. Notably, without the need for prior knowledge, the model autonomously adapts the segmentation effect of both foreground and background, thereby enhancing the segmentation of previously unseen classes. Experimental evaluations conducted on CT and MRI datasets demonstrate the superior performance of the proposed method compared to other state-of-the-art techniques. Code is available at https://***/q362096112/SRPNet.
This paper proposes a CNN model based on hard-assigned coding processing of image features. By clearly describing the relationship between the input image sample features participating in the learning dictionary train...
详细信息
Decomposing a single mixed image into individual image layers is the common crux of a classical category of tasks in image restoration. Several unified frameworks have been proposed that can handle different types of ...
ISBN:
(纸本)9798350307184
Decomposing a single mixed image into individual image layers is the common crux of a classical category of tasks in image restoration. Several unified frameworks have been proposed that can handle different types of degradation in superimposed image decomposition. However, there are always undesired structural distortions in the separated images when dealing with complicated degradation patterns. In this paper, we propose a unified framework for superimposed image decomposition that can cope with intricate degradation patterns adaptively. Considering the different mixing patterns between the layers, we introduce a degeneration representation in the latent space to mine the intrinsic relationship between the superimposed image and the degeneration pattern. Moreover, by extracting structure-guided knowledge from the superimposed image, we further propose structural guidance refinement to avoid confusing content caused by structure distortion. Extensive experiments have demonstrated that our method remarkably outperforms other popular image separation frameworks. The method also achieves competitive results on related applications including image deraining, image reflection removal, and image shadow removal, which validates the generalization of the framework.
暂无评论