Recently, text-to-image models based on diffusion have achieved remarkable success in generating high-quality images. However, the challenge of personalized, controllable generation of instances within these images re...
详细信息
Transformers have demonstrated outstanding performance in learned image compression (LIC) due to their high capacity for modeling complex dependencies. However, existing methods employ window-based attention mechanism...
详细信息
One mainstream of image anomaly detection is based on reconstruction. Such methods still struggle with diverse anomalies, such as near-in-distribution or deformed types. To address the challenge, we propose a Discrimi...
详细信息
We address the problem of object placement with user instructions using LLM and diffusion model. Traditional methods struggle to find a suitable location for filling the object with a semantically reasonable size. In ...
详细信息
This study proposes an innovative algorithm based on DCNN and multi-channel image fusion, aiming to improve the quality and efficiency of virtual scene image generation. The algorithm extracts depth information and te...
详细信息
The rapid growth in digital image sharing, driven by advancements in internet and communication technologies, has raised concerns about image integrity, especially in sensitive fields like healthcare. This paper prese...
详细信息
The conditional diffusion models have made significant progress in image synthesis, leveraging human annotations such as class labels or text descriptions to guide the generative process. However, different from image...
详细信息
Multimodal image fusion aims to merge features from different modalities to create a comprehensively representative image. However, existing medical image fusion methods often struggle to handle noise generated during...
详细信息
image clustering is a challenging task in computer vision, with performance heavily dependent on the quality of feature representations due to the inherent complexity of images. However, current image clustering metho...
详细信息
Lensless imaging systems eliminate the need for lenses by employing an encoding element to multiplex incident light signals, which are then captured directly onto a bare camera sensor. They present a promising alterna...
详细信息
暂无评论