The diffusion transformer (DiT) architecture has attracted considerable attention in image generation, achieving better fidelity, performance, and diversity. However, most existing DiT-based image generation methods are global-awa...
Neural network models for guitar amplifier emulation, while effective, often incur high computational cost and lack interpretability. Drawing on ideas from physical amplifier design, this paper aims to address the...
Most existing text-to-image person retrieval methods assume that the training image-text pairs are perfectly aligned; however, the noisy correspondence (NC) issue (i.e., incorrect or unreliable alignment) exists...
Learning-driven methods have revolutionized the field of blind deconvolution, with SelfDeblur as a pioneering method. It uses deep image priors to jointly estimate blur kernels and latent clear images, employing deep ...
Survival prediction for patients treated with PD-1 inhibitors has received extensive attention in recent years. Existing diffusion models generally focus blurring on key lesion regions, and the masks are weakly matched to CT images ...
Camouflaged object detection (COD), which aims to segment objects that are highly similar to their background, is a valuable yet challenging task. Due to the interference of clutter and noise in the background, existi...
Despite the significant advances of convolutional neural networks (CNNs) and Transformers in image deraining, they either suffer from limited receptive fields or incur quadratic complexity, leading to an imbalance bet...
To prevent image distortion, this paper explores methods for enhancing and optimizing graphic design images using 3D laser vision technology. The process involves collecting graphic design image data, mapping 3D laser...
Convolutional neural networks are widely used in various medical image segmentation tasks. However, they struggle to learn global features adaptively due to the inherent locality of convolutional operations....
Due to the absence or mismatch of semantic information, existing few-shot image generation methods suffer from unsatisfactory generation quality and diversity, and thus offer minimal benefit as data augmentation for down...