Convolutional neural network (CNN) based salient object detection (SOD) has achieved great development in recent years. However, in some challenging cases, i.e. small-scale salient object, low contrast salient object ...
详细信息
The prevalent solution for BioNER involves using representation learning techniques combined with sequence ***, such methods are inherently task-specific, demonstrate poor generalizability, and often require a dedicat...
详细信息
Scene text detection (STR) attracts much attention in computer vision and is widely used in real-time applications. Though many methods have been proposed for horizontal and oriented texts, STR frameworks for spotting...
详细信息
Current diffusion models for human image animation struggle to ensure identity (ID) consistency. This paper presents StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high...
详细信息
With the rapid progress of generation technology, it has become necessary to attribute the origin of fake images. Existing works on fake image attribution perform multi-class classification on several Generative Adver...
详细信息
Video object segmentation (VOS) aims to distinguish and track target objects in a video. Despite the excellent performance achieved by off-the-shell VOS models, existing VOS benchmarks mainly focus on short-term video...
详细信息
Existing deep video models are limited by specific tasks, fixed input-output spaces, and poor generalization capabilities, making it difficult to deploy them in real-world scenarios. In this paper, we present our visi...
With the assumption that a video dataset is multimodality annotated in which auditory and visual modalities both are labeled or class-relevant, current multimodal methods apply modality fusion or cross-modality attent...
详细信息
Anatomical landmark detection, a pivotal research area in medical image processing, holds immense value in surgical navigation, image registration, and related fields. Traditional machine learning methods struggle wit...
详细信息
Despite impressive advancements in diffusion-based video editing models in altering video attributes, there has been limited exploration into modifying motion information while preserving the original protagonist'...
详细信息
暂无评论