Structural vibration-based gait recognition has emerged as a promising soft-biometric modality, particularly for privacy-sensitive monitoring and access control. Despite its potential, current research is largely limi...
Vision Transformers (ViTs) have shown promise in medical image semantic segmentation (MISS) by capturing long-range correlations. However, ViTs often struggle to model local spatial information effectively, which is e...
Zero-shot anomaly detection (ZSAD) identifies anomalies without needing training samples from the target dataset, which is essential for scenarios with privacy concerns or limited data. Vision-language models like CLIP show po...
Medical image registration is essential for integrating information from diverse imaging modalities for clinical diagnosis and treatment planning. Despite significant advancements, achieving efficient and precise defo...
Underwater imaging presents unique challenges compared to open-air photography, primarily due to diminished visibility and geometric distortions, impeding the development of underwater computer vision (CV) and robotic...
In this paper, we aim to tackle the challenging Black-Box Open-Set Domain Adaptation (BB-OSDA) task. BB-OSDA enables conducting Open-Set Domain Adaptation (OSDA) with solely a black-box source model, broadening the ap...
Inspired by structured state space models and graph neural network modeling, we propose a novel graph-aware reasoning (GAR) model to effectively solve the problem between memory utilization efficiency and reasoning n...
To prevent image distortion, this paper explores methods for enhancing and optimizing graphic design images using 3D laser vision technology. The process involves collecting graphic design image data, mapping 3D laser...
ISBN: (print) 9781510687615
The proceedings contain 122 papers. The topics discussed include: DFrFT-ES model for emotion recognition based on fractional Fourier transform of EEG signals; research on traffic sign recognition under complex meteorological conditions; diffusion-augmented learning for long-tail recognition; apple leaf scab recognition using CNN and transfer learning; container image management in cloud-edge environments: an image deletion method based on layer affinity; computer graphics and image processing techniques based on visual communication design; dynamic fusion and non-negative matrix factorization-based multi-view clustering method; convolutional recurrent neural network-based EEG signal classification in motor imagery; and sentiment classification of MOOC courses by merging local context focus and bi-directional gated recurrent unit.
Adapting pre-trained models to new tasks can exhibit varying effectiveness across datasets. Visual prompting, a state-of-the-art parameter-efficient transfer learning method, can significantly improve the performance ...