Transformer-based sequential recommendation is highly powerful as it can capture short-term and long-term sequential recommendation. It plays a crucial role in personalized recommendation systems, aiming to extract dy...
详细信息
Currently, most open vocabulary models rely on CLIP (Contrastive Language-Image Pre-training) to classify masked regions, but CLIP models still have many drawbacks. The traditional transformer model is context aware t...
详细信息
To fuse vocabulary features into the pre-training model is the mainstream data feature processing method for sequence labelling tasks. In general, the feature fusion methods that have been proposed at present are dire...
详细信息
Estimating human pose in complex multi-frame situations is a challenging task and has attracted intensive research by many researchers. Although 3D human pose estimation methods have achieved remarkable results in sce...
详细信息
Remote sensing object detection has important application value in fields such as environmental monitoring and resource detection and analysis. However, the current universal object detectors are not very effective in...
详细信息
Text-based person retrieval (TBPR) is a challenging topic in cross-modal retrieval tasks, aiming to query corresponding person images based on textual descriptions. This task is complicated by noisy correspondences be...
详细信息
Recently the analysis of remotely sensed images has played a vital role in various aspects of research. The current researches ignore the unique prior knowledge in remote sensing images and do not consider exploring t...
详细信息
With the prevalence of deep learning, people use multi-modality information for interpretation and reasoning. In this paper, a cross-modality encoder CMEEA (cross-modality encoder representation based on external atte...
详细信息
This paper addresses the limitations of the Contrastive Language-Image Pre-training (CLIP) model's image encoder and proposes a segmentation model WSSS-ECFE with enhanced CLIP feature extraction, aiming to improve...
详细信息
In this era of rapid development of information science and technology, data coding technology has been widely used in multimedia, computer, communication and other fields. Non-deterministic combined coding is further...
详细信息
暂无评论