Video-text retrieval (VTR) is an essential task in multimodal learning, aiming to bridge the semantic gap between visual and textual data. Effective video frame sampling plays a crucial role in improving retrieval per...
详细信息
Video-text retrieval (VTR) is an essential task in multimodal learning, aiming to bridge the semantic gap between visual and textual data. Effective video frame sampling plays a crucial role in improving retrieval performance, as it determines the quality of the visual content representation. Traditional sampling methods, such as uniform sampling and optical flow-based techniques, often fail to capture the full semantic range of videos, leading to redundancy and inefficiencies. In this work, we propose CLIP4Video-Sampling: Global Semantics-Guided Multi-Granularity Frame Sampling for Video-Text Retrieval, a global semantics-guided multi-granularity frame sampling strategy designed to optimize both computational efficiency and retrieval accuracy. By integrating multi-scale global and local temporal sampling and leveraging the CLIP (Contrastive Language-Image Pre-training) model’s powerful feature extraction capabilities, our method significantly outperforms existing approaches in both zero-shot and fine-tuned video-text retrieval tasks on popular datasets. CLIP4Video-Sampling reduces redundancy, ensures keyframe coverage, and serves as an adaptable pre-processing module for multimodal models.
Fault-tolerant recovery in Multicast communication is an important issue. An ideal multicast fault-tolerant approach can save network resources, reduce the delay and cost, realize quick recovery. In this paper, we pro...
详细信息
ISBN:
(纸本)9787900769428
Fault-tolerant recovery in Multicast communication is an important issue. An ideal multicast fault-tolerant approach can save network resources, reduce the delay and cost, realize quick recovery. In this paper, we propose a new multicast fault-tolerant approach-FH-TFTM based on proposed multicast fault-tolerant algorithms. It can go over problems faced in traditional approaches, for example node failure, unpredictable problems, time delay, network cost and loop avoidance. FH-TFTM can reduce consuming network resources, and carry out effectively setting up backup paths.
Traditional Chinese medicine (TCM) prescription is the most critical form of TCM treatment, and uncovering the complex nonlinear relationship between symptoms and TCM is of great significance for clinical practice and...
详细信息
Automated trust negotiation (ATN) is an approach that establishes mutual trust between strangers wishing to share resources or conduct business by gradually requesting and disclosing digitally signed credentials. When...
详细信息
The performance of transmit antenna selection is investigated. The exact closed-form expressions for the average BER and average SNR in MRT systems and EGT systems are proposed. The minimum number of increased transmi...
详细信息
Urban environments, or living cities, share the characteristic of a high degree of organized complexity. This complexity arises from the components of the urban fabric: streets, shops, offices, houses, pedestrian zone...
详细信息
Prescriptions play an essential role in the process of Traditional Chinese Medicine (TCM) diagnosis and treatment. Prescription generation is to generate a set of herbs to treat the symptoms of a patient by analyzing ...
详细信息
ISBN:
(纸本)9781665429825
Prescriptions play an essential role in the process of Traditional Chinese Medicine (TCM) diagnosis and treatment. Prescription generation is to generate a set of herbs to treat the symptoms of a patient by analyzing the relationship between symptoms and herbs. Although there have been a couple of studies to generate prescriptions, they have ignored the implicit relationship between the different symptoms of the patients. In addition, abundant semantic information and interpretability of the knowledge graph can help to portray the complicated relationships between the various modules of TCM. Therefore, this paper proposes a Knowledge-Aware neural Group representation learning model for Attentive Prescription Generation of Traditional Chinese Medicine (KGAPG), which regards the prescription generation task as a group recommendation problem. More specifically, multiple symptoms of a patient are considered as a symptom group and the complicated semantic information between symptoms and herbs can be captured by the knowledge graph. The syndrome information of multiple symptoms which is summarized by a group aggregation method based on the attention mechanism is applied to simulate the actual process of TCM diagnosis and treatment. The experiment results demonstrate that KGAPG is effective on a TCM prescription benchmark dataset, and its evaluation indicators of Precision, Recall and NDCG exceed other state-of-the-art methods.
This paper presents a rasterization rendering pipeline namely FreePipe. The system builds a bridge between the traditional graphics pipelines and the general purpose computing architecture CUDA by taking advantages of...
详细信息
A new construction method for polyphase sequences with two-valued periodic auto- and crosscorrelation functions is proposed. This method gives L families of polyphase sequences for each prime length L which is bigger ...
详细信息
暂无评论