The paper considers a mathematical model for managing a scientific digital ecosystem exemplified by the agriculture case;the model enables assessing the impact science exerts on the regions' socioeconomic position...
详细信息
When a Pre-trained Language Model (PLM) is adopted in video grounding task, it usually acts as a text encoder without having its knowledge fully utilized. Also, there exists an inconsistency problem between the pre-tr...
When a Pre-trained Language Model (PLM) is adopted in video grounding task, it usually acts as a text encoder without having its knowledge fully utilized. Also, there exists an inconsistency problem between the pre-training and downstream objectives. To solve the issues, we propose a new paradigm, named Span-based Prompt Tuning (SPTNet). It can convert the video grounding task into a cloze form. Specifically, a query is first changed into a form with mask token by a template, then the video and the query embeddings are integrated through a cross-modal transformer. The start and end points of the query matching time span are predicted with the embedding of the mask token. Experimental results on two public benchmarks ActivityNet Captions and Charades-STA show that our SPTNet achieves surpassing performance compared with state-of-the-art methods.
WiFi-based indoor localization is essential for asset tracking, healthcare monitoring, and smart buildings. However, existing systems face challenges such as data variability, environmental noise, and difficulty detec...
详细信息
The maximal guaranteed result in a hierarchical game with an undetermined factor is found in the class of strategies with feedback. The stability of the problem under consideration concerning perturbations of the payo...
The maximal guaranteed result in a hierarchical game with an undetermined factor is found in the class of strategies with feedback. The stability of the problem under consideration concerning perturbations of the payoff function of the lower-level player is studied. Regularizing estimates are obtained.
Temporal sentence grounding aims to detect the target segment most related to a given query in an untrimmed video. To alleviate the expensive annotation cost for temporal labels, researchers paid more attention to wea...
Temporal sentence grounding aims to detect the target segment most related to a given query in an untrimmed video. To alleviate the expensive annotation cost for temporal labels, researchers paid more attention to weakly supervised setting. Prior studies neglected the utilization of video representation reconstruction, which led to an unbalanced alignment learning. Moreover, they used different strategies to generate proposals which ignored the temporal structure in a query. In this paper, we propose a novel Conditional Video-Text Reconstruction Network (CVTRN). It supports conditional reconstruction of video and text representation. Specifically, video and text features are fused to compute semantic alignment, which is the condition of reconstruction. A new mask strategy for mask conditioned sentence reconstruction is also devised. This strategy focuses more on boundary regions than the widely used Gaussian mask in previous methods. Experimental results on two public benchmark datasets show that our CVTRN outperforms the state-of-the-art methods.
Inspired by the success of volumetric 3D pose estimation, some recent human mesh estimators propose to estimate 3D skeletons as intermediate representations, from which, the dense 3D meshes are regressed by exploiting...
Inspired by the success of volumetric 3D pose estimation, some recent human mesh estimators propose to estimate 3D skeletons as intermediate representations, from which, the dense 3D meshes are regressed by exploiting the mesh topology. However, body shape information is lost in extracting skeletons, leading to mediocre performance. The advanced motion capture systems solve the problem by placing dense physical markers on the body surface, which allows to extract realistic meshes from their non-rigid motions. However, they cannot be applied to wild images without markers. In this work, we present an intermediate representation, named virtual markers, which learns 64 landmark keypoints on the body surface based on the large-scale mocap data in a generative style, mimicking the effects of physical markers. The virtual markers can be accurately detected from wild images and can reconstruct the intact meshes with realistic shapes by simple interpolation. Our approach outperforms the state-of-the-art methods on three datasets. In particular, it surpasses the existing methods by a notable margin on the SURREAL dataset, which has diverse body shapes. Code is available at https://***/ShirleyMaxx/VirtualMarker
Secure sum protocol is a significant secure multiparty computation protocol and it has various applications in privacy-preserving distributed multiparty computation. However, most existing secure sum protocols rarely ...
详细信息
Secure sum protocol is a significant secure multiparty computation protocol and it has various applications in privacy-preserving distributed multiparty computation. However, most existing secure sum protocols rarely considered how to resist underlying collusion which is a significant practical problem. Urabe et al. proposed a collusion-resistant secure sum protocol, but too much cost of communication and computation results in its low performance efficiency. In this paper, we propose security definitions to measure secure multiparty computation protocol's capability of resisting potential collusion. Then, we precisely analyze several previous secure sum protocols' capability of resisting collusion. In addition, considering realistic requirement to resist collusion and performance efficiency needs, we present a novel collusion-resisting secure sum protocol. Theoretical analysis and experimental results confirm that our secure sum protocol is efficient and has strong capability of resisting potential collusion such that it is much superior to previous ones. The communication overheads and computation complexity of our scheme both are linearity of the number of participants. Besides, our protocol's capability of resisting collusion is adjustable according to different security needs.
We construct a polynomial-time classical algorithm that samples from the output distribution of low-depth noisy Clifford circuits with any product-state inputs and final single-qubit measurements in any basis. This cl...
详细信息
In today’s digital world, Generative Artificial Intelligence (GenAI) such as Large Language Models (LLMs) is becoming increasingly prevalent, extending its reach across diverse applications. This surge in adoption ha...
详细信息
In the world of big data,it’s quite a task to organize different files based on their *** with heterogeneous data and keeping a record of every single file stored in any folder is one of the biggest problems encounte...
详细信息
In the world of big data,it’s quite a task to organize different files based on their *** with heterogeneous data and keeping a record of every single file stored in any folder is one of the biggest problems encountered by almost every computer *** of file management related tasks will be solved if the files on any operating system are somehow categorized according to their ***,the browsing process can be performed quickly and *** research aims to design a system to automatically organize files based on their similarities in terms of *** proposed methodology is based on a novel strategy that employs the charactaristics of both supervised and unsupervised machine learning approaches for learning categories of digital files stored on any computer *** results demonstrate that the proposed architecture can effectively and efficiently address the file organization challenges using real-world user *** results suggest that the proposed system has great potential to automatically categorize almost all of the user files based on their *** proposed system is completely automated and does not require any human effort in managing the files and the task of file organization become more efficient as the number of files grows.
暂无评论