ISBN (digital): 9789887581536
ISBN (print): 9781665482561
In one-stage methods for video moment retrieval, the common representations indirectly supervised by boundary prediction fail to fully preserve the inherent characteristics of the video and query, which limits the retrieval *** solve this problem, an Adversarial Video Moment Retrieval (AVMR) algorithm is proposed to learn the common representations with modality invariance and cross-modal *** is implemented through the process of adversarial learning between a feature projector and a modality classifier. The feature projector tries to generate a modality-invariant common representation and to confuse the modality classifier. The modality classifier tries to discriminate between different modalities based on the representation generated by the feature projector. *** triplet constraints are further imposed on the feature projector to preserve the underlying cross-modal semantic structure of *** experimental results show that AVMR surpasses the baseline Attentive Cross-modal Relevance Matching (ACRM) by 1.10% and 1.73% in the "mIoU" metric on two public datasets, Charades-STA and TACoS, respectively.
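The abstract above mentions triplet constraints on the feature projector without giving their form. As an illustration only, the sketch below shows the generic margin-based triplet loss. It assumes Euclidean distance, a margin of 0.2, and made-up two-dimensional embeddings; the names `triplet_loss`, `video`, `matching_query`, and `unrelated_query` are hypothetical and are not taken from AVMR.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def triplet_loss(anchor, positive, negative, margin=0.2):
    """Generic hinge-style triplet loss (an assumed form, not AVMR's exact loss).

    The loss is zero once the anchor is closer to the positive than to the
    negative by at least `margin`; otherwise it grows with the violation.
    """
    gap = euclidean(anchor, positive) - euclidean(anchor, negative)
    return max(0.0, gap + margin)


# Hypothetical common-space embeddings: one video moment and two text queries.
video = [0.9, 0.1]
matching_query = [0.8, 0.2]
unrelated_query = [0.1, 0.9]

# Constraint satisfied: the matching query is much closer to the video.
satisfied = triplet_loss(video, matching_query, unrelated_query)

# Constraint violated: the roles of positive and negative are swapped.
violated = triplet_loss(video, unrelated_query, matching_query)
```

In this toy setup `satisfied` is zero and `violated` is positive, which is the behaviour that pushes semantically matched video-query pairs together in the common space.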
The rise of Artificial Intelligence for Science (AI4S) has highlighted the importance and urgency of ensuring openness, fairness, impartiality, diversity, and sustainability in scientific systems. Existing scientific...
This paper focuses on the design of non-fragile H∞ filtering for fuzzy discrete-time systems with Markovian jumps and data loss. The system is represented by a Takagi-Sugeno (T-S) fuzzy model. The imperfect infor...
Dear Editor, This letter is concerned with visual perception closely related to heterogeneous *** the huge challenge brought by different image modalities, we propose a visual perception framework based on heterogeneous image knowledge, i.e., the domain knowledge associated with specific vision tasks, to better address the corresponding visual perception problems.
Authors: Zhang, Chunjie; Zheng, Xiaolong
Beijing Jiaotong University, Institute of Information Science, Beijing 100044, China
Beijing Jiaotong University, Beijing Key Laboratory of Advanced Information Science and Network Technology, Beijing 100044, China
University of Chinese Academy of Sciences, State Key Laboratory of Multimodal Artificial Intelligence Systems, The State of Key Laboratory of Management and Control for Complex System, Institute of Automation, Chinese Academy of Sciences, School of Artificial Intelligence, Beijing 100190, China
Most image classification methods are designed to either boost the classification accuracies with abundant supervision, or cope with the shortage of supervision information. This is often achieved by using the visual ...
In this paper, we propose a safety-critical controller based on time-varying control barrier functions (CBFs) for a robot with a unicycle model in the continuous-time domain to achieve navigation and dynamic collisio...
Power systems are essential to national security, economic prosperity, public health, and safety. However, as the frequency of extreme events and man-made attacks has increased dramatically in recent years, making res...
LiDAR-based place recognition is popular for loop closure detection and re-localization. In recent years, deep learning has improved place recognition through learnable feature extraction. However, these methods degenerate when the robot revisits previous places with a large perspective difference. To address this challenge, we propose DeepRING to learn a roto-translation invariant representation from a LiDAR scan, so that a robot visiting the same place from a different perspective obtains similar representations. There are two key ideas in DeepRING: the feature is extracted from the sinogram, and the feature is aggregated by the magnitude spectrum. Together, these two steps give the final representation both discriminative power and roto-translation invariance. Moreover, we formulate place recognition as a one-shot learning problem with each place being a class, leveraging relation learning to build representation similarity. Extensive experiments are carried out on public datasets, validating the effectiveness of each proposed component and showing that DeepRING outperforms the compared methods, especially in dataset-level generalization.
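The role of the magnitude spectrum in the abstract above rests on a standard Fourier property: circularly shifting a sequence changes only the phase of its discrete Fourier transform, not its magnitude. The sketch below checks this numerically on a made-up one-dimensional row of values indexed by angle. It is a minimal illustration of that property under these assumptions, not the DeepRING pipeline; `dft_magnitude`, `circular_shift`, and the sample data are hypothetical.

```python
import cmath
import math


def dft_magnitude(signal):
    """Magnitude of the discrete Fourier transform (naive O(n^2) version)."""
    n = len(signal)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(signal)))
        for k in range(n)
    ]


def circular_shift(signal, shift):
    """Rotate the list to the right by `shift` positions, wrapping around."""
    shift %= len(signal)
    return signal[-shift:] + signal[:-shift]


# Hypothetical feature row sampled over eight angles.
row = [0.0, 1.0, 3.0, 2.0, 0.5, 0.0, 4.0, 1.5]

# A change of heading is modelled here as a circular shift along the angle axis.
shifted = circular_shift(row, 3)

mag_original = dft_magnitude(row)
mag_shifted = dft_magnitude(shifted)

# Largest element-wise difference between the two magnitude spectra.
max_gap = max(abs(a - b) for a, b in zip(mag_original, mag_shifted))
```

Here `max_gap` is zero up to floating-point rounding even though `shifted` differs from `row`, which is why aggregating along the angle axis with a magnitude spectrum can discard the shift while keeping the remaining signal content.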
An accurate and straightforward symplectic method is presented for the fracture analysis of fractional two-dimensional (2D) viscoelastic *** fractional Kelvin-Zener constitutive model is used to describe the time-dependent behavior of viscoelastic *** the framework of symplectic elasticity, the governing equations in the Hamiltonian form for the frequency domain (s-domain) can be directly and rigorously *** the s-domain, the analytical solutions of the displacement and stress fields are constructed by superposing the symplectic eigensolutions without any trial function, and the explicit expressions of the intensity factors and J-integral are derived *** studies are provided to validate the accuracy and effectiveness of the present solutions. A detailed analysis is made to reveal the effects of viscoelastic parameters and applied loads on the intensity factors and J-integral.
Before launching a spacecraft, it is necessary to undergo micro low gravity simulations on the ground to test its reliability. The lifting method is not limited by movement time and space, and can simulate long-term l...