Edge computing is an emerging promising computing paradigm that brings computation and storage resources to the network edge, significantly reducing service latency. In this paper, we aim to divide the task into sever...
详细信息
Self-supervised learning (SSL) has drawn increasing attention in histopathological image analysis in recent years. Compared to contrastive learning which is troubled with the false negative problem, i.e., semantically...
详细信息
A miniaturized filtering power divider with wide stopband based on quarter-mode substrate integrated waveguide (QMSIW) is proposed. The physical size of the QMSIW is reduced by 3/4 comparing with the traditional SIW s...
详细信息
Compared to the linear MIMO detectors, the Belief Propagation (BP) detector has shown greater capabilities in achieving near optimal performance and better nature to iteratively cooperate with channel decoders. Aiming...
详细信息
Traditional multislice iterative phase retrieval (MIPR) from snapshot two-dimensional measurements suffers from the two limitations of pre-defined support and iterative stagnation. To eliminate the requirements for pr...
详细信息
Non-autoregressive translation (NAT) models are typically trained with the cross-entropy loss, which forces the model outputs to be aligned verbatim with the target sentence and will highly penalize small shifts in wo...
详细信息
Spatiotemporal action detection relies on the learning of video spatial and temporal information. The current state-of-the-art convolutional neural network-based action detectors have achieved remarkable results using...
详细信息
ISBN:
(纸本)9781665464697
Spatiotemporal action detection relies on the learning of video spatial and temporal information. The current state-of-the-art convolutional neural network-based action detectors have achieved remarkable results using 2D CNN or 3D CNN architectures. However, due to the complexity of the network structure and spatiotemporal information perception, these methods are usually used in a non-real-time, offline manner. The main challenge of spatiotemporal action detection is to design an effective detection network architecture and effectively perceive the fused spatiotemporal features. Aiming at the above problems, our paper proposes a real-time action detection method based on multi-scale spatiotemporal feature. Aiming at the problem that only 2D or 3D backbone network cannot effectively model spatiotemporal features, we extract spatiotemporal features by multi-branch feature extraction networks respectively. For the lack of descriptiveness of single-scale spatiotemporal features, a multi-scale spatiotemporal feature-aware attention network is proposed to learn long-term temporal dependencies and spatial context information. And the fusion between temporal and spatial features is guided by fusion attention to highlight more discriminative spatiotemporal feature representations. The proposed method achieves 82.59% and 78.30% accuracy on two spatiotemporal action datasets UCF101-24 and JHMDB-21, respectively and reaching 73 frames/s.
Distributed document representation is one of the basic problems in natural language processing. Currently distributed document representation methods mainly consider the context information of words or sentences. The...
详细信息
Diffusion models have demonstrated promising results in text-to-audio generation tasks. However, their practical usability is hindered by slow sampling speeds, limiting their applicability in high-throughput scenarios...
详细信息
暂无评论