Skeleton-based action recognition has long been an important research topic in computer vision. Most researchers in this field currently focus on actions performed by a single person, while very little work is dedicated to recognizing interactions between two people. However, interaction recognition is arguably more important in practice, since actions are often performed by multiple people. Designing an effective scheme to learn discriminative spatial and temporal representations for skeleton-based interaction recognition remains a challenging problem. Focusing on the characteristics of skeleton data for interactions, we first define the moving distance to distinguish the action status of the participants. We then propose view-invariant relative features to represent the spatial and temporal relationships within the skeleton sequence. Further, we propose a new coding method to obtain the relative feature representations. Finally, we design a three-stream CNN model to learn deep features for interaction recognition. We evaluate our method on the SBU, NTU RGB+D 60, and NTU RGB+D 120 datasets. The experimental results verify that our method is effective and robust compared with current state-of-the-art methods.
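The abstract does not give the exact definitions of the moving distance or the relative features, so the following is only a minimal sketch under stated assumptions. It assumes the moving distance is the total Euclidean displacement of a person's joints across consecutive frames, that the participant with the larger value is treated as the active one, and that inter-person joint distances serve as one example of a view-invariant relative feature (they do not change under camera rotation or translation). The function names `moving_distance`, `split_active_passive`, and `relative_distances` are hypothetical and are not taken from the paper.

```python
import math
from typing import List, Sequence, Tuple

Joint = Tuple[float, float, float]   # (x, y, z) coordinates of one joint
Frame = Sequence[Joint]              # all joints of one person in one frame
Skeleton = Sequence[Frame]           # one person's joints over time


def moving_distance(seq: Skeleton) -> float:
    """Assumed definition: sum of per-joint Euclidean displacement
    between every pair of consecutive frames."""
    total = 0.0
    for prev, curr in zip(seq, seq[1:]):
        for p, c in zip(prev, curr):
            total += math.dist(p, c)
    return total


def split_active_passive(a: Skeleton, b: Skeleton) -> Tuple[Skeleton, Skeleton]:
    """Return (active, passive); the person who moves more is assumed active."""
    if moving_distance(a) >= moving_distance(b):
        return a, b
    return b, a


def relative_distances(frame_a: Frame, frame_b: Frame) -> List[List[float]]:
    """Pairwise joint-to-joint distances between two people in one frame.
    Distances are unchanged by rotating or translating the viewpoint."""
    return [[math.dist(ja, jb) for jb in frame_b] for ja in frame_a]
```

For example, with two-joint skeletons where one person stays still and the other shifts by 4 units along one axis, `moving_distance` returns 0.0 and 8.0 respectively, and `split_active_passive` places the moving person first. A full pipeline would still need the coding step and the three-stream CNN, which the abstract names but does not specify.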
With the rapid development of multimedia technology, audio-visual learning has emerged as a promising research topic within the field of multimodal analysis. In this paper, we explore parameter-efficient transfer lear...
Micro-expressions (MEs) have emerged as a viable strategy for affective estimation due to their high reliability in emotion detection. In recent years, deep learning methods have been successfully applied to the field ...
Trajectory classification algorithms are widely used in fields such as behavior recognition, anomaly detection and monitoring, and video content analysis. To overcome the issues of traditional trajectory classificatio...
Direction of Arrival (DoA) estimation is a key technology in array signal processing. One-bit quantization is a popular method for reducing hardware costs in DoA estimation. However, one-bit quantization introduces si...
Direction of Arrival (DOA) estimation is a crucial technology in array signal processing. One-bit quantization is commonly employed to reduce hardware costs in DOA estimation, yet it introduces significant quantizatio...
Task-oriented dialogue systems (TOD) aim to help users complete specific tasks through multiple rounds of dialogue, in which Dialogue State Tracking (DST) is a key component. The training of DST models typically neces...
In this paper, we propose a novel convolutional neural network (MDR-Net) for ultrasound image segmentation by exploiting multi-decision and deep refinement of the target. Our MDR-Net consists of two main parts, i.e., ...
Dynamic facial expression recognition (DFER) in the wild is still hindered by data limitations, e.g., insufficient quantity and diversity of pose, occlusion and illumination, as well as the inherent ambiguity of facia...
This paper proposes a left-handed circularly polarized (CP), sequentially rotated metasurface-based (MTS) antenna array for the C-band. The array comprises a cluster of 4 × 4 periodically aligned MTS ...