This paper presents a novel approach for dense scene text detection called DSSNet (Dense Script Spotter Network). The network leverages ResNet and FPN for feature extraction, employing multi-scale feature fusion and T...
详细信息
This article discusses the problem of shaft rotation control for continuous testing of industrial equipment using a radar sensor. The FMCW radar with a frequency of 77 Hz is used to irradiate a rotating shaft and...
详细信息
Image fusion combines images from multiple domains into one image, containing complementary information from source domains. Existing methods take pixel intensity, texture and high-level vision task information as the...
详细信息
vision-Language Models for remote sensing have shown promising uses thanks to their extensive pretraining. However, their conventional usage in zero-shot scene classification methods still involves dividing large imag...
详细信息
Deep learning for action recognition is an important technology for understanding videos. However, collecting video training dataset for deep learning model with low cost while maintaining enough diversity is challeng...
详细信息
Human-hand gesture recognition using millimetre wave radar is attractive in human-computer interfaces, industrial Internet of Things, and smart home. However, the existing CNN or RNN model is so complex and large that...
详细信息
This paper contains the way of making a portable acquisition system of a sEMG signal from the extensor digitorum muscle with real-time processing of this signal to generate hand grip force information. The system repr...
详细信息
Human Activity Recognition (HAR) such as fall detection has become increasingly critical due to the aging population, necessitating effective monitoring systems to prevent serious injuries and fatalities associated wi...
详细信息
Zero-shot Human-Object Interaction (HOI) detection aims to identify both seen and unseen HOI categories in an image. Most existing methods rely on semantic knowledge distilled from CLIP to find novel interactions but ...
详细信息
Recent multimodal foundation models are primarily trained on English or high resource European language data, which limits their applicability to other medium and low-resource languages, such as the Indian languages. ...
暂无评论