Acoustic-based human gesture recognition (HGR) offers diverse applications due to the ubiquity of sensors and touch-free interaction. However, existing machine learning approaches require substantial training data, ma...
详细信息
Acoustic-based human gesture recognition (HGR) offers diverse applications due to the ubiquity of sensors and touch-free interaction. However, existing machine learning approaches require substantial training data, making the process time-consuming, costly, and labor-intensive. Recent studies have explored cross-modal methods to reduce the need for large training datasets in behavior recognition, but they typically rely on open-source datasets that closely align with the target domain, limiting flexibility and complicating data collection. In this paper, we propose ${\sf Img2Acoustic}$ , a novel cross-modal acoustic-based HGR approach that leverages models trained on open-source image datasets (i.e., EMNIST, Omniglot) to effectively recognize custom gestures detected via acoustic signals. Our model incorporates a task-aware attention layer (TAAL) and a task-aware local matching layer (TALML), enabling seamless transfer of knowledge from image datasets to acoustic gesture recognition. We implement ${\sf Img2Acoustic}$ on commercial devices and conduct comprehensive evaluations, demonstrating that our method not only delivers superior accuracy and robustness compared to existing approaches but also eliminates the need for extensive training data collection.
In this study, we propose BioVLF-T, a novel automatic radiology report generation framework built on a Bio-Vision Language Foundational Model (VLF) with a temporal framework. BioVLF-T enhances the contextual understan...
详细信息
Understanding the near boundary acoustic oscillation of microbubbles is critical for the effective design of ultrasonic biomedical devices and surface cleaning ***,this study investigates the three-dimensional microbu...
详细信息
Understanding the near boundary acoustic oscillation of microbubbles is critical for the effective design of ultrasonic biomedical devices and surface cleaning ***,this study investigates the three-dimensional microbubble oscillation between two curved rigid plates experiencing a planar acoustic field using boundary integral method(BIM).The numerical model is validated via comparison with the nonlinear oscillation of the bubble governed by the modified Rayleigh-Plesset equation and with the axisymmetric model for an acoustic microbubble in infinite fluid ***,the influence of the wave direction and horizontal standoff distance(h)on the bubble dynamics(including jet velocity,jet direction,centroid movement,total energy,and Kelvin impulse)were *** was concluded that the jet velocity,the maximum radius and the total energy of the bubble are not significantly influenced by the wave direction,while the jet direction and the high-pressure region depend strongly on *** importantly,it was found that the jet velocity and the high-pressure region around the jet in acoustic bubble are drastically larger than their counterparts in the gas bubble.
This paper introduces the first prompt-based methods for aspect-based sentiment analysis and sentiment classification in Czech. We employ the sequence-to-sequence models to solve the aspect-based tasks simultaneously ...
详细信息
Parkinson's disease (PD) is a progressive neurological disorder that significantly impacts patients' quality of life. Accurate and early detection of PD is crucial for effective management and treatment. A stu...
详细信息
Attendance systems have become more modern, and one of the biometric systems without physical contact is face recognition. However, many face-based attendance systems still carry out attendance individually and cannot...
详细信息
ISBN:
(数字)9798350376968
ISBN:
(纸本)9798350376975
Attendance systems have become more modern, and one of the biometric systems without physical contact is face recognition. However, many face-based attendance systems still carry out attendance individually and cannot detect multiple faces simultaneously. In addition, capturing facial data in real-time is still a challenge because the relatively large distance between the camera and the individual reduces the ability to recognize faces. The general solution is to use super-resolution to generate better-quality faces while maintaining the main facial recognition features. One technique still being researched is super-resolution generative adversarial networks (SRGAN). SRGAN can enlarge the resolution of captured images and maintain image quality sufficient for face recognition. The attendance system can be easily integrated into edge devices such as the Jetson Nano. This paper proposes automatic and effective attendance systems with the super-resolution technique to detect and recognize faces in low-resolution input. The experimental results show that using face data capture with a resolution of 40 × 40 pixels and a four-fold magnification results in a resolution of 160 × 160 pixels. Combining Face SRGAN with FaceNet architecture as the basis of face recognition can achieve an accuracy rate of 78.19% and an F1-Score of 81.13% with an average processing time of 1.61 seconds per frame on a PC and 14.55 seconds per frame on a Jetson Nano at an average of face recognition per frame of as many as up to 8 faces simultaneously.
To keep up with the dynamic nature of the modern classroom, teachers must have access to online learning platforms for professional development purposes. This investigation looks at how these platforms can be used to ...
详细信息
Emotion recognition using brain-computer interfaces (BCIs) is an emerging field with the ongoing challenge of developing robust and efficient classification methods. This study introduces an original approach, the mul...
详细信息
This paper presents a method to estimate power system inertia in real-time using Phasor Measurement Unit (PMU) data and the swing equation. Inertia is essential for maintaining power system stability during disturbanc...
详细信息
This research is an approach to intelligent vehicles with a LoRa communication system, LoRaWAN compatible for Long-Range and Outdoor Communication, but in this paper, we will test the ability of LoRa to handle autonom...
详细信息
暂无评论