Gait analysis is one of the most common techniques for neuromusculoskeletal assessment;as a result of this, several devices and techniques have been developed to obtain more accurate tests. Some of the developments in...
详细信息
ISBN:
(纸本)9798331517519;9798331517526
Gait analysis is one of the most common techniques for neuromusculoskeletal assessment;as a result of this, several devices and techniques have been developed to obtain more accurate tests. Some of the developments in gait analysis are vision systems, force platforms, and biosignal acquisition devices, which often have high costs, are non-portable and need to be used in controlled environments;therefore, they are not applicable in different conditions in which the operation requirements can not be met. This work describes a portable system for gait analysis composed of modules for the acquisition of electromyography signals, inertial measurement sensors for the calculation of the angular range of the joints, instrumented insoles for the analysis of plantar load, and software for the acquisition, processing, and online visualization of the information. The gait analysis development was tested on a healthy subject using a treadmill with three different velocities. The obtained data was analyzed, and some features were compared with the reported using professional systems.
Text-image de-contextualization, which uses inconsistent image-text pairs, is an emerging form of misinformation and drawing increasing attention due to the great threat to information authenticity. With real content ...
详细信息
ISBN:
(纸本)9781665405409
Text-image de-contextualization, which uses inconsistent image-text pairs, is an emerging form of misinformation and drawing increasing attention due to the great threat to information authenticity. With real content but semantic mismatch in multiple modalities, the detection of de-contextualization is a challenging problem in media forensics. Inspired by the recent advances in vision-language models with powerful relationship learning between images and texts, we leverage the vision-language models to the media de-contextualization detection task. Two popular models, namely CLIP and VinVL, are evaluated and compared on several news and social media datasets to show their performance in detecting image-text inconsistency in de-contextualization. We also summarize interesting observations and shed lights to the use of vision-language models in de-contextualization detection.
This paper addresses few-shot semantic segmentation (FSS) guided by text, where we classify unseen novel classes using image and text references as in-context examples, without the need for training. We enhance the qu...
详细信息
Content-based video retrieval aims to retrieve near-duplicate entries from a database of a given query video. It plays an important role in combating video piracy. Robustness to video temporal dynamics is crucial for ...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
Content-based video retrieval aims to retrieve near-duplicate entries from a database of a given query video. It plays an important role in combating video piracy. Robustness to video temporal dynamics is crucial for a representation model in video retrieval, as frames extracted from two copied videos are hardly temporally aligned in actual situations. However, current image retrieval datasets have difficulty in evaluating this robustness. To address this issue, we collect Similar Frame Dataset (SFD), which consists of 32,923 query-target pairs with 128,240 distraction images. The task of SFD is to retrieve the target frame from all items given a query frame. SFD is constructed by sampling frames from Kinetics-700 action classification dataset. An object detection model (Faster R-CNN) and a Multimodal Large Language Model (BLIP2) are used during sampling to select those valid frames. Besides, we propose Adjacent Frames Contrastive Learning (AFCL) framework. In AFCL, adjacent frames are sampled from unlabeled videos as positive pairs. An image representation model with robustness to changing frames can be trained under AFCL framework and achieve the state-of-the-art performance on SFD. The code will be released at https://***/Chuan-shanjia/Similar-Frame-Dataset.
During these past years, Socially Assistive robots (SARs) have been used to study the benefits of their uses with elderly people and people with dementia for healthcare purposes. Yet, almost all SARs have somewhat lim...
详细信息
ISBN:
(数字)9781665407311
ISBN:
(纸本)9781665407311
During these past years, Socially Assistive robots (SARs) have been used to study the benefits of their uses with elderly people and people with dementia for healthcare purposes. Yet, almost all SARs have somewhat limited perception capabilities or respond using simple pre-programmed behaviors and reactions, providing limited or repetitive interaction modalities. To overcome these limitations and take into consideration the strengths and weaknesses of SARs in healthcare settings, this paper presents T-Top, a tabletop robot designed with advanced audio and vision sensors, deep learning perceptual processing and telecommunication capabilities. Designed as a open hardware/software platform, the objective with T-Top is to provide an experimental platform that can implement richer interaction modalities and develop higher cognitive abilities from interacting with people.
作者:
Sahoo, Santosh KumarGodi, Rakesh Kumar
Department of Electronics and Instrumentation Engineering Hyderabad India
Department of Information Technology Hyderabad India
The research work dedicated for the object identification difficulty resolved by the techniques of principal component analysis (PCA) and linear discriminant analysis (LDA) along with robotic machine vision system. Th...
详细信息
Graph matching refers to establishing correspondence between two sets of point while keeping consistency between their edge sets. Recent works in learning-based graph matching have attempted to solve the problem eithe...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
Graph matching refers to establishing correspondence between two sets of point while keeping consistency between their edge sets. Recent works in learning-based graph matching have attempted to solve the problem either by linear assignment, which transfers local structure information into node embedding at individual graphs, or by quadratic assignment through vertex classification over their association graph. However, the former embedding-based pipeline methods often neglect second-order edge similarity, leading to decreased accuracy;while the latter quadratic assignment solvers consume significant memory due to huge computation on the association graph. To address these issues, our key idea is to integrate a factorized embedding module to efficiently propogate information over the association graph. To this end, we propose a novel factorized embedding-based network, namely FEGM, which takes into account the second-order edge similarity, as well as a factorization model of GCN network, so that we extend the embedding-based pipeline for learning the Lawler's QAP while reducing memory consumption. Experimental results show that FEGM achieves a competitive matching accuracy while being superior in time and space efficiency.
In industrial production, the dumping of raw material packaging is mostly done manually, which not only affects production efficiency but also endangers the health of workers. This article proposes an improved algorit...
详细信息
The proceedings contain 10 papers. The special focus in this conference is on Design and Architectures for signal and Image processing. The topics include: LiFT: Lightweight, FPGA-Tailored 3D Object Detection Based on...
ISBN:
(纸本)9783031878961
The proceedings contain 10 papers. The special focus in this conference is on Design and Architectures for signal and Image processing. The topics include: LiFT: Lightweight, FPGA-Tailored 3D Object Detection Based on LiDAR Data;A Practical HW-Aware NAS Flow for AI vision Applications on Embedded Heterogeneous SoCs;Endoscopy Image Classification for Wireless Capsules with CNNs on Microcontroller-Based Platforms;joint Underwater Depth Estimation and Dehazing from a Single Image Using Attention U-Net;KD-AHOSVD: Neural Network Compression via Knowledge Distillation and Tensor Decomposition;Novel Scheduling and Shifter Networks for 5G LDPC Decoders;Comparison Between In-Core Hardware IDS, Off-Core Hardware IDS and Software IDS;comparative Study of Memory Optimization Techniques for Dataflow-Modeled Applications.
Diabetic foot is a complication of diabetes mellitus caused by prolonged hyperglycemia. It can lead to serious consequences such as ulceration, infection, and even amputation. Early detection and treatment are essenti...
详细信息
暂无评论