We introduce a novel stacked ensemble classifier for the unconstrained recognition of known and unknown gestural input data in nonverbal communication with a social robot. The architecture utilizes three separate CNNs...
详细信息
ISBN:
(纸本)9781665405409
We introduce a novel stacked ensemble classifier for the unconstrained recognition of known and unknown gestural input data in nonverbal communication with a social robot. The architecture utilizes three separate CNNs of different expected data input size and combines their output predictions to a unified estimate. Analysis shows that in comparison to a single CNN architecture, the combined estimate reduces prediction confidence values for unknown gestural movement segments, making the system able to identify unknown data input with higher certainty under both laboratory and real environment conditions. In a human-robot interaction experiment, we are able to improve unknown class detection accuracy by up to 40% under maintained or equal known class recognition performance, and hence considerably enhance the overall robustness of the recognition system.
To address the issues of low search efficiency, redundant nodes, and poor smoothness associated with the traditional A∗ algorithm in robotic path planning, this paper proposes a robot path planning algorithm that inte...
详细信息
Binocular vision-based target detection is one of the hot topics in computer vision, where the technique aims to detect and localize target objects in images. The technology has applications in fields such as autonomo...
详细信息
Brain Computer Interfaces establishes an interaction mode between the human brain and external devices, and has broad application prospects. With the development of neuroscience and artificial intelligence technology,...
详细信息
ISBN:
(纸本)9798350374520;9798350374513
Brain Computer Interfaces establishes an interaction mode between the human brain and external devices, and has broad application prospects. With the development of neuroscience and artificial intelligence technology, brain computer interfaces have rapidly evolved from a single perception stage to a multimodal cognition and complex interaction control stage. This article elaborates on the functional zoning of the brain and cerebellum and the latest research on brain signal classification. Afterwards, the latest research progress of deep learning based EEG signalprocessing technology was analyzed in depth from three key directions: brain network analysis, research on neurological diseases, cognitive analysis, and emotion recognition. This paper provides a detailed description of the typical structure and main functional unit design ideas of Brain Computer Interface chips, and analyzes the core and difficult technologies from the aspects of acquisition, 3D heterogeneous integration, low-power design, software and brain science research. Finally, based on typical Brain Computer Interface design and application cases, the future development trends of Brain Computer Interface chips and deep learning EEG technology are proposed.
The automation scenario in the current industrial as well as domestic applications has seen an exponential growth over the decade. robot plays an important role in industrial automation but in some cases, it needs som...
详细信息
Agriculture is an important sector that contributes significantly to Vietnam's GDP. The use of pesticides and herbicides, however, raises great concern due to their harm to the environment and human health. The go...
详细信息
The speaker diarization task answers the question "who is speaking at a given time?". It represents valuable information for scene analysis in a domain such as robotics. In this paper, we introduce a tempora...
详细信息
For the automatic installation requirements of spacecraft components, an automatic assembly scheme based on the cooperation of dual robots A large load robot is used to grab large weight components, and accurately tra...
详细信息
vision transformers (ViTs) have generated significant interest in the computer vision community because of their flexibility in exploiting contextual information, whether it is sharply confined local, or long range gl...
详细信息
ISBN:
(纸本)9781728198354
vision transformers (ViTs) have generated significant interest in the computer vision community because of their flexibility in exploiting contextual information, whether it is sharply confined local, or long range global. However, they are known to be data hungry and therefore often pretrained on large-scale datasets, e.g. JFT-300M or ImageNet. An ideal learning method would perform best regardless of the size of the dataset, a property lacked by current learning methods, with merely a few existing works studying ViTs with limited data. We propose Group Masked Model Learning (GMML), a self-supervised learning (SSL) method that is able to train ViTs and achieve state-of-the-art (SOTA) performance when pre-trained with limited data. The GMML uses the information conveyed by all concepts in the image. This is achieved by manipulating randomly groups of connected tokens, successively covering different meaningful parts of the image content, and then recovering the hidden information from the visible part of the concept. Unlike most of the existing SSL approaches, GMML does not require momentum encoder, nor relies on careful implementation details such as large batches and gradient stopping. Pretraining, finetuning, and evaluation codes are available under: https://***/GMML.
With the continuous development of robot technology in the field of industrial production, the introduction of biological electrical signals to control robots will not only provide a new way of human-computer interact...
详细信息
ISBN:
(纸本)9798331528911;9798331528928
With the continuous development of robot technology in the field of industrial production, the introduction of biological electrical signals to control robots will not only provide a new way of human-computer interaction operation, but also improve the security and stability of the system. Among them, SEMG has a high signal-to-noise ratio, and the robot control accuracy based on SEMG is higher compared to other bioelectrical signals. This paper studies the classification and identification method of sEMG signals. Classified and of signals by constructing a one-dimensional convolutional neural network model. By studying the one-dimensional convolutional neural network from the horizontal and vertical dimensions respectively, the experimental results show that the accuracy of the horizontal dimension recognition is higher, which can reach 93.28%.
暂无评论