Sign language plays an important role in information transmission and emotional communication between the hearing-impaired and the outside world. It is expressed through a rapid and complex combination of gestures, bo...
Sign language plays an important role in information transmission and emotional communication between the hearing-impaired and the outside world. It is expressed through a rapid and complex combination of gestures, body postures and facial expressions. In this paper, Google's MediaPipe framework is used to extract and process key point information of hands. Depth features are used to supplement the RBG diagram. A framework of skeleton sensing multi-modal recognition suitable for complex scenes is proposed. This can be used to deal with the problem of sign language recognition of multi-modal information in different scenes. However, most of the existing sign language recognition methods are traditional convolutional networks and LSTM. Although they have excellent achievements in image recognition and video recognition, their application in sign language recognition does not conform to the characteristics of sign language. In this paper, a Swin Transformer model is used to process both depth information and RGB information. Finally, statistical data fusion method was used to fuse the depth prediction results with RGB prediction results. Experimental analysis shows that this method performs well in sign language recognition in complex scenes compared with existing methods.
The inverse problem of ultrasound beamforming has recently emerged as a novel imaging method, attracting growing interest from the research community. Inverse problems in ultrasound beamforming hypothesize that the re...
详细信息
ISBN:
(数字)9798350387384
ISBN:
(纸本)9798350387391
The inverse problem of ultrasound beamforming has recently emerged as a novel imaging method, attracting growing interest from the research community. Inverse problems in ultrasound beamforming hypothesize that the received signal is linearly correlated with the beamformed image. Based on this assumption, a minimization problem comprises a fidelity term derived from the linear model and a regularization term that considers prior information about the underlying image. This study introduces the non-local structure tensor total variation as the regularization term to exploit the image’s local geometry and non-local self-similarity properties. Such regularization can also preserve the edge or orientation of an image. The simulation results demonstrate that using a single plane wave, the proposed method can achieve a spatial resolution of 0.34 mm full-width at half-maximum, a contrast-to-noise ratio of 16.82 dB, and a generalized contrast-to-noise ratio of 0.99.
As a new type of environmental protection technology that can treat sewage and generate electricity, microbial fuel cells (MFC) have broad research prospects. In recent years, MFC has made great breakthroughs, but its...
详细信息
Natural Language Inference (NLI) is a branch of Natural Language Processing (NLP) whose main task is to determine the relationship between two sentences. Such tasks essentially use pre-trained models to ensure accurac...
Natural Language Inference (NLI) is a branch of Natural Language Processing (NLP) whose main task is to determine the relationship between two sentences. Such tasks essentially use pre-trained models to ensure accuracy, and many applications are found in human-computer dialog and question-answering systems. However, in some cases where small devices are deployed offline, lightweight model implementations are often required to save computational resources. To address this problem and ensure more efficient inference capabilities, this paper describes how to improve the preexisting Transformer and proposes a Multi-Feature Fusion Transformer Network for NLI processing. The model integrates sequence features and introduces a priori information to enhance local features. It also achieves a comprehensive fusion of sequence information, local features, and non-local features, thus compensating for many potential shortcomings of Transformer. At the same time, it maintains the lightweight feature of the model and facilitates the implementation of downstream tasks. We verified the above point by experiments, our model has better inference performance compared to previous models used for the same task. The accuracy in the SNLI dataset reached 89.0%, while the matched and mismatched versions of MultiNLI reached 79.3% and 78.7%.
In this paper, a robust dynamic surface control method is designed for high-order strict feedback systems with nonlinear uncertainties. Based on the idea of dynamic surface control, a series of first-order low-pass fi...
In this paper, a robust dynamic surface control method is designed for high-order strict feedback systems with nonlinear uncertainties. Based on the idea of dynamic surface control, a series of first-order low-pass filters are introduced to obtain the derivative of the virtual control law, and the controller is designed directly for each higher-order subsystem without converting it into the first-order one, hence is more concise and efficient. It is proved by the Lyapunov stability theory that the tracking error can converge to a small domain around zero. The effectiveness of the proposed algorithm is also verified by a simulation with the flexible joint robotic arm system.
Intelligent transportation systems have become increasingly important for efficient traffic management and road safety. Vehicle classification is a fundamental task in these systems, enabling various applications such...
详细信息
This paper focuses on exploring the problem of achieving leader-follower consensus in uncertain Euler-Lagrange multi-agent systems, which are subjected to disturbance, and operate on switched digraph. To be more preci...
This paper focuses on exploring the problem of achieving leader-follower consensus in uncertain Euler-Lagrange multi-agent systems, which are subjected to disturbance, and operate on switched digraph. To be more precise, the system dynamics are characterized by uncertainties that can be linearly parameterized, and the disturbances that are considered are those originating from the leader system. A novel adaptive distributed control strategy is proposed to overcome the difficulties of suppressing disturbances and achieving consistency under the condition of joint connectivity of switched digraphs. By using Lyapunov stability theory and the generalized Barbalat's lemma, it is proved that the uncertain Euler-Lagrange multi-agent systems achieves state consensus asymptotically with the proposed distributed adaptive control protocol. The efficacy of the adaptive distributed control strategy proposed in the paper is validated by presenting a case study involving a system composed of four double-link manipulators.
This contribution presents a novel algorithm for real-time simulation of adaptive matrix- and pixel-headlights for motor vehicles. The simulation can generate the light distribution of a pair of pixel-headlamps with a...
详细信息
Aero-optic imaging is a kind of optical effect,which describes the imaging deviation on the imaging *** this paper,the effect of the change of Mach number of blunt aircraft on the aero-optic imaging deviation is *** i...
详细信息
Aero-optic imaging is a kind of optical effect,which describes the imaging deviation on the imaging *** this paper,the effect of the change of Mach number of blunt aircraft on the aero-optic imaging deviation is *** imaging deviations of Mach number 0.5—3 are analyzed *** results show that with the increase of Mach number,imaging deviation increases gradually,and the increase rate is gradually *** deviation slope decreases gradually with the increase of Mach number,and gradually tends to be zero,suggesting that imaging deviation is not sensitive to the change of the larger Mach *** other words,the Mach number of smaller changes can lead to larger imaging *** the Mach number of the aircraft increases,the slope of the imaging offset tends to be closer and closer to *** the Mach number of the aircraft increases to a certain extent,the change of the imaging offset will not have much ***,in order to reduce the impact of flight speed on imaging migration,the aircraft should fly at a higher Mach number.
The advent of Beyond 5G (B5G) and the anticipated arrival of 6G have spurred a remarkable impact on various aspects of human life. Next-generation Human Activity Recognition (HAR) systems are poised to advance healthc...
详细信息
ISBN:
(数字)9798350375503
ISBN:
(纸本)9798350375510
The advent of Beyond 5G (B5G) and the anticipated arrival of 6G have spurred a remarkable impact on various aspects of human life. Next-generation Human Activity Recognition (HAR) systems are poised to advance healthcare, create smart environments, and enhance overall well-being. The imperative for next-gen HAR systems lies in their capability to be intelligent, privacy-preserving, and deeply accurate. These systems, leveraging the cutting-edge capabilities of B5G and 6G, such as Re-configurable Intelligent Surface (RIS), aim to revolutionize the monitoring process and intelligently discern various human activities. Hence, this paper introduces BSgActiv, a smart RIS-enhanced HAR system. BSgActiv utilizes fractional wavelet transform to effectively highlight time and frequency features of activities from the measured channel state information (CSI) reflected from RIS. Afterward, these features are utilized to train a recurrent neural network that records the temporal characteristics of the input, hence promoting activity recognition. Moreover, BSgActiv integrates diverse modules that enhance the deep model's overall observation and ability to withstand and perform well in the presence of noise. BSgActiv evaluations contain two different realistic scenarios, including both non-line-of-sight and multi-floor setups, proofing its efficacy. Particularly, BSgActiv overcomes benchmark techniques and delivers an activity recognition accuracy of 89.7%.
暂无评论