Integrating deep learning in speech separation has revolutionized audio signal processing, impacting fields like speech recognition, audio-visual content creation, telecommunication, hearing aid technologies, etc. In ...
详细信息
Considering the varying advantages, disadvantages, and implementation difficulties of current indoor positioning algorithms, this paper conducts a comparative analysis of common UWB ranging methods. The Two-Way Rangin...
详细信息
Accurate self-localization of unmanned aerial systems (UAS) is needed to reduce their dependency on global navi-gation satellite systems (GNSS). Image retrieval techniques comparing aerial images with a reference data...
详细信息
With the development of autonomous driving technology, vehicles integrate camera, lidar, and radar modules for environment perception. In mass-produced models, cameras and lidars mostly serve as primary sensors, while...
In an increasingly globalized economy, the communication of corporate image plays a pivotal role in shaping perceptions and fostering relationships with international stakeholders. This study explores the design and i...
详细信息
This paper proposes a novel approach to video summarisation by introducing a feature variance model that enhances video frame extraction. In typical video record aggregation systems, a variety of image processing task...
详细信息
With the further research of deep learning, power companies have gradually eliminated the prevention and control by manual inspection, and have adopted deep learning to identify the safety hazards of power equipment, ...
详细信息
The proceedings contain 122 papers. The topics discussed include: DFrFT-ES model for emotion recognition based on fractional Fourier transform of EEG signals;research on traffic sign recognition under complex meteorol...
ISBN:
(纸本)9781510687615
The proceedings contain 122 papers. The topics discussed include: DFrFT-ES model for emotion recognition based on fractional Fourier transform of EEG signals;research on traffic sign recognition under complex meteorological conditions;diffusion-augmented learning for long-tail recognition;apple leaf scab recognition using CNN and transfer learning;container image management in cloud-edge environments: an image deletion method based on layer affinity;computer graphics and image processing techniques based on visualcommunication design;dynamic fusion and non-negative matrix factorization-based multi-view clustering method;convolutional recurrent neural network-based EEG signal classification in motor imagery;and sentiment classification of MOOC courses by merging local context focus and bi-directional gated recurrent unit.
We attempted to relate EEG brain activities evoked by naturalistic audio-visual video stimuli to the outputs of an audio-processing Transformer induced by audio inputs extracted from the same stimuli. We found a good ...
详细信息
ISBN:
(数字)9798331507022
ISBN:
(纸本)9798331507039
We attempted to relate EEG brain activities evoked by naturalistic audio-visual video stimuli to the outputs of an audio-processing Transformer induced by audio inputs extracted from the same stimuli. We found a good correspondence, especially in low-frequency brain activity. This is complementary to a previous study that showed good correspondence between high-frequency brain activity and a movie-processing Transformer. These suggest that combining audio- and movie-processing Transformers and using them as a brain simulator is promising. That is, by utilizing this, it should be possible to synthesize audio-visual video stimuli that can intervene in a variety of brain activities and functions.
With the rapid development of artificial intelligence and robotics, intelligent picking robots are being increasingly applied across various fields. However, how to efficiently identify and pick up the target object i...
详细信息
暂无评论