With an ever-growing demand for BCI-based systems, numerous algorithms and machine learning systems have been proposed over the past few decades. Although state-of-theart approaches have reached practically appropriat...
详细信息
With an ever-growing demand for BCI-based systems, numerous algorithms and machine learning systems have been proposed over the past few decades. Although state-of-theart approaches have reached practically appropriate levels of accuracy, most are often regarded as black-box models, which need more explainability. However, for cases where a Neural Network is used as the base for a classifier, a Layerwise Relevance Propagation (LRP) approach can be utilized to analyze the decision boundaries considered by the network. By calculating the importance of the neuron in each layer, the LRP can also be used as an effective model complexity reduction technique through the inactivation (pruning) of the neural pathways. The following work investigates the usability of the LRP framework in the field of BCI. This study provides an example of the practical application of the LRP with respect to the EEG (ERP) dataset, along with visual heatmap and scalp map examples of the LRP. Furthermore, the work analyzes the impact of network pruning on heatmap visualization and the model’s accuracy while also practically determining the maximum cutoff range for pruning BCI models.
Transparent conductive oxides exhibit attractive optical nonlinearity with ultrafast response and giant refractive index change near the epsilon-near-zero(ENZ) wavelength, originating from the intraband dynamics of co...
详细信息
Transparent conductive oxides exhibit attractive optical nonlinearity with ultrafast response and giant refractive index change near the epsilon-near-zero(ENZ) wavelength, originating from the intraband dynamics of conduction electrons. The optical nonlinearity of ENZ materials has been explained by using the overall-effective-mass and the overall-scattering-time of electrons in the extended Drude model. However, their response to optical excitation is yet the last building block to complete the theory. In this paper, the concept of thermal energy is theoretically proposed to account for the total energy of conduction electrons exceeding their thermal equilibrium value. The time-varying thermal energy is adopted to describe the transient optical response of indium-tin-oxide(ITO), a typical ENZ material. A spectrally-resolved femtosecond pump-probe experiment was conducted to verify our theory. By correlating the thermal energy with the pumping density, both the giant change and the transient response of the permittivity of ITO can be predicted. The results in this work provide a new methodology to describe the transient permittivities of ENZ materials, which will benefit the design of ENZ-based nonlinear photonic devices.
The agricultural industry is essential to the world's food production, and it is critical to use cutting-edge technologies to increase crop productivity. We provide a revolutionary Crop Recommendation System (CRS)...
详细信息
Soft robots are uniquely suited for applications in inspection, search and rescue, and exploration in confined and unstructured environments. Leveraging the benefits of these soft robots will increasingly require the ...
详细信息
Soft robots are uniquely suited for applications in inspection, search and rescue, and exploration in confined and unstructured environments. Leveraging the benefits of these soft robots will increasingly require the integration of equally compliant electronic components for sensing, actuation, control, and computation to maintain mechanical conformability at the system level. In this work, we demonstrate the integration of modular electronic sensors into a robotic snake platform using stretchable interconnects. Aimed at applications in nuclear infrastructure inspection, included sensors include visual perception and on-board radiation spectroscopy. In addition to high mechanical compliance, the stretchable interconnect enables physical separation of analog sensors and digital electronics for use in radiation environments. We present details of stretchable electronics fabrication and integration, standalone validation of integrated sensors, and field test results from a simulated nuclear infrastructure inspection task.
Byzantine machine learning has garnered considerable attention in light of the unpredictable faults that can occur in large-scale distributed learning systems. The key to secure resilience against Byzantine machines i...
详细信息
We focus on the generalization ability of the 6-DoF grasp detection method in this paper. While learning-based grasp detection methods can predict grasp poses for unseen objects using the grasp distribution learned fr...
详细信息
Emotion recognition can help human-computer interactions by enabling systems to respond empathetically and adapt to users' emotional conditions. This capability improves user experience, supporting the development...
详细信息
ISBN:
(数字)9798331508579
ISBN:
(纸本)9798331508586
Emotion recognition can help human-computer interactions by enabling systems to respond empathetically and adapt to users' emotional conditions. This capability improves user experience, supporting the development of a more intuitive and emotionally responsive communication system. This study analyzes a bimodal approach based on gender (male and female) to recognize emotions without contextual information in dialogue analysis. Utilizing the Multimodal EmoryNLP dataset extracted from the TV series Friends with acted speech, we focused on four primary emotions: Angry, Neutral, Joy, and Scared. The model used in this study for text classification is RoBERTa, and wav2vec 2.0 is used for audio feature extraction with the Bi-LSTM model for classification. The experiment results using weighted F1-score reveal that data augmentation enhanced the performance of analyzing the original dataset from 0.46% to 0.52% and the male dataset from 0.43% to 0.51 %. In comparison, the female dataset remained consistent at 0.46%. The weighted F1-score and Unweighted Averaged Recall (UAR) from the male dataset are higher, 51 % and 48%, respectively, than those from the female dataset, 46% and 47%, respectively. Gender-based analysis indicated that male and female datasets exhibited distinct performance patterns, highlighting variations in emotional expression and recognition between genders. These findings underscore the effectiveness of multimodal strategies in emotion recognition and suggest that gender-specific factors play a significant role in enhancing classification performance. While these results highlight performance trends, further validation through repeated trials and statistical analyses could provide stronger generalizations and insights into gender-based differences.
Previous visual object tracking methods employ image-feature regression models or coordinate autoregression models for bounding box prediction. Image-feature regression methods heavily depend on matching results and d...
ISBN:
(纸本)9798331314385
Previous visual object tracking methods employ image-feature regression models or coordinate autoregression models for bounding box prediction. Image-feature regression methods heavily depend on matching results and do not utilize positional prior, while the autoregressive approach can only be trained using bounding boxes available in the training set, potentially resulting in suboptimal performance during testing with unseen data. Inspired by the diffusion model, denoising learning enhances the model's robustness to unseen data. Therefore, We introduce noise to bounding boxes, generating noisy boxes for training, thus enhancing model robustness on testing data. We propose a new paradigm to formulate the visual object tracking problem as a denoising learning process. However, tracking algorithms are usually asked to run in real-time, directly applying the diffusion model to object tracking would severely impair tracking speed. Therefore, we decompose the denoising learning process into every denoising block within a model, not by running the model multiple times, and thus we summarize the proposed paradigm as an in-model latent denoising learning process. Specifically, we propose a denoising Vision Transformer (ViT), which is composed of multiple denoising blocks. In the denoising block, template and search embeddings are projected into every denoising block as conditions. A denoising block is responsible for removing the noise in a predicted bounding box, and multiple stacked denoising blocks cooperate to accomplish the whole denoising process. Subsequently, we utilize image features and trajectory information to refine the denoised bounding box. Besides, we also utilize trajectory memory and visual memory to improve tracking stability. Experimental results validate the effectiveness of our approach, achieving competitive performance on several challenging datasets. The proposed in-model latent denoising tracker achieve real-time speed, rendering denoising learning
Maintaining up-to-date environmental models from initial deployment through long-term autonomy in service is critical for applications like navigation and task planning. To address the challenges of persistent monitor...
详细信息
Large Language Models (LLMs) have achieved remarkable success across diverse tasks, largely driven by well-designed prompts. However, crafting and selecting such prompts often requires considerable human effort, signi...
详细信息
暂无评论