One of the best sources of information for biologists and ethologists to study wildlife behavior is video footage;in particular, aerial video footage provides a unique perspective on the behavior of animals in their n...
详细信息
In multi-robot collaborative exploration of unstructured environments, redundant exploration often occurs, leading to significant consumption of computational resources and thus reducing exploration efficiency. To add...
详细信息
Facial expression recognition (FER) plays a crucial role in domains such as healthcare and access security. Traditional models primarily utilize convolutional networks to extract features like facial landmarks and pos...
详细信息
Facial expression recognition (FER) plays a crucial role in domains such as healthcare and access security. Traditional models primarily utilize convolutional networks to extract features like facial landmarks and positions of facial features. However, these methods often result in feature maps with significant redundancy, contributing minimally to network performance enhancement. To address this limitation, we propose the DPConv module, which innovatively segments the channel dimension and applies dual convolutional kernel sizes. This module replaces several convolutional blocks within the POSTER++ (Mao et al. in POSTER++: A Simpler and Stronger Facial Expression Recognition Network. arXiv:2301.12149, 2023) architecture, leading to a reduction in parameters while simultaneously enhancing network efficiency and accuracy. Moreover, we propose a sliding window multi-head cross-self-attention mechanism, which is based on the sliding window multi-head self-attention (Liu et al. in Proceedings of the IEEE/CVF internationalconference on computer Vision, 2021) mechanism, which substitutes the conventional attention mechanism, facilitating the modeling of global dependencies and further optimizing the network's overall performance. Our model, DPPOSTER, was tested on the RAF-DB, FERPlus and SFEW datasets, and experimental comparisons were conducted with different combinations of convolution kernel sizes and channel segmentation ratios. The results showed that DPPOSTER achieved performance improvements of 0.59%, 0.37% and 2.32% over POSTER++ on the RAF-DB, FERPlus and SFEW datasets, respectively.
With the development of science and technology and the change of mode, the reasoning of the enemy's intention has been introduced into the battlefield, and the decision-making of air defense has higher requirement...
详细信息
ARINC653 is an important standard for IMA integrated modular avionics systems. In today's rapidly developing world, single core ARINC653 can no longer meet the increasingly diverse tasks and complex systems. There...
详细信息
Deep learning-primarily based aid allocation algorithms for 6G networks enable enhanced community overall performance via close to most useful schedulers. those algorithms leverage deep neural network architectures to...
详细信息
In order to dynamically create a sequence of textual descriptions for images, image description models often make use of the attention mechanism, which involves an automatic focus on different regions within an image....
详细信息
Air pollution can affect human health, so it is necessary to predict the air quality index (AQI) in advance. In this work, air quality data collected by the Internet of Drone Things (IoDT) is predicted and analyzed to...
详细信息
In this paper, a cloud platform-based power system energy load forecasting method is proposed to improve the efficiency and accuracy of power system energy load forecasting. Firstly, the architecture of distributed po...
详细信息
This paper introduces a novel approach for assessing piano performance through video analysis using Dynamic Time Warping (DTW). Traditional methods of evaluating piano playing often rely on auditory cues or sheet musi...
详细信息
暂无评论