Simultaneous Localization And Mapping (SLAM) is a technology aimed at concurrently ascertaining the position of a robot while mapping an unknown environment. Its core involves achieving accurate and stable pose estima...
详细信息
The existing fire alarm system has strict distance and installation requirements between the fire point and the detector, and is easy to be interfered by environmental factors. It is not suitable for places with large...
详细信息
Humans express emotions verbally and non-verbally through their voice, facial expressions, and body language. Facial expression recognition systems can identify the emotional state of any person by using different int...
详细信息
ISBN:
(纸本)9798350387896;9798350387889
Humans express emotions verbally and non-verbally through their voice, facial expressions, and body language. Facial expression recognition systems can identify the emotional state of any person by using different intelligent algorithms, such as Support Vector Machines, Hidden Markov Models, and Convolutional Neural Networks, among others. This study focuses on facial expression recognition using eye and mouth regions of images from the FER-2013 dataset by training convolutional neural network (CNN) models. Seven emotional states - happy, sad, fear, anger, disgust, surprise and neutral - were identified. The methodology included segmenting and concatenating the images to form three CNN models. The best-performing model, a four-layer CNN with 8, 16, 32, and 64 filters, achieved remarkable results: 99.05% accuracy, 100.00% precision, 93.75% recall, 96.77% F1-score, 95.95% validation accuracy, and a 0.15 validation loss with a processing time of 3.03 minutes. It was possible to develop a CNN model capable of identifying seven emotional states from only the data of the eye and mouth region using concatenated images.
Deep learning based object detection algorithms have been applied in various fields of life. As a representative of one-stage detection algorithms, YOLO series algorithms are highly favored for their fast *** resolve ...
Deep learning based object detection algorithms have been applied in various fields of life. As a representative of one-stage detection algorithms, YOLO series algorithms are highly favored for their fast *** resolve the existing issues with complex network architecture and weak real-time performance. YOLOv3 and YOLOv4 are improved by adding the lightweight network EfficientNet, using deep separable convolution rather than regular convolution, reducing a significant number of network parameters, using transfer learning to speed up the model’s training speed, and speeding up the model’s convergence. Finally, the accuracy, number of parameters, and detection rate of the network structure before as well as after improvement are compared to ensure that the model is successful. According to the experimental findings, the number of parameters for E-YOLOv3 and E-YOLOv4 are only 1/9 and 1/7 of what they were before the enhancement, and the detection rate has increased by 4 and 3 frames/second, respectively.
In the lighting conditions such as snowing, hazing, raining, and weak lighting condition, the accuracy of traffic sign recognition is not very high. It is important to develop an algorithms for real-time fast detectio...
详细信息
Deep learning (DL) has been successfully applied to recover compressive sensing (CS) signals in recent researches, which significantly reduce the time complexity than conventional model-based reconstruction algorithms...
详细信息
ISBN:
(数字)9798350376548
ISBN:
(纸本)9798350376555
Deep learning (DL) has been successfully applied to recover compressive sensing (CS) signals in recent researches, which significantly reduce the time complexity than conventional model-based reconstruction algorithms. However, some important elements in the CS theory such as sparsity and other priori constraints are abandoned in these DL-based methods, which leads a poor performance in reconstruction results at a high sampling ratio. In this paper, a novel reconstruction method is proposed to introduce sparsity into neural networks for CS reconstruction. More specifically, a two-step reconstruction algorithm based on mixed sparse representations and deep learning are designed to implement the purpose. To the best of our knowledge, there exists no published work that achieves the desirable function. Given a certain sampling ratio, the reconstructed image quality is much better than other CS related image acquisition and compression methods. The framework holds the promise to be widely used in the near future.
With the continuous development of science and technology, the capacity of data is also showing a geometric increase. Therefore, this paper mainly studies how to store query, change, and replace a large number of geog...
详细信息
As two core fields of machine learning, Natural Language processing and Computer Vision have derived a variety of cross research topics, and image captioning as a key topic of the current cross field has an important ...
详细信息
Multi-scale architectures have shown effectiveness in a variety of tasks thanks to appealing cross-scale complementarity. However, existing architectures treat different scale features equally without considering the ...
详细信息
ISBN:
(纸本)9781713871088
Multi-scale architectures have shown effectiveness in a variety of tasks thanks to appealing cross-scale complementarity. However, existing architectures treat different scale features equally without considering the scale-specific characteristics, i.e., the within-scale characteristics are ignored in the architecture design. In this paper, we reveal this missing piece for multi-scale architecture design and accordingly propose a novel Multi-Scale Adaptive Network (MSANet) for single image denoising. Specifically, MSANet simultaneously embraces the within-scale characteristics and the cross-scale complementarity thanks to three novel neural blocks, i.e., adaptive feature block (AFeB), adaptive multi-scale block (AMB), and adaptive fusion block (AFuB). In brief, AFeB is designed to adaptively preserve image details and filter noises, which is highly expected for the features with mixed details and noises. AMB could enlarge the receptive field and aggregate the multi-scale information, which meets the need of contextually informative features. AFuB devotes to adaptively sampling and transferring the features from one scale to another scale, which fuses the multi-scale features with varying characteristics from coarse to fine. Extensive experiments on both three real and six synthetic noisy image datasets show the superiority of MSANet compared with 12 methods. The code could be accessed from https://***/XLearning-SCU/2022-NeurIPS-MSANet.
Through computers, artists have found a way to enhance their production, discovering new ways for communicating their productions and devising new forms of expression. Being able to make the most of these facilities r...
详细信息
暂无评论