In space science and satellite imagery, better resolution of the data information obtained makes images clearer and interpretation more accurate. However, the huge data volume gained by the complex on-board satellite ...
详细信息
In space science and satellite imagery, better resolution of the data information obtained makes images clearer and interpretation more accurate. However, the huge data volume gained by the complex on-board satellite instruments becomes a problem that needs to be managed carefully. To reduce the data volume to be stored and transmitted on-ground, the signals received should be compressed, allowing a good original source representation in the reconstruction step. Image compression covers a key role in space science and satellite imagery and, recently, deep learning models have achieved remarkable results in computer vision. In this paper, we propose a spectral signals compressor network based on deep convolutional autoencoder (SSCNet) and we conduct experiments over multi/hyperspectral and RGB datasets reporting improvements over all baselines used as benchmarks and than the JPEG family algorithm. Experimental results demonstrate the effectiveness in the compression ratio and spectral signal reconstruction and the robustness with a data type greater than 8 bits, clearly exhibiting better results using the PSNR, SSIM, and MS-SSIM evaluation criteria.
Detecting the status of online learning and online working through human facial expressions and actions has been a hot topic in the field of Computer Vision in recent years. Anomalies have the characteristics of low o...
详细信息
ISBN:
(纸本)9781665421744
Detecting the status of online learning and online working through human facial expressions and actions has been a hot topic in the field of Computer Vision in recent years. Anomalies have the characteristics of low occurrence probability, difficulty in definition and various types, so anomaly detection is extremely challenging. This paper is based on image rather than video to detect laptop users' abnormal expressions more quickly, which divided into three steps. Firstly, we use illumination compensation to improve the brightness of the original image, and then separate the face from the background by skin color segmentation. Secondly, we combine sparse coding and an additional encoder to extract face features based on autoencoder, which reduces the noise impact in the encoding process and enhances the robustness of the model. Finally, we put the extracted face features into one-class Support Vector Machine (SVM) classifier based on the radial basis function (RBF) for classification. This paper conducts experiments on three real datasets of learning, entertainment and working, which proves that the Fl- score of our proposed model is up to 0.89157.
Anomalous sound detection (ASD) is one of the most important fields in industrial facility maintenance. For this task, semi-supervised approaches are preferred thanks to their simplicity and no training data labels re...
详细信息
ISBN:
(数字)9781728154718
ISBN:
(纸本)9781728154718
Anomalous sound detection (ASD) is one of the most important fields in industrial facility maintenance. For this task, semi-supervised approaches are preferred thanks to their simplicity and no training data labels required. These methods train an autoencoder (AE) with only normal sound data and detect anomalies based on anomaly scores of actual samples. In this paper, we propose applying the convolutional variational autoencoder (CVAE) to ASD task. Through experiments using machine sound data, the CVAE is proven to be effective in detecting abnormal sound and outperform existing methods.
Artificial neural networks can be an important tool to improve the search for admissible string compactifications and characterize them. In this paper we construct the heterotic orbiencoder, a general deep autoencoder...
详细信息
Artificial neural networks can be an important tool to improve the search for admissible string compactifications and characterize them. In this paper we construct the heterotic orbiencoder, a general deep autoencoder to study heterotic orbifold models arising from various Abelian orbifold geometries. Our neural network can be easily trained to successfully encode the large parameter space of many orbifold geometries simultaneously, independently of the statistical dissimilarities of their training features. In particular, we show that our autoencoder is capable of compressing with good accuracy the large parameter space of two promising orbifold geometries in just three parameters. Further, most orbifold models with phenomenologically appealing features appear in bounded regions of this small space. Our results hint towards a possible simplification of the classification of (promising) heterotic orbifold models.
Rapid and accurate diagnosis of Alzheimer's disease (AD) is critical for patient treatment, especially in the early stages of the disease. While computer-assisted diagnosis based on neuroimaging holds vast potenti...
详细信息
ISBN:
(纸本)9781665441216
Rapid and accurate diagnosis of Alzheimer's disease (AD) is critical for patient treatment, especially in the early stages of the disease. While computer-assisted diagnosis based on neuroimaging holds vast potential for helping clinicians detect disease sooner, there are still some technical hurdles to overcome. This study presents an end-to-end disease detection approach using convolutional autoencoders by integrating supervised prediction and unsupervised representation. The 2D neural network is based upon a pre-trained 2D convolutional autoencoder to capture latent representations in structural brain magnetic resonance imaging (MRI) scans. Experiments on the OASIS brain MRI dataset revealed that the model outperforms a number of traditional classifiers in terms of accuracy using a single slice.
This paper suggests to use membership-mapping as the building block of deep models. An alternative idea of deep autoencoder, referred to as Bregman Divergence Based Conditionally Deep autoencoder (that consists of lay...
详细信息
ISBN:
(纸本)9783030871017;9783030871000
This paper suggests to use membership-mapping as the building block of deep models. An alternative idea of deep autoencoder, referred to as Bregman Divergence Based Conditionally Deep autoencoder (that consists of layers such that each layer learns data representation at certain abstraction level through a membership-mappings based autoencoder), is presented. A multi-class classifier is presented that employs a parallel composition of conditionally deep autoencoders to learn data representation for each class. Experiments are provided to demonstrate the competitive performance of the proposed framework in classifying high-dimensional feature vectors and in rendering robustness to the classification.
Face recognition (FR) systems based on convolutional neural networks have shown excellent performance in human face inference. However, some malicious users may exploit such powerful systems to identify others' fa...
详细信息
Face recognition (FR) systems based on convolutional neural networks have shown excellent performance in human face inference. However, some malicious users may exploit such powerful systems to identify others' face images disclosed by victims' social network accounts, consequently obtaining private information. To address this emerging issue, synthesizing face protection images with visual and protective effects is essential. However, existing face protection methods encounter three critical problems: poor visual effect, limited protective effect, and trade-off between visual and protective effects. To address these challenges, we propose a novel face protection approach in this article. Specifically, we design a generative adversarial network (GAN) framework with an autoencoder (AEGAN) as the generator to synthesize the protection images. It is worth noting that we introduce an interpolation upsampling module in the decoder in order to let the synthesized protection images evade recognition by powerful convolution-based FR systems. Furthermore, we introduce an attention module with a perceptual loss in AEGAN to enhance the visual effects of synthesized images by AEGAN. Extensive experiments have shown that AEGAN not only can maintain the comfortable visual quality of synthesized images but also prevent the recognition of commercial FR systems, including Baidu and iKLYTEK.
Few Shot Object Detection is becoming a hot issue in the problem of deep learning object detection. Its main goal is to use very few samples for training and learning to obtain a model with good detection speed and ac...
详细信息
ISBN:
(纸本)9781450398343
Few Shot Object Detection is becoming a hot issue in the problem of deep learning object detection. Its main goal is to use very few samples for training and learning to obtain a model with good detection speed and accuracy. However, the most significant problem of few shot object detection detection is that the extraction of target feature information is insufficient and incomplete due to the sparse sample data. The solution to this problem in main methods is often to increase the number of samples through data enhancement, which can greatly enhance sample feature information. However, there is still the problem of inaccurate extraction of target feature information. This paper proposes a feature extraction network based on an autoencoder, that is, under the framework of autoEncode, refer to the network structure of Vgg-16 and integrate the SeNet channel attention mechanism to adjust the channel feature information weight parameters in the feature extraction network, and then design the similarity evaluation method between the sample input image and the reconstructed image based on the autoencoder is used to quantify the index representation feature extraction ability. Finally, the experiments show, compared with other main feature extraction algorithms, our method proposed in this paper has a better performance on the feature extraction ability index on the small-sample fine-grained dataset Oxford-102 Flower, and the speed reaches the level of main algorithm, which shows that the scheme proposed in this paper effectively improves the feature learning and extraction ability in the condition of FSOD.
Permeability prediction of porous media from numerical approaches is an important supplement for experimental measurements with the benefits of being more economical and efficient. However, the accuracy and reliabilit...
详细信息
Permeability prediction of porous media from numerical approaches is an important supplement for experimental measurements with the benefits of being more economical and efficient. However, the accuracy and reliability of traditional numerical approaches are strongly dependent on the high-resolution images of porous media, which greatly limits their implementation for engineering applications. Herein, a semi-supervised machine learning approach is proposed to predict the permeability of porous media from low-resolution images. This approach consists of an autoencoder (AE) module trained with unlabeled data to assist the backbone convolutional neural network (CNN) in the prediction by providing a mapping of the low-resolution porous media to high-resolution features. The low-resolution information from CNN trained with small amount of labeled data and high-resolution information from AE trained with larger amount of unlabeled data are comprehensively considered in this approach. The prediction performance of AE-CNN from low-resolution images is examined against the results from traditional approaches of CNN and lattice Boltzmann method (LBM) by the mean-square errors (MSE) and R-Squared (R2) calculations. Using 5-fold cross-validation method, the average value of R2 is 0.896 on the test dataset by AE-CNN, compared to 0.869 for the traditional CNN without the AE. The MSEs for AE-CNN are 0.022 and 0.064 on the training and test datasets respectively in the best-performance fold, while without AE, the MSEs for only CNN are 0.034 and 0.083 on the training and test datasets respectively in the best-performance fold, implying that AE modules can substantially improve the prediction performance from low-resolution images of porous media. As for the simulation results of LBM approach, its prediction reliability (average R2: 0.42;MSE: 0.37 and 0.36 in the best-performance fold) is extremely lower than that of CNN-based machine learning algorithms owing to huge numerical error
RNA-binding proteins (RBPs) are involved in a number of biological processes such as RNA synthesis, protein folding, alternative splicing, etc. Predicting RBPs can facilitate the discovery and treatment of human disea...
详细信息
RNA-binding proteins (RBPs) are involved in a number of biological processes such as RNA synthesis, protein folding, alternative splicing, etc. Predicting RBPs can facilitate the discovery and treatment of human diseases, such as muscle atrophy, nervous system diseases, and cancer. However, there are still various challenges in identifying RBPs using experimental methods. Computational methods, and in particular Deep Learning, are being deployed to alleviate some of these challenges and provide new avenues of investigation in the field of RBPs prediction. Here, we propose DEEPStack-RBP, a novel RBPs prediction tool based on deep learning and ensemble learning. First, conjoint triad (CT), local descriptors (LD), pseudo amino acid composition (PseAAC), multivariate mutual information (MMI) and position specific scoring matrix-transition probability composition (PSSM-TPC) are applied to extract multiple features from the proteins. Subsequently, autoencoder (AE) is used to eliminate redundancy in features, and SMOTE-ENN is employed to balance the samples by minimizing the number difference between positive and negative cases. Finally, the stacked ensemble classifier composed of bidirectional long short-term memory (BiLSTM), gated recurrent unit (GRU), and support vector machine (SVM) is used for prediction. On the training dataset RBP9873, the ACC value of DEEPStack-RBP reaches 98.76% with a MCC value of 0.9508. For the three independent test datasets of Human, S. cerevisiae and A. thaliana, the accuracy of the model is 97.16%, 97.67% and 99.57% respectively, and the MCC is 0.9405, 0.9499 and 0.9906 respectively. These results show that DEEPStack-RBP can be used as a powerful tool for RBPs prediction.(c) 2022 Elsevier B.V. All rights reserved.
暂无评论