A bidirectional autoencoder learns or approximates an identity mapping as it trains a single network with a version of the new bidirectional backpropagation algorithm. Ordinary unidirectional autoencoders find many us...
A bidirectional autoencoder learns or approximates an identity mapping as it trains a single network with a version of the new bidirectional backpropagation algorithm. Ordinary unidirectional autoencoders find many uses in image processing and in large language models. But they use separate networks for encoding and decoding. Bidirectional auto encoders use the same synaptic weights for encoding and decoding. The forward pass encodes while the backward pass decodes. Bidirectional auto encoders improved network performance and significantly reduced memory usage and used fewer parameters. Simulations compared unidirectional with bidirectional autoencoders for image compression and de noising. The models trained on the MNIST handwritten-digit and CIFAR-IO image datasets. The performance measures were the peak signal-to-noise ratio and the index of structural similarity. Bidirectional autoencoders outperformed unidirectional autoencoders and still reduced the number of trainable synaptic parameters by about 50%.
The direction of arrival (DOA) estimation task becomes more challenging when the sources are coherent. In this paper, a method has been suggested for the coherent DOA estimation. At first, the received signal of the u...
详细信息
ISBN:
(数字)9798350376340
ISBN:
(纸本)9798350376357
The direction of arrival (DOA) estimation task becomes more challenging when the sources are coherent. In this paper, a method has been suggested for the coherent DOA estimation. At first, the received signal of the uniform linear antenna array is divided into subarrays. For each of the subarrays, the multiple Toeplitz technique has been applied to produce the correlation matrix. Then, the forward backward smoothing has been exploited and the resultant correlation matrices are averaged over various snapshots to provide the improved-rank covariance matrix. At the last step, one of the subspace based DOA estimation schemes are leveraged to estimate the DOAs of the coherent sources. The simulation results indicate that the proposed method is superior over its rivals in various scenarios.
作者:
Tian, YePan, JingwenYang, ShangshangZhang, XingyiHe, ShupingJin, YaochuAnhui University
Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education Institutes of Physical Science and Information Technology Hefei230601 China Hefei Comprehensive National Science Center
Institute of Artificial Intelligence Hefei230088 China Anhui University
School of Computer Science and Technology Hefei230601 China Anhui University
Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education School of Artificial Intelligence Hefei230601 China Anhui University
Anhui Engineering Laboratory of Human-Robot Integration System and Intelligent Equipment School of Electrical Engineering and Automation Hefei230601 China Bielefeld University
Faculty of Technology Bielefeld33619 Germany
The sparse adversarial attack has attracted increasing attention due to the merit of a low attack cost via changing a small number of pixels. However, the generated adversarial examples are easily detected in vision s...
详细信息
This paper investigates the problem of incipient fault detection and diagnosis (FDD) in wind energy conversion systems (WECS) using an innovative and effective approach called the ensemble learning-sine cosine optimiz...
详细信息
This paper introduces an innovative approach to sleep stage classification, leveraging a multi-modal signal integration framework encompassing Electrooculography (EOG) and two-channel electroencephalography (EEG) data...
详细信息
The problem of predicting emotional attributes from speech has often focused on predicting a single value from a sentence or short speaking turn. These methods often ignore that natural emotions are both dynamic and d...
The problem of predicting emotional attributes from speech has often focused on predicting a single value from a sentence or short speaking turn. These methods often ignore that natural emotions are both dynamic and dependent on context. To model the dynamic nature of emotions, we can treat the prediction of emotion from speech as a time-series problem. We refer to the problem of predicting these emotional traces as dynamic speech emotion recognition. Previous studies in this area have used models that treat all emotional traces as coming from the same underlying distribution. Since emotions are dependent on contextual information, these methods might obscure the context of an emotional interaction. This paper uses a neural process model with a segment-level speech emotion recognition (SER) model for this problem. This type of model leverages information from the time-series and predictions from the SER model to learn a prior that defines a distribution over emotional traces. Our proposed model performs 21% better than a bidirectional long short-term memory (BiLSTM) baseline when predicting emotional traces for valence.
In this paper, an improved dead-beat controller (DBC) for a single-phase LC-coupling hybrid active power filter (LC-HAPF) is proposed to achieve both low steady-state error and fast transient response. The conventiona...
详细信息
We introduce new soft diamond regularizers that both improve synaptic sparsity and maintain classification accuracy in deep neural networks. These parametrized regularizers outperform the state-of-the-art hard-diamond...
详细信息
This paper studies the influence of an explicit symmetry break on the dynamics of a memristive band pass filter. It is shown that the circuit exhibits complex behaviours. The symmetry break is achieved by considering ...
详细信息
暂无评论