In this paper, we study both multi-armed and contextual bandit problems in censored environments. Our goal is to estimate the performance loss due to censorship in the context of classical algorithms designed for unce...
详细信息
Artificial neural networks (ANN) have been shown to be flexible and effective function estimators for the identification of nonlinear state-space models. However, if the resulting models are used directly for nonlinea...
Artificial neural networks (ANN) have been shown to be flexible and effective function estimators for the identification of nonlinear state-space models. However, if the resulting models are used directly for nonlinear model predictive control (NMPC), the resulting nonlinear optimization problem is often overly complex due to the size of the network, requires the use of high-order observers to track the states of the ANN model, and the overall control scheme does not exploit the available autograd tools for these models. In this paper, we propose an efficient approach to auto-convert ANN statespace models to linear parameter-varying (LPV) form and solve predictive control problems by successive solutions of linear model predictive problems, corresponding to quadratic programs (QPs). Furthermore, we show how existing deep-learning methods, such as SUBNET that uses a state encoder, enable efficient implementation of MPCs on identified ANN models. Performance of the proposed approach is demonstrated by a simulation study on an unbalanced disc system.
This paper discusses intelligent constellation generation based on autoencoder communication system. In previous studies, the amplitude was set to fluctuate between r=0.0 and 1.0. However, when checking the generated ...
This paper discusses intelligent constellation generation based on autoencoder communication system. In previous studies, the amplitude was set to fluctuate between r=0.0 and 1.0. However, when checking the generated constellation, distortion was confirmed instead of the conventional symbol arrangement. Therefore, in this paper, it compares the case where the amplitude is constant, the case where the average amplitude within a Minibatch is 1, and the case where the average amplitude is 1 for Interval time. The communication standard used in this research is IEEE 802.11a, assuming wireless Local Area Network (LAN) specifications. The IEEE 802.11a standard has an Fast Fourier Transform (FFT) length of 64, a subcarrier number of 52, and Quadrature Phase Shift Keying (QPSK) and 16 Quadrature Amplitude Modulation (QAM), modulation methods. A guard interval of 800 ns is added and the symbol length is 4000 ns. First, a simulation was performed under the condition that the amplitude was kept constant. QPSK with 4 symbols, constant amplitude model is rounded more than previous research result. 16QAM with 16 symbols is arranged regularly like lined up on a line. Second, the simulation was performed under the condition that the average amplitude within the minibatch was set to 1. QPSK with 4 symbols, appears to rotate clockwise. 16QAM with 16 symbols has a more uniform symbol placement than previous research result. Third, a simulation was performed under the condition that the average amplitude within Interval time was set to 1. QPSK with 4 symbols, is the closest to square among QPSK output results so far. The direction is slightly tilted, but if it can be rotated a little more, it may be possible to reproduce the same symbol arrangement as before. 16QAM with 16 symbols, the symbol arrangement is biased as a whole. However, it can be seen that are arranged in line on the line, perhaps due to regularity. As future work, in addition to the conditions set this time, it will exa
An urgent task in the training of personnel of industrial enterprises is the development of the required amount of practical skills. In the context of a pandemic, restrictive measures, as well as the high cost and dan...
详细信息
Clipping is a common nonlinear distortion that occurs whenever the input or output of an audio system exceeds the supported range. This phenomenon undermines not only the perception of speech quality but also downstre...
Clipping is a common nonlinear distortion that occurs whenever the input or output of an audio system exceeds the supported range. This phenomenon undermines not only the perception of speech quality but also downstream processes utilizing the disrupted signal. Therefore, a real-time-capable, robust, and low-response-time method for speech declipping (SD) is desired. In this work, we introduce DDD (Demucs-Discriminator-Declipper), a real-time-capable speech-declipping deep neural network (DNN) that requires less response time by design. We first observe that a previously untested real-time-capable DNN model, Demucs, exhibits a reasonable declipping performance. Then we utilize adversarial learning objectives to increase the perceptual quality of output speech without additional inference overhead. Subjective evaluations on harshly clipped speech shows that DDD outperforms the baselines by a wide margin in terms of speech quality. We perform detailed waveform and spectral analyses to gain an insight into the output behavior of DDD in comparison to the baselines. Finally, our streaming simulations also show that DDD is capable of sub-decisecond mean response times, outperforming the state-of-the-art DNN approach by a factor of six.
The proportional-integral-derivative (PID) controller is a popular control loop feedback mechanism that allows users to efficiently regulate their system outputs. As demonstrated in this project, one of the main appli...
详细信息
This paper delves into the challenges of binary classification using imbalanced datasets, particularly when instances of interest are infrequent. It explores a comprehensive approach that integrates Synthetic Minority...
This paper delves into the challenges of binary classification using imbalanced datasets, particularly when instances of interest are infrequent. It explores a comprehensive approach that integrates Synthetic Minority Over-sampling Technique (SMOTE), Generative Adversarial Networks (GANs), and Variational Autoencoders (VAEs) to enhance classification outcomes. Traditional classification models tend to favor the majority class, while the impact of imbalanced misclassification costs is often overlooked. The integration of SMOTE, GANs, and VAEs in binary classification, or SMOTE-GAN-VAE, addresses these challenges by generating synthetic instances, refining data representations, and capturing latent features. To evaluate the effectiveness of various data generation methods, a credit card fraud dataset is used. The performance metrics considered include F0.5-score, F1-score, and F2-score, which account for both precision and recall. The results indicate that SMOTE-GAN-VAE outperforms individual methods, such as SMOTE, GANs, and VAEs, demonstrating its potential to enhance data representation and classification accuracy, and outperformed the β- VAE filtered approach employed in previous literature.
This paper presents an optical coherence tomography (OCT) system in conjunction with a novel image reconstruction technique employed for in vitro imaging of human teeth. The primary goal is to enhance the signal-to-no...
This paper presents an optical coherence tomography (OCT) system in conjunction with a novel image reconstruction technique employed for in vitro imaging of human teeth. The primary goal is to enhance the signal-to-noise ratio (SNR) in the obtained images. The study entails a comparative analysis between the conventional Fast Fourier Transform (FFT) OCT image reconstruction method and a newly introduced scaled nonuniform discrete Fourier transform (NDFT) approach. The findings reveal that the NDFT method consistently delivers superior results in terms of peak signal-to-noise ratio (PSNR) and overall image quality, even when dealing with redundant and nonuniform frequency domain samples. In light of these results, this paper concludes that integrating NDFT into OCT procedures has the potential to significantly enhance the quality of image reconstructions, thereby fostering its broader application in the field of dental imaging.
In text mining, Latent Semantic Analysis (LSA) is the popular method to reduce the dimension of document vectors. Since LSA produces a set of topics by statistical information, the meaning of each topic is not *** pro...
详细信息
In this work we combine a high flow cytometry experimental setup and a 10Kframe/sec capable neuromorphic event-based camera, followed by lightweight machine learning schemes, thus allowing the simultaneous imaging and...
详细信息
ISBN:
(纸本)9798350345995
In this work we combine a high flow cytometry experimental setup and a 10Kframe/sec capable neuromorphic event-based camera, followed by lightweight machine learning schemes, thus allowing the simultaneous imaging and real-time classification of test particles, moving at a speed of 0.8m/sec with an accuracy of 97.6%. The key advantage of the utilized microscopy system, is the use of an event-based camera, generating spiking events, triggered by pixel's contrast changes. This bio-inspired operation, contrary to conventional CMOS cameras [1], alleviates bandwidth constraints and can significantly boost frame-rate capabilities, thus capturing high speed events. Following this paradigm, medical imaging modalities, where the detection and analysis of fast-moving particles is a necessity, such as high-flow cytometry, can greatly proliferate from the proposed approach.
暂无评论