IoT sensors are becoming increasingly important supplement to traditional monitoring systems, particularly for in-situ based monitoring. Data collected using IoT sensors are often plagued with missing values occurring...
详细信息
IoT sensors are becoming increasingly important supplement to traditional monitoring systems, particularly for in-situ based monitoring. Data collected using IoT sensors are often plagued with missing values occurring as a result of sensor faults, network failures, drifts and other operational issues. Missing data can have substantial impact on in-field sensor calibration methods. The goal of this research is to achieve effective calibration of sensors in the context of such missing data. To this end, two objectives are presented in this paper. 1) Identify and examine effective imputation strategy for missing data in IoT sensors. 2) Determine sensor calibration performance using calibration techniques on data set with imputed values. Specifically, this paper examines the performance of variational autoencoder (VAE), Neural Network with Random Weights (NNRW), Multiple Imputation by Chain Equations (MICE), Random Forest-based Imputation (missForest) and K-Nearest Neighbour (KNN) for imputation of missing values on IoT sensors. Furthermore, the performance of sensor calibration via different supervised algorithms trained on the imputed dataset were evaluated. The analysis showed VAE technique to outperform the other methods in imputing the missing values at different proportions of missingness on two real-world datasets. Experimental results also showed improved calibration performance with imputed dataset.
Laughter is one of the most important nonverbal sound that human generates. It is a means for expressing his emotions. The acoustic and contextual features of this specific sound are different from those of speech and...
详细信息
Laughter is one of the most important nonverbal sound that human generates. It is a means for expressing his emotions. The acoustic and contextual features of this specific sound are different from those of speech and many difficulties arise during their modeling process. During this work, we propose an audio laughter generation system based on unsupervised generative models: the autoencoder (AE) and its variants. This procedure is the association of three main sub-process, (1) the analysis which consist of extracting the log magnitude spectrogram from the laughter database, (2) the generative models training, (3) the synthesis stage which incorporate the involvement of an intermediate mechanism: the vocoder. To improve the synthesis quality, we suggest two hybrid models (LSTM-VAE, GRU-VAE and CNN-VAE) that combine the representation learning capacity of variational autoencoder (VAE) with the temporal modelling ability of a long short-term memory RNN (LSTM) and the CNN ability to learn invariant features. To figure out the performance of our proposed audio laughter generation process, objective evaluation (RMSE) and a perceptual audio quality test (listening test) were conducted. According to these evaluation metrics, we can show that the GRU-VAE outperforms the other VAE models.
Noise from automobiles, such as buzzing, squeaking, and rattling (BSR) noises, is a key factor in automobile quality assessment. It is necessary to classify these noises for appropriate handling and prevention. Althou...
详细信息
Noise from automobiles, such as buzzing, squeaking, and rattling (BSR) noises, is a key factor in automobile quality assessment. It is necessary to classify these noises for appropriate handling and prevention. Although many researchers have conducted studies to classify noise, they suffer from several problems: difficulty in extracting appropriate features, insufficient data to train a classifier, and weak robustness to surrounding noise. This paper proposes a method called latent semantic controlling generative adversarial networks (LSC-GAN) to solve these problems. To capture the features of data, a variational autoencoder (VAE), an autoencoder with approximate inference in a latent Gaussian model, learns the data representation by projecting them into the latent space according to their features and reconstructing the projected data. Because the generator and the discriminator of the LSC-GAN are trained simultaneously, the capacity to extract the characteristics of the data is improved and a knowledge space of classifiable data is also expanded with insufficient data. While data are generated by the generator, the encoder projects them back to the latent space according to their characteristics to advance the ability to extract features. Finally, the encoder is trained to the classifier, which is trained to classify BSR noises. The proposed classifier outperforms other models and achieves an accuracy of 96.68%. We confirm using a confusion matrix that the proposed model classifies the types of insufficient class better than other models. Our proposed model classifies data with accuracy of 94.68%, even if the data contains surrounding noise, which means it is more robust to BSR with surrounding noise than other models. (c) 2020 Elsevier B.V. All rights reserved.
Multi-agent reinforcement learning (MARL) is essential for a wide range of high-dimensional scenarios and complicated tasks with multiple agents. Many attempts have been made for agents with prior domain knowledge and...
详细信息
Multi-agent reinforcement learning (MARL) is essential for a wide range of high-dimensional scenarios and complicated tasks with multiple agents. Many attempts have been made for agents with prior domain knowledge and predefined structure. However, the interaction relationship between agents in a multi-agent system (MAS) in general is usually unknown, and previous methods could not tackle dynamical activities in an ever-changing environment. Here we propose a multi-agent Actor-Critic algo-rithm called Structural Relational Inference Actor-Critic (SRI-AC), which is based on the framework of centralized training and decentralized execution. SRI-AC utilizes the latent codes in variational autoen-coder (VAE) to represent interactions between paired agents, and the reconstruction error is based on Graph Neural Network (GNN). With this framework, we test whether the reinforcement learning learners could form an interpretable structure while achieving better performance in both cooperative and com-petitive scenarios. The results indicate that SRI-AC could be applied to complex dynamic environments to find an interpretable structure while obtaining better performance compared to baseline algorithms. (c) 2021 Elsevier B.V. All rights reserved.
Is it possible to find deterministic relationships between optical measurements and pathophysiology in an unsupervised manner and based on data alone? Optical property quantification is a rapidly growing biomedical im...
详细信息
Is it possible to find deterministic relationships between optical measurements and pathophysiology in an unsupervised manner and based on data alone? Optical property quantification is a rapidly growing biomedical imaging technique for characterizing biological tissues that shows promise in a range of clinical applications, such as intraoperative breast-conserving surgery margin assessment. However, translating tissue optical properties to clinical pathology information is still a cumbersome problem due to, amongst other things, inter- and intrapatient variability, calibration, and ultimately the nonlinear behavior of light in turbid media. These challenges limit the ability of standard statistical methods to generate a simple model of pathology, requiring more advanced algorithms. We present a data-driven, nonlinear model of breast cancer pathology for real-time margin assessment of resected samples using optical properties derived from spatial frequency domain imaging data. A series of deep neural network models are employed to obtain sets of latent embeddings that relate optical data signatures to the underlying tissue pathology in a tractable manner. These self-explanatory models can translate absorption and scattering properties measured from pathology, while also being able to synthesize new data. The method was tested on a total of 70 resected breast tissue samples containing 137 regions of interest, achieving rapid optical property modeling with errors only limited by current semi-empirical models, allowing for mass sample synthesis and providing a systematic understanding of dataset properties, paving the way for deep automated margin assessment algorithms using structured light imaging or, in principle, any other optical imaging technique seeking modeling. Code is available.
Recently, many efforts have been devoted to generating responses expressing a specific emotion or relating to a given topic in a controlled manner. However, limited attention has been given to generating responses wit...
详细信息
Recently, many efforts have been devoted to generating responses expressing a specific emotion or relating to a given topic in a controlled manner. However, limited attention has been given to generating responses with a specified syntactic pattern, which makes it possible to imitate someone's way of speaking in dialogue. To fulfill this goal, we propose two models to generate syntax-aware responses: a gross-constraint and a specific-constraint model. The former controls the syntactic patterns of generated responses at sentence-level, while the latter works at smaller language units, such as words or phrases, being capable of manipulating the syntactic structures of responses in a more subtle manner. The extensive experimental results on two different datasets show that both the two models not only can generate meaningful responses with a specific and coherent structure but also improve on the diversity of generated responses, with similar gains in readability, relevance, and diversity as measured by human judges.
The intrinsic complexity associated with passive sonar data makes the task of target recognition extremely challenging. The conventional classifier architectures based on hand-engineered feature transforms often fail ...
详细信息
The intrinsic complexity associated with passive sonar data makes the task of target recognition extremely challenging. The conventional classifier architectures based on hand-engineered feature transforms often fail miserably to disentangle the high-dimensional non-linear structures in the observed target records. Although the modern deep learning algorithms through hierarchical feature learning yield acceptable success rates, they often require tremendous amounts of data when trained in a supervised manner. An unsupervised generative framework utilizing a variational autoencoder (VAE) is proposed in this work in order to create better disentangled representations for the downstream classification task. The disentanglement is further enforced using a hyperparameter beta. For the purpose of better segregating the spectro-temporal features, an intermediate non-linearly scaled time-frequency representation is also employed in conjunction with beta-VAE. Experimental analysis of various classifier configurations yields encouraging results in terms of data efficiency and classification accuracy on target records collected from various locations of the Indian Ocean.
Map matching has been widely used in various indoor localization technologies. However, conventional map matching technologies based on probabilistic models, such as particle filter (PF), still have a series of limita...
详细信息
Map matching has been widely used in various indoor localization technologies. However, conventional map matching technologies based on probabilistic models, such as particle filter (PF), still have a series of limitations, such as underutilization of map information, poor generalization, and relatively low precision. To improve the performance of PF-based map matching technique, this paper proposes MapDem, a novel map matching model fusing dynamic word embeddings and variational autoencoder (VAE) to improve matching performance significantly. The key to our approach is to extract map information using dynamic word embeddings to represent each reachable point on the map as word vectors with allowable oriented trajectory information. The same point has different representations on different trajectories so that MapDem can adaptively learn the contextual information of map for position estimation. Unlike traditional particle filters, MapDem focuses on the learning of particle sets distribution by a statistical model, variational autoencoder (VAE), followed by estimating position with combined current and previous sequence information. Extensive experiments have been conducted with 610 trajectories in three real-world scenarios. Numerical results demonstrate the adaptability of MapDem which works equally well in all three different scenarios, outperforming traditional particle filters by 18% on average.
The ability of deep learning (DL) approaches to learn generalised signal and noise models, coupled with their fast inference on GPUs, holds great promise for enhancing gravitational-wave (GW) searches in terms of spee...
详细信息
The ability of deep learning (DL) approaches to learn generalised signal and noise models, coupled with their fast inference on GPUs, holds great promise for enhancing gravitational-wave (GW) searches in terms of speed, parameter space coverage, and search sensitivity. However, the opaque nature of DL models severely harms their reliability. In this work, we meticulously develop a DL model stage-wise and work towards improving its robustness and reliability. First, we address the problems in maintaining the purity of training data by deriving a new metric that better reflects the visual strength of the 'chirp' signal features in the data. Using a reduced, smooth representation obtained through a variational auto-encoder (VAE), we build a classifier to search for compact binary coalescence (CBC) signals. Our tests on real LIGO data show an impressive performance of the model. However, upon probing the robustness of the model through adversarial attacks, its simple failure modes were identified, underlining how such models can still be highly fragile. As a first step towards bringing robustness, we retrain the model in a novel framework involving a generative adversarial network (GAN). Over the course of training, the model learns to eliminate the primary modes of failure identified by the adversaries. Although absolute robustness is practically impossible to achieve, we demonstrate some fundamental improvements earned through such training, like sparseness and reduced degeneracy in the extracted features at different layers inside the model. We show that these gains are achieved at practically zero loss in terms of model performance on real LIGO data before and after GAN training. Through a direct search on similar to 8.8 days of LIGO data, we recover two significant CBC events from GWTC-2.1 (Abbott et al 2022 2108.01045 [gr-qc]), GW190519_153544 and GW190521_074359. We also report the search sensitivity obtained from an injection study.
Herein, a highly productive and defect-free 3D-printing system enforced by deep-learning (DL)-based anomaly detection and reinforcement-learning (RL)-based optimization processes is developed. Unpredictable defect fac...
详细信息
Herein, a highly productive and defect-free 3D-printing system enforced by deep-learning (DL)-based anomaly detection and reinforcement-learning (RL)-based optimization processes is developed. Unpredictable defect factors, such as machine setting errors or unexpected material flow, are analyzed by image-based anomaly detection implemented using a variational autoencoder DL model. Real-time detection and in situ correction of defects are implemented by an autocalibration algorithm in conjunction with the DL system. In view of productivity enhancement, the optimized set of diversified printing speeds can be generated from virtual simulation of RL, which is established using a physics-based engineering model. The RL-simulated parameter set maximizes printing speed and minimizes deflection-related failures throughout the 3D-printing process. With the synergistic assistance of DL and RL techniques, the developed system can overcome the inherent challenging intractability of 3D printing in terms of material property and geometry, achieving high process productivity.
暂无评论