In the detection using guided waves, the signal often carries a high level of non-Gaussian noise. The traditional denoising method cannot estimate and use the prior information of the signal, which leads to poor denoi...
详细信息
In the detection using guided waves, the signal often carries a high level of non-Gaussian noise. The traditional denoising method cannot estimate and use the prior information of the signal, which leads to poor denoising effect. To tackle this problem, this paper proposed a denoising network based on the combination of generative adversarial network (GAN) and autoencoder (AE). First, GAN is used to estimate the distribution characteristics of the extracted noise and generate samples. Second, according to the characteristics of the guided wave, a pair of datasets are generated to train DAE network. The trained denoising AE has strong robustness. As a result, the proposed GAN-AE based denoiser (GAD) can effectively can effectively reduce the noise level and has the ability to accurately recover the peak time of the wave packet. In particular, the proposed method significantly outperforms conventional denoising methods in low signal-to-noise (SNR) conditions.
Non-technical losses (NTLs) are one of the major causes of revenue losses for electric utilities. In the literature, various machine learning (ML)/deep learning (DL) approaches are employed to detect NTLs. The existin...
详细信息
Non-technical losses (NTLs) are one of the major causes of revenue losses for electric utilities. In the literature, various machine learning (ML)/deep learning (DL) approaches are employed to detect NTLs. The existing studies are mostly concerned with tuning the hyperparameters of ML/DL methods for efficient detection of NTL, i.e., electricity theft detection. Some of them focus on the selection of prominent features from data to improve the performance of electricity theft detection. However, the curse of dimensionality affects the generalization ability of ML/DL classifiers and leads to computational, storage, and overfitting problems. Therefore, to deal with the above-mentioned issues, this study proposes a system based on metaheuristic techniques (artificial bee colony and genetic algorithm) and denoising autoencoder for electricity theft detection using big data in electric power systems. The former (metaheuristics) are used to select prominent features, while the latter is utilized to extract high variance features from electricity consumption data. Firstly, 11 new features are synthesized using statistical and electrical parameters from the user's consumption history. Then, the synthesized features are used as input to metaheuristic techniques to find a subset of optimal features. Finally, the optimal features are fed as input to the denoising autoencoder to extract features with high variance. The ability of both metaheuristic and autoencoder techniques to select and extract features is measured using a support vector machine. The proposed system reduces the overfitting, storage, and computational overhead of ML classifiers. Moreover, we perform several experiments to verify the effectiveness of our proposed system and results reveal that the proposed system has better performance than its counterparts.
Gas path fault diagnosis plays a critical role in the security guarantee and maintenance of aero-engines. In this paper, an approach based on a fusion neural network under multiple-model architecture for gas path faul...
详细信息
Gas path fault diagnosis plays a critical role in the security guarantee and maintenance of aero-engines. In this paper, an approach based on a fusion neural network under multiple-model architecture for gas path fault detection and isolation is proposed. We develop a multi-channel long short-term memory network based on a sliding window to explore temporal and spatial relationships of data and capture the residuals of sensor mea-surements between predicted and observed values. Additionally, denoising autoencoders under a multiple-model architecture are introduced so as to perform fault detection and isolation based on the comparison of recon-structed prediction errors and isolation thresholds. Several simulation results verify that the diagnostic model has excellent robustness and diagnostic ability. The proposed method is compared with other common methods, and the advantages and functions of this method are presented.
A method to develop a data-driven model able to estimate hydraulic properties based on groundwater level (GWL) fluctuation patterns is proposed. In particular, a preprocessing method using a denoising autoencoder (DAE...
详细信息
A method to develop a data-driven model able to estimate hydraulic properties based on groundwater level (GWL) fluctuation patterns is proposed. In particular, a preprocessing method using a denoising autoencoder (DAE) is incorporated into the proposed method to improve the performance of the developed method. DAE is applied to prepare the input variable of the estimation model by extracting informative low-dimensional features from the original high-dimensional GWL data. Before applying the proposed DAE to this study, the reliability of applying the DAE is validated. First, the ability to reduce the noise of GWL data is validated by observing that an average of 71% of the noise is reduced. Additionally, the performances of the extracted principal characteristics of the GWL data is confirmed by reasonable matches in the extracted features to the corresponding hydraulics of the aquifers. In this case, both synthetic data and actual data acquired over South Korea are applied. Based on the validated DAE results, models to estimate two types of hydraulic properties are constructed. The estimation performances of the models are quantitatively validated using the correlation coefficient between the estimated and actual hydraulic properties. Overall, the constructed models for k and alpha/n show an appropriate estimation accuracy with a high correlation coefficient between the actual result and estimate (0.8663 and 0.7207, respectively). Therefore, using the proposed method, the hydraulic properties of an un-informed aquifer can be effectively inferred given GWL data without conducting field experiments (e.g., pumping tests). The proposed method is promising for efficient evaluations of the physical hydraulics of un-informed aquifers and, therefore, can be used as an effective tool to manage groundwater resources.
In both the research and engineering fields, missing data is a serious problem that cannot be overlooked. Therefore, available datasets with missing data are a challenge to be modeled by conventional global prediction...
详细信息
In both the research and engineering fields, missing data is a serious problem that cannot be overlooked. Therefore, available datasets with missing data are a challenge to be modeled by conventional global prediction models. In this paper, we propose a hybrid model consisting of an autoencoder and a gated linear network for solving the regression problem under missing value scenario. A sophisticated modeling and identifying algorithm is developed. First, an extended affinity propagation (AP) clustering algorithm is applied to obtain a self-organized competitive net dividing the datasets into several clusters. Second, a multiple imputation tool with topp%winner-take-all denoising autoencoders (DAE) is introduced to realize better predictions of missing values, in which rough estimates of missing values by using the mean imputation and similarity method within the clusters are used as teacher signals of DAE. Finally, a gated linear network is designed to construct a piecewise linear regression model with interpolations in the exact same way as a support vector regression with a quasilinear kernel composed using the cluster information obtained in the AP clustering step. Based on the experiments of five datasets, our proposed method demonstrates its effectiveness and robustness compared with other traditional kernels and state-of-the-art methods, even on datasets with a large percentage of missing values. (c) 2020 Institute of Electrical Engineers of Japan. Published by Wiley Periodicals LLC.
Detection and prevention of intrusions in enterprise networks and systems is an important, but challenging problem due to extensive growth and usage of networks that are constantly facing novel attacks. An intrusion d...
详细信息
Detection and prevention of intrusions in enterprise networks and systems is an important, but challenging problem due to extensive growth and usage of networks that are constantly facing novel attacks. An intrusion detection system (IDS) monitors the network traffic and system-level applications to detect malicious activities in the network. However, most of the existing IDSs are incapable of providing higher accuracy and less false positive rate (FPR). Therefore, there is a need for adaptive techniques to detect network intrusions that maintain a balance between accuracy and FPR. In this paper, we present a context-adaptive IDS that uses multiple independent deep reinforcement learning agents distributed across the network for accurate detection and classification of new and complex attacks. We have done extensive experimentation using three benchmark datasets including NSL-KDD, UNSW-NB15 and AWID on our model that shows better accuracy and less FPR compared to the state-of-the-art systems. Further, we analysed the robustness of our model against adversarial attack and observed only a small decrease in accuracy as compared to the existing models. To further improve the robustness of the system, we implemented the concept of denoising autoencoder. Also, we have shown the usability of our system in real-life application with changes in the attack pattern.
Software defect prediction plays a very important role in software quality assurance,which aims to inspect as many potentially defect-prone software modules as ***,the performance of the prediction model is susceptibl...
详细信息
Software defect prediction plays a very important role in software quality assurance,which aims to inspect as many potentially defect-prone software modules as ***,the performance of the prediction model is susceptible to high dimensionality of the dataset that contains irrelevant and redundant *** addition,software metrics for software defect prediction are almost entirely traditional features compared to the deep semantic feature representation from deep learning *** address these two issues,we propose the following two solutions in this paper:(1)We leverage a novel non-linear manifold learning method-SOINN Landmark Isomap(SL-Isomap)to extract the representative features by selecting automatically the reasonable number and position of landmarks,which can reveal the complex intrinsic structure hidden behind the defect data.(2)We propose a novel defect prediction model named DLDD based on hybrid deep learning techniques,which leverages denoising autoencoder to learn true input features that are not contaminated by noise,and utilizes deep neural network to learn the abstract deep semantic *** combine the squared error loss function of denoising autoencoder with the cross entropy loss function of deep neural network to achieve the best prediction performance by adjusting a *** compare the SL-Isomap with seven state-of-the-art feature extraction methods and compare the DLDD model with six baseline models across 20 open source software *** experimental results verify that the superiority of SL-Isomap and DLDD on four evaluation indicators.
Background Nowadays, patients with chronic diseases such as diabetes and hypertension have reached alarming numbers worldwide. These diseases increase the risk of developing acute complications and involve a substanti...
详细信息
Background Nowadays, patients with chronic diseases such as diabetes and hypertension have reached alarming numbers worldwide. These diseases increase the risk of developing acute complications and involve a substantial economic burden and demand for health resources. The widespread adoption of Electronic Health Records (EHRs) is opening great opportunities for supporting decision-making. Nevertheless, data extracted from EHRs are complex (heterogeneous, high-dimensional and usually noisy), hampering the knowledge extraction with conventional approaches. Methods We propose the use of the denoising autoencoder (DAE), a Machine Learning (ML) technique allowing to transform high-dimensional data into latent representations (LRs), thus addressing the main challenges with clinical data. We explore in this work how the combination of LRs with a visualization method can be used to map the patient data in a two-dimensional space, gaining knowledge about the distribution of patients with different chronic conditions. Furthermore, this representation can be also used to characterize the patient's health status evolution, which is of paramount importance in the clinical setting. Results To obtain clinical LRs, we considered real-world data extracted from EHRs linked to the University Hospital of Fuenlabrada in Spain. Experimental results showed the great potential of DAEs to identify patients with clinical patterns linked to hypertension, diabetes and multimorbidity. The procedure allowed us to find patients with the same main chronic disease but different clinical characteristics. Thus, we identified two kinds of diabetic patients with differences in their drug therapy (insulin and non-insulin dependant), and also a group of women affected by hypertension and gestational diabetes. We also present a proof of concept for mapping the health status evolution of synthetic patients when considering the most significant diagnoses and drugs associated with chronic patients. Conclusio
With the development of online social media, it attracts increasingly attentions to utilize social information for recommender systems. Based on the intuition that users are influenced by their social friends, these m...
详细信息
With the development of online social media, it attracts increasingly attentions to utilize social information for recommender systems. Based on the intuition that users are influenced by their social friends, these methods are capable of addressing the data sparse problem and improving the performance of recommender systems. However, these methods model the influences between each pair of users independently and ignore the interactions among these social influences, i.e., high-level signal of social information. In this paper, we propose a deep autoencoder model to learn social representations for recommender system. This approach aims to learn low- and high- level features from social information based on muti-layers neural networks and matrix factorization technique. Especially, we develop an improved deep autoencoder model, namedSparse Stacked denoising autoencoder(SSDAE), to address the data sparse and imbalance problems for social networks. Moreover, we incorporate these deep representations and matrix factorization model into a uniform framework for recommender system. Our experiments in Epinions and Ciao datasets demonstrate that our method can significantly improve the performance of recommender system, especially for sparse users.
Existing noise removal processes for airborne electromagnetic (AEM) data generally consist of several steps, with each using a specific method to remove a specific type of noise. To improve the efficiency of AEM denoi...
详细信息
Existing noise removal processes for airborne electromagnetic (AEM) data generally consist of several steps, with each using a specific method to remove a specific type of noise. To improve the efficiency of AEM denoising and reduce the impact of the subjective judgment of the operators on the processing results, we have adopted a deep learning method based on a denoising autoencoder (DAE), which enables in one single processing step the removal of multisource noise. The most common noise sources in AEM data, including motion-induced noise, nearby or moderately distant sferics noise, power-line noise, and background electromagnetic noise, will be combined with a large number of simulation responses to build a training set. The data in the training set will be used to train the deep learning DAE neural network so that the neural network could fully learn the respective characteristics of the signal and noise and further effectively distinguish the AEM response signal (useful signal) from the above noise. The field data were processed using this method, and the processing results were compared with those obtained using traditional methods. The comparison test revealed that this method is helpful to reduce the influence of subjective factors on the quality of data results and compress the entire AEM data processing time.
暂无评论