Cloud computing is a model for enabling ubiquitous, convenient, on-demand network access to a shared pool of configurable computing resources that can be rapidly provisioned and released with minimal management effort...
详细信息
Cloud computing is a model for enabling ubiquitous, convenient, on-demand network access to a shared pool of configurable computing resources that can be rapidly provisioned and released with minimal management effort or service provider interaction. There are many proposals for resource management approaches for cloud infrastructures, but effective resource management is still a major challenge for the leading cloud infrastructure operators (e.g., Amazon, Microsoft, Google), because the details of the underlying workloads and the real-world operational demands are too complex. Among those proposals, accurate host load prediction is one of the most effective measures to address this challenge. In this paper, we proposed a new method for host load prediction, which uses an autoencoder as the pre-recurrent feature layer of the echo state networks. The aim of our proposed method is to predict the host load in the future interval based on Google cluster usage dataset. Experiments performed on Google load traces show that our proposed method achieves higher accuracy than the state-of-the-art methods.
According to the smart manufacturing paradigm, the analysis of assets' time series with a machine learning approach can effectively prevent unplanned production downtimes by detecting assets' anomalous operati...
详细信息
According to the smart manufacturing paradigm, the analysis of assets' time series with a machine learning approach can effectively prevent unplanned production downtimes by detecting assets' anomalous operational conditions. To support smart manufacturing operators with no data science background, we propose an anomaly detection approach based on deep learning and aimed at providing a manageable machine learning pipeline and easy to interpret outcome. To do so we combine (i) an autoencoder, a deep neural network able to produce an anomaly score for each provided time series, and (ii) a discriminator based on a general heuristics, to automatically discern anomalies from regular instances. We prove the convenience of the proposed approach by comparing its performances against isolation forest with different case studies addressing industrial laundry assets' power consumption and bearing vibrations. (C) 2020 Elsevier B.V. All rights reserved.
The sensor-based human activity recognition (HAR) using machine learning requires a sufficiently large amount of annotated data to realize an accurate classification model. This requirement stimulates the advancement ...
详细信息
The sensor-based human activity recognition (HAR) using machine learning requires a sufficiently large amount of annotated data to realize an accurate classification model. This requirement stimulates the advancement of the transfer learning research area that minimizes the use of labeled data by transferring knowledge from the existing activity recognition domain. Existing approaches transform the data into a common subspace between domains which theoretically loses information, to begin with. Besides, they are based on the linear projection which is bound to linearity assumption and its limitations. Some recent works have already incorporated nonlinearity to find a latent representation that minimizes domain discrepancy based on an autoencoder that includes statistical distance minimization. However, such approach discovers latent representation for both domains at once, which causes sub-optimal representation because both domains compensate each other's reconstruction error during the training. We propose an autoencoder-based approach on domain adaptation for sensor-based HAR. The proposed approach learns a latent representation which minimizes the discrepancy between domains by reducing statistical distance. Instead of learning representation of both domains simultaneously, our method is a two-phase approach which first learns the representation for the domain of interest independently to ensure its optimality. Subsequently, the useful information from the existing domain is transferred. We test our approach on the publicly available sensor-based HAR datasets, using cross-domain setup. The experimental result shows that our approach significantly outperforms the existing ones.
Due to its high accuracy and ease of calculation,synchrophasor-based linear state estimation(LSE)has attracted a lot of attention in the last decade and has formed the cornerstone of many wide area monitor system(WAMS...
详细信息
Due to its high accuracy and ease of calculation,synchrophasor-based linear state estimation(LSE)has attracted a lot of attention in the last decade and has formed the cornerstone of many wide area monitor system(WAMS)***,an increasing number of data quality concerns have been reported,among which bad data can significantly undermine the performance of LSE and many other WAMS applications it *** data filtering can be difficult in practice due to a variety of issues such as limited processing time,non-uniform and changing patterns,and *** pre-process phasor measurement unit(PMU)measurements for LSE,we propose an improved denoising autoencoder(DA)-aided bad data filtering strategy in this *** data is first identified by the classifier module of the proposed DA and then recovered by the autoencoder *** characteristics distinguish the proposed methodology:1)The approach is lightweight and can be implemented at individual PMU level to achieve maximum parallelism and high efficiency,making it suited for real-time processing;2)the system not only identifies bad data but also recovers it,especially for critical *** use numerical experiments employing both simulated and real-world phasor data to validate and illustrate the effectiveness of the proposed method.
The imbalanced data classification is a challenging issue in many domains including medical intelligent diagnosis and fraudulent transaction analysis. The performance of the conventional classifier degrades due to the...
详细信息
The imbalanced data classification is a challenging issue in many domains including medical intelligent diagnosis and fraudulent transaction analysis. The performance of the conventional classifier degrades due to the imbalanced class distribution of the training data set. Recently, machine learning and deep learning techniques are used for imbalanced data classification. Data preprocessing approaches are also suitable for handling class imbalance problem. Data augmentation is one of the preprocessing techniques used to handle skewed class distribution. Synthetic Minority Oversampling Technique (SMOTE) is a promising class balancing approach and it generates noise during the process of creation of synthetic samples. In this paper, autoencoder is used as a noise reduction technique and it reduces the noise generated by SMOTE. Further, Deep one-dimensional Convolutional Neural Network is used for classification. The performance of the proposed method is evaluated and compared with existing approaches using different metrics such as Precision, Recall, Accuracy, Area Under the Curve and Geometric Mean. Ten data sets with imbalance ratio ranging from 1.17 to 577.87 and data set size ranging from 303 to 284807 instances are used in the experiments. The different imbalanced data sets used are Heart-Disease, Mammography, Pima Indian diabetes, Adult, Oil-Spill, Phoneme, Creditcard, BankNoteAuthentication, Balance scale weight & distance database and Yeast data sets. The proposed method shows an accuracy of 96.1%, 96.5%, 87.7%, 87.3%, 95%, 92.4%, 98.4%, 86.1%, 94% and 95.9% respectively. The results suggest that this method outperforms other deep learning methods and machine learning methods with respect to G-mean and other performance metrics.
Unmanned surface vehicle(USV)is currently a hot research topic in maritime communication network(MCN),where denoising and semantic segmentation of maritime images taken by USV have been rarely *** former has recently ...
详细信息
Unmanned surface vehicle(USV)is currently a hot research topic in maritime communication network(MCN),where denoising and semantic segmentation of maritime images taken by USV have been rarely *** former has recently researched on autoencoder model used for image denoising,but the existed models are too complicated to be suitable for real-time detection of *** this paper,we proposed a lightweight autoencoder combined with inception module for maritime image denoising in different noisy environments and explore the effect of different inception modules on the denoising ***,we completed the semantic segmentation task for maritime images taken by USV utilizing the pretrained U-Net model with tuning,and compared them with original U-Net model based on different ***,we compared the semantic segmentation of noised and denoised maritime images respectively to explore the effect of image noise on semantic segmentation *** studies are provided to prove the feasibility of our proposed denoising and segmentation ***,a simple integrated communication system combining image denoising and segmentation for USV is shown.
Abnormal detection plays an important role in video surveillance. LSTM encoder-decoder is used to learn representation of video sequences and applied for detecting abnormal event in complex environment. The learned re...
详细信息
Abnormal detection plays an important role in video surveillance. LSTM encoder-decoder is used to learn representation of video sequences and applied for detecting abnormal event in complex environment. The learned representation of LSTM encoder-decoder is learned from encoder, and it is crucial for decoder. However, LSTM encoder-decoder generally fails to account for the global context of the learned representation with a fixed dimension representation. In this paper, we explore a hybrid autoencoder architecture, which not only extracts better spatio-temporal context, but also improves the extrapolate capability of the corresponding decoder by the shortcut connection. The experiment shows that the hybrid model performs better than the state-of-the-art anomaly detection methods in both qualitative and quantitative ways on benchmark datasets.
The domain adaptation uses labeled source domain data to train a classifier to be used in the target domain with no or small amount of labeled data. Usually there exists discrepancy in terms of marginal and conditiona...
详细信息
The domain adaptation uses labeled source domain data to train a classifier to be used in the target domain with no or small amount of labeled data. Usually there exists discrepancy in terms of marginal and conditional distributions for both source and target domains,which is of critical importance to minimize the distribution discrepancy between domains. As a classical model in deep learning, the autoencoder is capable of realizing distribution matching and enhancing classification accuracy by extracting more abstract and effective features from data. A Domain adaptation network based on autoencoder(DANA) is proposed. The DANA structure consists of a couple of encoding layers: a feature extraction layer and a classification layer. For the feature extraction layer,the marginal distributions of source and target domains are matched by using the nonparametric maximum mean discrepancy measurement. For the classification layer, the softmax regression model is applied to encode the label information of source domains meanwhile to match the conditional distribution. Experimental results on ImageNet,Corel and Leaves datasets have shown the enhanced classification accuracy by our proposed algorithm compared with the classical methods.
Electricity theft is considered one of the most significant reasons of the non technical losses (NTL). It negatively influences the utilities in terms of the power supply quality, grid's safety, and economic loss....
详细信息
Electricity theft is considered one of the most significant reasons of the non technical losses (NTL). It negatively influences the utilities in terms of the power supply quality, grid's safety, and economic loss. Therefore, it is necessary to effectively deal with the electricity theft problem. For detecting electricity theft in smart grids (SGs), an efficient and state-of-the-art approach is designed in the underlying work based on autoencoder and bidirectional gated recurrent unit (AE-BiGRU). The proposed approach consists of six components: (1) data collection, (2) data preparation, (3) data balancing, (4) feature extraction, (5) classification and (6) performance evaluation. Moreover, bidirectional gated recurrent unit (BiGRU) is used for the identification of the anomalies in electricity consumption (EC) patterns caused due to factors like family formation changes, holidays, parties, and so on, which are referred as non-theft factors. The proposed autoencoder-bidirectional gated recurrent unit (AE-BiGRU) model employs the EC data acquired from state grid corporation of China (SGCC) for simulations. Furthermore, it is visualized from the simulation results that 90.1% accuracy and 10.2% false positive rate (FPR) are obtained by the proposed model. The results are better than different existing classifiers, i.e., logistic regression (LR), decision tree (DT), extreme gradient boosting (XGBoost), gated recurrent unit (GRU), etc.
To enhance the accuracy of identifying water sources in mine inrush incidents, this study, taking the Shengquan coal mine in Shandong, China, as a case study, proposes a novel water source identification model based o...
详细信息
To enhance the accuracy of identifying water sources in mine inrush incidents, this study, taking the Shengquan coal mine in Shandong, China, as a case study, proposes a novel water source identification model based on an improved autoencoder-the "Masked autoencoder-based Classifier" model. This model, through a unique autoencoder framework and a custom 'masked_loss' loss function, achieves semi-supervised learning and dimensionality reduction of groundwater sample ionic data. By configuring the hidden layers, the classifier component of the model directly receives data processed by the encoder component. This not only improves the model's performance but also optimizes its complexity. Through an evaluation of the model's fitting effectiveness, our model achieved an average accuracy of 88.8% across 20 runs, with precision, recall, F1-score, and MCC reaching 88.1%, 80.6%, 0.827, and 0.833, respectively, significantly outperforming other classic models. The model successfully identified the sources of three sets of inrush water samples, with a high number of successful runs and clear average probabilities. This work contributes not only to the field of mine water inrush source identification but also offers a new perspective for the broader field of machine learning.
暂无评论