检索结果-内蒙古大学图书馆

A deep variational matrix factorization method for recommendation on large scale sparse dataset

NEUROCOMPUTING 2019年 334卷 206-218页

作者： Zhang, Weina Zhang, Xingming Wang, Haoxiang Chen, Dongpei South China Univ Technol Higher Educ Mega Ctr Sch Comp Sci & Engn Guangzhou 510006 Guangdong Peoples R China

Traditional recommendation methods based on matrix factorization techniques have yielded immense success because of their good scalability. However, they still face the problem of data sparsity, which may lead to a reduction in recommendation performance. As it is hard to learn good latent features in the sparse user-item rating matrix. In recent years, deep learning is very appealing in learning effective representations. Its non-linear characteristics just remedy the shortcomings of matrix factorization. In this paper, a novel method deep variational matrix factorization recommendation (DVMF) is proposed for large scale sparse dataset. DVMF is based on latent factors to predict the ratings. The latent features of the users and items are respectively obtained through a deep nonlinear structure. Based on the latent factors and combined with matrix factorization method, the paper presents algorithm optimization method of DVMF. The experiments on three real-world datasets from different domains show that DVMF is able to provide higher accuracy than recommendation algorithms based on matrix factorization or deep learning individually on large scale sparse dataset. (C) 2019 Elsevier B.V. All rights reserved.

关键词： Recommendation system Deep matrix factorization variational autoencoder Matrix factorization

来源：评论

学校读者我要写书评

暂无评论

Latent Variable Based Anomaly Detection in Network System Logs

引用

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS 2019年第9期E102D卷 1644-1652页

作者： Otomo, Kazuki Kobayashi, Satoru Fukuda, Kensuke Esaki, Hiroshi Univ Tokyo Grad Sch Informat Sci & Technol Tokyo 1138654 Japan Natl Inst Informat Tokyo 1018430 Japan Sokendai Dept Informat Tokyo 1018430 Japan

System logs are useful to understand the status of and detect faults in large scale networks. However, due to their diversity and volume of these logs, log analysis requires much time and effort. In this paper, we propose a log event anomaly detection method for large-scale networks without pre-processing and feature extraction. The key idea is to embed a large amount of diverse data into hidden states by using latent variables. We evaluate our method with 12 months of system logs obtained from a nation-wide academic network in Japan. Through comparisons with Kleinberg's univariate burst detection and a traditional multivariate analysis (i.e., PCA), we demonstrate that our proposed method achieves 14.5% higher recall and 3% higher precision than PCA. A case study shows detected anomalies are effective information for troubleshooting of network system faults.

关键词： network operation system logs syslog anomaly detection latent variable analysis variational autoencoder

来源：评论

学校读者我要写书评

暂无评论

Semi-Supervised EEG Signals Classification System for Epileptic Seizure Detection

引用

IEEE SIGNAL PROCESSING LETTERS 2019年第12期26卷 1922-1926页

作者： Abdelhameed, Ahmed M. Bayoumi, Magdy Univ Louisiana Lafayette Ctr Adv Comp Studies Lafayette LA 70503 USA Univ Louisiana Lafayette Dept Elect & Comp Engn Lafayette LA 70503 USA

In the past few decades, measuring and recording the brain electrical activities using Electroencephalogram (EEG) has become a standout amongst the tools utilized for neurological disorders' diagnosis, especially seizure detection. In this letter, a novel epileptic seizure detection system based on classifying raw EEG signals' recordings, eliminating the overhead of engineered feature extraction, is proposed. The system employs a mixing of unsupervised and supervised deep learning utilizing a one-dimensional convolutional variational autoencoder. To ascertain the robustness of the system against classifying unseen data, the evaluation of the proposed system is done using k-fold cross-validation. The classification results between normal and ictal cases have achieved a 100 accuracy while the classification results between the normal, inter-ictal and ictal cases accomplished a 99 overall accuracy which makes our system one of the most efficient among other state-of-the-art systems.

关键词： Classification cross-validation deep learning epileptic seizure detection feature extraction variational autoencoder

来源：评论

学校读者我要写书评

暂无评论

Exploring DNA Methylation Data of Lung Cancer Samples with variational autoencoders

Exploring DNA Methylation Data of Lung Cancer Samples with V...

引用

IEEE International Conference on Bioinformatics and Biomedicine (BIBM) - Human Genomics

作者： Wang, Zhenxing Wang, Yadong Harbin Inst Technol Sch Comp Sci & Technol Harbin Heilongjiang Peoples R China

ISBN: (纸本)9781538654880

Lung cancer causes over one million deaths each year worldwide. DNA methylation is a well-defined epigenetics factor in genome data analyses for model training. In this article, we explore the applications of unsupervised deep learning method, variational autoencoders, using DNA methylation data of lung cancer samples downloaded from the GDC TCGA project and perform further work with latent features. We show the logistic regression classifier on the encoded latent features accurately classifies cancer subtypes.

关键词： DNA methylation lung cancer variational autoencoder

来源：评论

学校读者我要写书评

暂无评论

Neural variational collaborative filtering with side information for top-K recommendation

引用

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS 2019年第11期10卷 3273-3284页

作者： Deng, Xiaoyi Zhuang, Fuzhen Zhu, Zhiguo Huaqiao Univ Sch Business Quanzhou 362021 Fujian Peoples R China Huaqiao Univ Res Ctr Appl Stat & Big Data Xiamen 361021 Fujian Peoples R China Chinese Acad Sci Inst Comp Technol Key Lab Intelligent Informat Proc Beijing 100190 Peoples R China Zhengzhou Univ Sch Informat Engn Zhengzhou 450001 Henan Peoples R China Zhengzhou Univ Res Ctr Digital Med Image Tech Zhengzhou 450001 Henan Peoples R China Dongbei Univ Finance & Econ Sch Management Sci & Engn Dalian 116025 Peoples R China

Collaborative filtering (CF) is one of the most widely applied models for recommender systems. Despite its success, CF-based methods suffer from rating sparsity and cold-start problem, which leads to poor quality of recommendations. Previous studies have gave great attention to construct hybrid methods, by incorporating side information and user rating. variational autoencoder (VAE) has been confirmed to be highly effective in CF task, due to its Bayesian nature and non-linearity. However, rating sparsity remains a great challenge to most VAE models, which leads to poor latent user/item representations. In addition, most existing VAE-based methods model either latent user factors or latent item factors, resulting in the incapacity to recommend items to a new user or suggest a new item to existing users. To address these problems, we design a novel deep hybrid framework for top-k recommendation, neural variational collaborative filtering (NVCF), and propose three NVCF-based instantiation. In generative process, the side information of user and item is incorporated to alleviate rating sparsity, for learning better latent user/item representations. In inference process, a Stochastic Gradient variational Bayes approach is employed to approximate the unmanageable distributions of latent user/item factors. Experiments performed on four public datasets have indicated our methods significantly outperform the state-of-the-art hybrid CF models and VAE-based methods.

关键词： Neural collaborative filtering variational autoencoder Top-K recommendation Side information Implicit feedback

来源：评论

学校读者我要写书评

暂无评论

A Learning-Based Method for Solving III-Posed Nonlinear Inverse Problems: A Simulation Study of Lung EIT

引用

SIAM JOURNAL ON IMAGING SCIENCES 2019年第3期12卷 1275-1295页

作者： Seo, Jin Keun Kim, Kang Cheol Jargal, Ariungerel Lee, Kyounghun Harrach, Bastian Yonsei Univ Dept Computat Sci & Engn Seoul 120749 South Korea Goethe Univ Frankfurt Dept Math D-60325 Frankfurt Germany

This paper proposes a new approach for solving ill-posed nonlinear inverse problems. For ease of explanation of the proposed approach, we use the example of lung electrical impedance tomography (EIT), which is known to be a nonlinear and ill-posed inverse problem. Conventionally, penalty-based regularization methods have been used to deal with the ill-posed problem. However, experiences over the last three decades have shown methodological limitations in utilizing prior knowledge about tracking expected imaging features for medical diagnosis. The proposed method's paradigm is completely different from conventional approaches;the proposed reconstruction uses a variety of training data sets to generate a low dimensional manifold of approximate solutions, which allows conversion of the ill-posed problem to a well-posed one. variational autoencoder was used to produce a compact and dense representation for lung EIT images with a low dimensional latent space. Then, we learn a robust connection between the EIT data and the low dimensional latent data. Numerical simulations validate the effectiveness and feasibility of the proposed approach.

关键词： electrical impedance tomography deep learning variational autoencoder inverse problems

来源：评论

学校读者我要写书评

暂无评论

Extracting a biologically relevant latent space from cancer transcriptomes with variational autoencoders 23rd

Extracting a biologically relevant latent space from cancer ...

引用

23rd Pacific Symposium on Biocomputing (PSB)

作者： Way, Gregory P. Greene, Casey S. Univ Penn Genom & Computat Biol Grad Program Philadelphia PA 19104 USA Univ Penn Dept Syst Pharmacol & Translat Therapeut Philadelphia PA 19104 USA

ISBN: (纸本)9789813235533;9789813235526

The Cancer Genome Atlas (TCGA) has profiled over 10,000 tumors across 33 different cancer-types for many genomic features, including gene expression levels. Gene expression measurements capture substantial information about the state of each tumor. Certain classes of deep neural network models are capable of learning a meaningful latent space. Such a latent space could be used to explore and generate hypothetical gene expression profiles under various types of molecular and genetic perturbation. For example, one might wish to use such a model to predict a tumor's response to specific therapies or to characterize complex gene expression activations existing in differential proportions in different tumors. variational autoencoders (VAEs) are a deep neural network approach capable of generating meaningful latent spaces for image and text data. In this work, we sought to determine the extent to which a VAE can be trained to model cancer gene expression, and whether or not such a VAE would capture biologically-relevant features. In the following report, we introduce a VAE trained on TCGA pan-cancer RNA-seq data, identify specific patterns in the VAE encoded features, and discuss potential merits of the approach. We name our method "Tybalt" after an instigative, cat-like character who sets a cascading chain of events in motion in Shakespeare's "Romeo and Juliet". From a systems biology perspective, Tybalt could one day aid in cancer stratification or predict specific activated expression patterns that would result from genetic changes or treatment effects.

关键词： Deep Learning Gene Expression variational autoencoder The Cancer Genome Atlas

来源：评论

学校读者我要写书评

暂无评论

Empirical Evaluation of variational autoencoders for Data Augmentation 13

Empirical Evaluation of Variational Autoencoders for Data Au...

引用

13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP) / International Conference on Computer Vision Theory and Applications (VISAPP)

作者： Jorge, Javier Vieco, Jesus Paredes, Roberto Andreu Sanchez, Joan Miguel Benedi, Jose Univ Politecn Valencia Dept Sistemas Informat & Computat Valencia Spain

ISBN: (纸本)9789897583063

Since the beginning of Neural Networks, different mechanisms have been required to provide a sufficient number of examples to avoid overfitting. Data augmentation, the most common one, is focused on the generation of new instances performing different distortions in the real samples. Usually, these transformations are problem-dependent, and they result in a synthetic set of, likely, unseen examples. In this work, we have studied a generative model, based on the paradigm of encoder-decoder, that works directly in the data space, that is, with images. This model encodes the input in a latent space where different transformations will be applied. After completing this, we can reconstruct the latent vectors to get new samples. We have analysed various procedures according to the distortions that we could carry out, as well as the effectiveness of this process to improve the accuracy of different classification systems. To do this, we could use both the latent space and the original space after reconstructing the altered version of these vectors. Our results have shown that using this pipeline (encoding-altering-decoding) helps the generalisation of the classifiers that have been selected.

关键词： Generative Models Data Augmentation variational autoencoder

来源：评论

学校读者我要写书评

暂无评论

Expanding variational autoencoders for learning and exploiting latent representations in search distributions 18

Expanding variational autoencoders for learning and exploiti...

引用

Genetic and Evolutionary Computation Conference (GECCO)

作者： Garciarena, Unai Santana, Roberto Mendiburu, Alexander Univ Basque Country Intelligent Syst Grp Donostia San Sebastian Spain

ISBN: (纸本)9781450356183

In the past, evolutionary algorithms (EAs) that use probabilistic modeling of the best solutions incorporated latent or hidden variables to the models as a more accurate way to represent the search distributions. Recently, a number of neural-network models that compute approximations of posterior (latent variable) distributions have been introduced. In this paper, we investigate the use of the variational autoencoder (VAE), a class of neural-network based generative model, for modeling and sampling search distributions as part of an estimation of distribution algorithm. We show that VAE can capture dependencies between decision variables and objectives. This feature is proven to improve the sampling capacity of model based EAs. Furthermore, we extend the original VAE model by adding a new, fitness-approximating network component. We show that it is possible to adapt the architecture of these models and we present evidence of how to extend VAEs to better fulfill the requirements of probabilistic modeling in EAs. While our results are not yet competitive with state of the art probabilistic-based optimizers, they represent a promising direction for the application of generative models within EDAs.

关键词： machine learning variational autoencoder estimation of distribution algorithm neural network generative modeling

来源：评论

学校读者我要写书评

暂无评论

Semi-Supervised Multichannel Speech Enhancement With a Deep Speech Prior

引用

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING 2019年第12期27卷 2197-2212页

作者： Sekiguchi, Kouhei Bando, Yoshiaki Nugraha, Aditya Arie Yoshii, Kazuyoshi Kawahara, Tatsuya Kyoto Univ Grad Sch Informat Kyoto 6068501 Japan RIKEN Ctr Adv Intelligence Project AIP Tokyo 1030027 Japan Natl Inst Adv Ind Sci & Technol Tokyo 1350064 Japan

This paper describes a semi-supervised multichannel speech enhancement method that uses clean speech data for prior training. Although multichannel nonnegative matrix factorization (MNMF) and its constrained variant called independent low-rank matrix analysis (ILRMA) have successfully been used for unsupervised speech enhancement, the low-rank assumption on the power spectral densities (PSDs) of all sources (speech and noise) does not hold in reality. To solve this problem, we replace a low-rank speech model with a deep generative speech model, i.e., formulate a probabilistic model of noisy speech by integrating a deep speech model, a low-rank noise model, and a full-rank or rank-1 model of spatial characteristics of speech and noise. The deep speech model is trained from clean speech data in an unsupervised auto-encoding variational Bayesian manner. Given multichannel noisy speech spectra, the full-rank or rank-1 spatial covariance matrices and PSDs of speech and noise are estimated in an unsupervised maximum-likelihood manner. Experimental results showed that the full-rank version of the proposed method was significantly better than MNMF, ILRMA, and the rank-1 version. We confirmed that the initialization-sensitivity and local-optimum problems of MNMF with many spatial parameters can be solved by incorporating the precise speech model.

关键词： Speech enhancement Noise measurement Data models Probabilistic logic Maximum likelihood estimation Time-frequency analysis Multichannel speech enhancement deep learning variational autoencoder nonnegative matrix factorization

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：