Long non-coding RNA (lncRNA) are shown to be closely associated with cancer metastatic events (CME, e.g., cancer cell invasion, intravasation, extravasation, proliferation) that collaboratively accelerate malignant ca...
详细信息
Long non-coding RNA (lncRNA) are shown to be closely associated with cancer metastatic events (CME, e.g., cancer cell invasion, intravasation, extravasation, proliferation) that collaboratively accelerate malignant cancer spread and cause high mortality rate in patients. Clinical trials may accurately uncover the relationships between lncRNAs and CMEs;however, it is time-consuming and expensive. With the accumulation of data, there is an urgent need to find efficient ways to identify these relationships. Herein, a graph embedding representation-based predictor (VGEA-LCME) for exploring latent lncRNA-CME associations is introduced. In VGEA-LCME, a heterogeneous combined network is constructed by integrating similarity and linkage matrix that can maintain internal and external characteristics of networks, and a variational graph auto-encoder serves as a feature generator to represent arbitrary lncRNA and CME pair. The final robustness predicted result is obtained by ensemble classifier strategy via cross-validation. Experimental comparisons and literature verification show better remarkable performance of VGEA-LCME, although the similarities between CMEs are challenging to calculate. In addition, VGEA-LCME can further identify organ-specific CMEs. To the best of our knowledge, this is the first computational attempt to discover the potential relationships between lncRNAs and CMEs. It may provide support and new insight for guiding experimental research of metastatic cancers. The source code and data are available at https://github .com /zhuyuan -cug /VGAE-LCME.
Massive light field (LF) data bring tremendous storage and transmission challenges, making the LF compression scheme highly demanded. This paper proposes a novel LF compression method via a variationalgraphauto-enco...
详细信息
ISBN:
(纸本)9781665466110
Massive light field (LF) data bring tremendous storage and transmission challenges, making the LF compression scheme highly demanded. This paper proposes a novel LF compression method via a variational graph auto-encoder (VGAE), aiming to exploit better the structural information of edges and vertices of the graph LF image. More specifically, the graph adjacency matrix and feature matrix are derived from the original graph data in the encoder. Subsequently, a graph convolutional network (GCN) is utilized to determine a multi-dimensional Gaussian distribution, from which the latent representation can be derived by sampling. Finally, the graph LF image can be reconstructed by the inner product of the latent variable in the decoder. The distinct characteristics of the proposed scheme lie in that VGAE encoder applies GCN as a function, which can better alleviate the loss of compression. Moreover, the divergence between the original and the reconstructed signals is evaluated using KL divergence to ensure that the estimator is unbiased, leading to better adaptability. The experimental results demonstrate that the proposed method achieves better performance than the state-of-the-art methods.
Recently, graph neural networks(GNNs) has achieved tremendous success in a variety of fields. Many approaches have been proposed to address data with graph structure. However, many of these are deterministic methods, ...
详细信息
Recently, graph neural networks(GNNs) has achieved tremendous success in a variety of fields. Many approaches have been proposed to address data with graph structure. However, many of these are deterministic methods, therefore, they are unable to capture the uncertainty, which is inherent in the nature of graph data. Various VAE(variationalauto-encoder)-based approaches have been proposed to tackle such problems. Unfortunately, due to the simple a posterior and a prior assumption problems of such methods, they are not well suited to handle uncertainty in graph data. For example, VGAE(variational graph auto-encoder) assumes that the posterior and prior distributions are simple Gaussian distributions, which can lead to overfitting problems when incompatible with the true distributions. Many methods propose to solve the posterior distribution problem, but most ignore the effect of the prior distribution. Therefore, in this paper, we proposed a novel method to solve the Gaussian prior problem. Specifically, in order to enhance the representation power of the prior distribution, we use the diffusion model to model the prior distribution. We incorporate the diffusion model into VGAE. In the forward diffusion process, noise is gradually added to the latent variables, and then the samples are recovered by the backward diffusion process. To realize the backward diffusion process, we propose a new denoising model which predicts noise by stacking GCN(graph Convolution Network) and MLP(Multi-layers Perceptron). We perform experiments on different datasets and the experimental results demonstrate that our method obtains state-of-the-art results.
The traditional graph representation methods can fit the information of graph with low-dimensional vectors, but they cannot interpret their composition, resulting in insufficient security. graph decoupling, as a metho...
详细信息
The traditional graph representation methods can fit the information of graph with low-dimensional vectors, but they cannot interpret their composition, resulting in insufficient security. graph decoupling, as a method of graph representation, can analyze the latent factors composing the graph representation vectors. However, in current graph decoupling methods, the number of factors is a hyperparameter, and enforce uniform decoupling vector dimensions which leads to information loss or redundancy. To address these issues, we propose a type-aware graph decoupling based on influence called variationalgraph Decoupling auto-encoder (VGDAE). It uses node labels as interpretable and objectively existing natural semantics for decoupling and allocates embedding space based on node influence, addressing the issues of manually setting the number of factors in traditional graph decoupling and the mismatch between node information size and embedding space. On the Cora, Citeseer, and fb-CMU datasets, VGDAE shows the impact of different node classes as decoupling targets on classification tasks. Furthermore, we perform visualization of the representations, VGDAE exhibits performance improvements of 2% in classification tasks and 12% in clustering tasks when compared with baseline models.
Although graph representation learning has been studied extensively in static graph settings, dynamic graphs are less investigated in this context. This paper proposes a novel integrated variational framework called D...
详细信息
Although graph representation learning has been studied extensively in static graph settings, dynamic graphs are less investigated in this context. This paper proposes a novel integrated variational framework called DYnamic mixture variationalgraph Recurrent Neural Networks (DyVGRNN), which consists of extra latent random variables in structural and temporal modelling. Our proposed framework comprises an integration of variational graph auto-encoder (VGAE) and graph Recurrent Neural Network (GRNN) by exploiting a novel attention mechanism. The Gaussian Mixture Model (GMM) and the VGAE framework are combined in DyVGRNN to model the multimodal nature of data, which enhances performance. To consider the significance of time steps, our proposed method incorporates an attention-based module. The experimental results demonstrate that our method greatly outperforms state-of-the-art dynamic graph representation learning methods in terms of link prediction and clustering.1 & COPY;2023 Elsevier Ltd. All rights reserved.
Long noncoding RNA (lncRNA) is a kind of noncoding RNA with a length of more than 200 nucleotide units. Numerous research studies have proven that although lncRNAs cannot be directly translated into proteins, lncRNAs ...
详细信息
Long noncoding RNA (lncRNA) is a kind of noncoding RNA with a length of more than 200 nucleotide units. Numerous research studies have proven that although lncRNAs cannot be directly translated into proteins, lncRNAs still play an important role in human growth processes by interacting with proteins. Since traditional biological experiments often require a lot of time and material costs to explore potential lncRNA-protein interactions (LPI), several computational models have been proposed for this task. In this study, we introduce a novel deep learning method known as combined graphauto -encoders (LPICGAE) to predict potential human LPIs. First, we apply a variationalgraphauto -encoder to learn the low dimensional representations from the high -dimensional features of lncRNAs and proteins. Then the graphauto -encoder is used to reconstruct the adjacency matrix for inferring potential interactions between lncRNAs and proteins. Finally, we minimize the loss of the two processes alternately to gain the final predicted interaction matrix. The result in 5 -fold cross -validation experiments illustrates that our method achieves an average area under receiver operating characteristic curve of 0.974 and an average accuracy of 0.985, which is better than those of existing six state-of-the-art computational methods. We believe that LPICGAE can help researchers to gain more potential relationships between lncRNAs and proteins effectively.
graph neural networks (GNNs) achieve remarkable performances for the tasks on graph data. However, recent studies uncover that they are extremely vulnerable to adversarial structural perturbations, leading to their ou...
详细信息
ISBN:
(纸本)9789819755905;9789819755912
graph neural networks (GNNs) achieve remarkable performances for the tasks on graph data. However, recent studies uncover that they are extremely vulnerable to adversarial structural perturbations, leading to their outcomes unreliable. In this paper, we propose DefenseVGAE, a novel defense method for lever-aging variationalgraphautoencoders (VGAEs) to defend GNNs against such attacks. Specifically, DefenseVGAE is trained to reconstruct the graph structure of the graph data in which the reconstructed adjacency matrix is sparse and can reduce the effects of adversarial perturbations and boost the performance of GCN when facing the adversarial attacks. Our experiments on three typical datasets demonstrate that DefenseVGAE is effective under various threat models and even outperforms the existing defense strategies in certain settings.
graph clustering, a basic problem in machine learning and artificial intelligence, facilitates a variety of real-world applications. How to perform a task of graph clustering, with a relatively high-quality optimizati...
详细信息
graph clustering, a basic problem in machine learning and artificial intelligence, facilitates a variety of real-world applications. How to perform a task of graph clustering, with a relatively high-quality optimization decision and an effective yet efficient way to use graph information, to obtain a more excellent assignment for discrete points is not an ordinary challenge that troubles scholars. Often, many preeminent works on graph clustering neglect an essential element that the defined clustering loss may destroy the feature space. This is also a vital factor that leads to unrepresentative nonsense features that generate poor partitioning decisions. Here, we propose an end-to-end variationalgraph clustering (EVGC) algorithm focusing on preserving the original information of the graph. Specifically, the KL loss with an auxiliary distribution serves as a specific guide to manipulate the embedding space, and consequently disperse data points. A graphauto-encoder plays a propulsive role in maximumly retaining the local structure of the generative distribution of the graph. And each node is represented as a Gaussian distribution in dealing with separating the true embedding position and uncertainty from the graph. Experimental results reveal the importance of preserving local structure, and our EVGC system outperforms state-of-the-art approaches.
LncRNAs are intermediate molecules that participate in the most diverse biological processes in humans, such as gene expression control and X-chromosome inactivation. Numerous researches have associated lncRNAs with a...
详细信息
LncRNAs are intermediate molecules that participate in the most diverse biological processes in humans, such as gene expression control and X-chromosome inactivation. Numerous researches have associated lncRNAs with a wide range of diseases, such as breast cancer, leukemia, and many other conditions. In this work, we propose a graph-based method named PANDA. This method treats the prediction of new associations between lncRNAs and diseases as a link prediction problem in a graph. We start by building a heterogeneous graph that contains the known associations between lncRNAs and diseases and additional information such as gene expression levels and symptoms of diseases. We then use a graphauto-encoder to learn the representation of the nodes' features and edges, finally applying a Neural Network to predict potentially interesting novel edges. The experimental results indicate that PANDA achieved a 0.976 AUC-ROC, surpassing state-of-the-art methods for the same problem, showing that PANDA could be a promising approach to generate embeddings to predict potentially novel lncRNA-disease associations.
graph clustering based on embedding aims to divide nodes with higher similarity into several mutually disjoint groups, but it is not a trivial task to maximumly embed the graph structure and node attributes into the l...
详细信息
graph clustering based on embedding aims to divide nodes with higher similarity into several mutually disjoint groups, but it is not a trivial task to maximumly embed the graph structure and node attributes into the low dimensional feature space. Furthermore, most of the current advanced methods of graph nodes clustering adopt the strategy of separating graph embedding technology and clustering algorithm, and ignore the potential relationship between them. Therefore, we propose an innovative end to-end graph clustering framework with joint strategy to handle the complex problem in a non-Euclidean space. In terms of learning the graph embedding, we propose a new variational graph auto-encoder algorithm based on the graph Convolution Network (GCN), which takes into account the boosting influence of joint generative model of graph structure and node attributes on the embedding output. On the basis of embedding representation, we implement a self-training mechanism through the construction of auxiliary distribution to further enhance the prediction of node categories, thereby realizing the unsupervised clustering mode. In addition, the loss contribution of each cluster is normalized to prevent large clusters from distorting the embedding space. Extensive experiments on real-world graph datasets validate our design and demonstrate that our algorithm has highly competitive in graph clustering over state-of-theart methods. (c) 2021 Elsevier Ltd. All rights reserved.
暂无评论