检索结果-内蒙古大学图书馆

19th IEEE International Symposium on Biomedical Imaging (IEEE ISBI)

作者： Xu, Weijin Yang, Huihua Zhang, Mingying Pan, Xipeng Liu, Wentao Yan, Songlin Beijing Univ Posts & Telecommun Beijing Peoples R China China Elect Standardizat Inst Beijing Peoples R China Guilin Univ Elect Technol Guilin Peoples R China

ISBN: (纸本)9781665429238

The clinical diagnosis of eye disorders including diabetic retinopathy relies heavily on retinal vessel segmentation. CNN-based methods are the preferred approaches for retina vessel segmentation in recent years, but they are data hungry and prone to overfitting on the training set and achieving suboptimal results on the validation set or the test set. Taking this into consideration, we propose to integrate a variational autoencoder reconstruction branch to pose extra regularization on the shared encoder and increase the generalization ability of networks. Furthermore, to deal with the unbalanced vessel scale distribution, a multi-scale context extractor is carefully designed, which employed the regular convolution and dilated convolution to extract multi-scale context and utilized different fusion method to obtain better complementary features. Extensive experiment results demonstrate that our proposed method achieves comparable state-of-the-art performance on the popular datasets: DRIVE and CHASEDB1.

关键词： Retinal vessel segmentation convolutional neural network variational autoencoder multi-scale

来源：评论

学校读者我要写书评

暂无评论

variational Inference via R′enyi Upper-Lower Bound Optimization 21

Variational Inference via R′enyi Upper-Lower Bound Optimiza...

引用

21st IEEE International Conference on Machine Learning and Applications (IEEE ICMLA)

作者： Zalman, Dana Oshri Fine, Shai Reichman Univ Sch Comp Sci Herzliyya Israel

ISBN: (纸本)9781665462839

variational inference provides a way to approximate probability densities. It does so by optimizing an upper or a lower bound on the likelihood of the observed data (the evidence). The classic variational inference approach suggests to maximize the Evidence Lower BOund (ELBO). Recent proposals suggest to optimize the variational R ' enyi bound (VR) and. upper bound. However, these estimates are either biased or difficult to approximate, due to a high variance. In this paper we introduce a new upper bound (termed VRLU) which is based on the existing variational R ' enyi bound. In contrast to the existing VR bound, the Monte Carlo (MC) approximation of the VRLU bound is unbiased. Furthermore, we devise a (sandwiched) upper-lower bound variational inference method (termed VRS) to jointly optimize the upper and lower bounds. We present a set of experiments, designed to evaluate the new VRLU bound, and to compare the VRS method with the classic VAE and the VR methods over a set of digit recognition tasks. The experiments and results demonstrate the VRLU bound advantage, and the wide applicability of the VRS method.

关键词： variational autoencoder R ' enyi Divergence

来源：评论

学校读者我要写书评

暂无评论

Two-Channel VAE-GAN Based Image-To-Video Translation 1

引用

18th International Conference on Intelligent Computing (ICIC)

作者： Wang, Shengli Xieshi, Mulin Zhou, Zhangpeng Zhang, Xiang Liu, Xujie Tang, Zeyi Dai, Yuxing Xu, Xuexin Lin, Pingyuan Maintenance Co State Grid Power Co Gansu Prov Lanzhou 730000 Gansu Peoples R China State Grid Infotelecom Great Power Sci & Technol Fuzhou 350000 Peoples R China Xiamen Univ Sch Informat Xiamen 361005 Peoples R China

ISBN: (数字)9783031138706

ISBN: (纸本)9783031138706;9783031138690

We propose a VAE-GAN network with a two-channel decoder for addressing multiple image-to-video translation tasks, i.e., generating multiple videos of different categories by a single model. We consider this image-to-video translation as a video generation task rather than a video prediction that needs multiple frames as input. After training, the model only requires the first frame of the video and its corresponding attribute to generate the required video. The advantage of combining the variational autoencoder (VAE) and Generative Adversarial Network (GAN) is to avoid the shortcomings of both: VAE components can give rise to blur, and unstable gradients caused by the GAN. Extensive qualitative and quantitative experiments are conducted on the MUG [1] dataset. We draw the following conclusions from this empirical study: compared with state-of-the-art approaches, our approach (VAE-GAN) exhibits significant improvements in generative capability.

关键词： Video generation variational autoencoder Generative adversarial network

来源：评论

学校读者我要写书评

暂无评论

Visualizing population structure with variational autoencoders

引用

G3-GENES GENOMES GENETICS 2021年第1期11卷 jkaa036页

作者： Battey, C. J. Coffing, Gabrielle C. Kern, Andrew D. Univ Oregon Dept Biol Inst Ecol & Evolut Eugene OR 97403 USA

Dimensionality reduction is a common tool for visualization and inference of population structure from genotypes, but popular methods either return too many dimensions for easy plotting (PCA) or fail to preserve global geometry (t-SNE and UMAP). Here we explore the utility of variational autoencoders (VAEs)-generative machine learning models in which a pair of neural networks seek to first compress and then recreate the input data-for visualizing population genetic variation. VAEs incorporate nonlinear relationships, allow users to define the dimensionality of the latent space, and in our tests preserve global geometry better than t-SNE and UMAP. Our implementation, which we call popvae, is available as a command-line python program at ***/kr-colab/popvae. The approach yields latent embeddings that capture subtle aspects of population structure in humans and Anopheles mosquitoes, and can generate artificial genotypes characteristic of a given sample or population.

关键词： population structure population genetics data visualization pca variational autoencoder deep learning machine learning neural network

来源：评论

学校读者我要写书评

暂无评论

Generalized Gumbel-Softmax gradient estimator for generic discrete random variables

引用

Pattern Recognition Letters 2025年 196卷 148-155页

作者： Weonyoung Joo Dongjun Kim Seungjae Shin Il-Chul Moon Department of Statistics EWHA Womans University Seoul Republic of Korea Department of Computer Science Stanford University CA United States Qualcomm AI Research Seoul Republic of Korea Department of Industrial and Systems Engineering Korea Advanced Institute of Science and Technology Daejeon Republic of Korea

Estimating the gradients of stochastic nodes in stochastic computational graphs is one of the crucial research questions in the deep generative modeling community, which enables gradient descent optimization on neural network parameters. Stochastic gradient estimators of discrete random variables, such as the Gumbel-Softmax reparameterization trick for Bernoulli and categorical distributions, are widely explored. Meanwhile, other discrete distribution cases, such as the Poisson, geometric, binomial, multinomial, negative binomial, etc., have not been explored. This paper proposes a generalized version of the Gumbel-Softmax stochastic gradient estimator. The proposed method is able to reparameterize generic discrete distributions, not restricted to the Bernoulli and the categorical, and it enables learning on large-scale stochastic computational graphs with discrete random nodes. Our experiments consist of (1) synthetic examples and applications on variational autoencoders, which show the efficacy of our methods; and (2) topic models, which demonstrate the value of the proposed estimation in practice.

关键词： Deep generative model Discrete random variable Gumbel-softmax trick Reparameterization trick Stochastic gradient estimator variational autoencoder

来源：评论

学校读者我要写书评

暂无评论

variational autoencoders for Baseball Player Evaluation 5

Variational Autoencoders for Baseball Player Evaluation

引用

5th International Conference on Fuzzy Systems and Data Mining (FSDM)

作者： Converse, Geoffrey Arnold, Brooke Curi, Mariana Oliveira, Suely Univ Iowa Iowa City IA 52242 USA Univ Sao Paulo Sao Paulo Brazil

ISBN: (纸本)9781643680194;9781643680187

In the sporting world, baseball has been quicker to embrace the use of data analytics than any other sport, as detailed baseball statistics have become readily available in large and diverse quantities to the general public. Professional baseball teams use this data to develop game plans and evaluate players. In this work, we explore the latter by using a variational autoencoder (VAE), a special class of artificial neural networks. Specifically, we wish to relate a player's season-long batting statistics with the latent skills that a professional athlete needs to succeed in the MLB. In the growing field of sports analytics, we find this work incredibly important as it provides a novel, flexible, and powerful method to predict specific athletic skills based on years of recorded statistics.

关键词： Neural Networks variational autoencoder Interpretability Sports Analytics

来源：评论

学校读者我要写书评

暂无评论

Synthetic data for enhanced privacy: A VAE-GAN approach against membership inference attacks

引用

KNOWLEDGE-BASED SYSTEMS 2025年 309卷

作者： Yan, Jian'en Huang, Haihui Yang, Kairan Xu, Haiyan Li, Yanling Harbin Inst Technol Fac Comp Harbin 150001 Heilongjiang Peoples R China Inner Mongolia Normal Univ Coll Comp Sci & Technol Hohhot 010022 Peoples R China

The raw data utilized in training machine learning models faces a potential threat from membership inference attacks. To mitigate this risk, employing synthetic data instead of real data is proved effective in desensitizing the information. We introduce a novel generative model, combining variational autoencoder and Generative Adversarial Network, to enhance privacy protection by generating synthetic data. In our approach, discrete variables are encoded by conditional generators, and sampling training is employed to ensure the distribution of synthetic data closely aligning with the real data. The modification of the model structure prompts a refinement of the loss function. We leverage Wasserstein distance with gradient penalty and SNorm to keep the stability of the model training process. Experimental results demonstrate that the efficacy of our model surpasses existing state-of-the-art models in terms of data utility metrics. Notably, in the face of membership inference attacks, the similarity from the results indicates the difficulty when distinguish the real data from synthetic data. It means our model have highlighting capabilities for the privacy protection.

关键词： Membership privacy Synthetic data variational autoencoder Generative Adversarial Network Tabular data

来源：评论

学校读者我要写书评

暂无评论

Piecewise convolutional neural network relation extraction with self-attention mechanism

引用

PATTERN RECOGNITION 2025年 159卷

作者： Zhang, Bo Xu, Li Liu, Ke-Hao Yang, Ru Li, Mao-Zhen Guo, Xiao-Yang Shanghai Normal Univ Coll Informat Mech & Elect Engn Shanghai 200234 Peoples R China Shanghai Normal Univ Inst Artificial Intelligence Educ Shanghai 200234 Peoples R China Brunel Univ London Dept Elect & Elect Engn Uxbridge UB8 3PH England Shanghai Normal Univ Shanghai Engn Res Ctr Intelligent Educ & Bigdata Shanghai 200234 Peoples R China Shanghai Newtouch Software Co Ltd Shanghai 200127 Peoples R China

The task of relation extraction in natural language processing is to identify the relation between two specified entities in a sentence. However, the existing model methods do not fully utilize the word feature information and pay little attention to the influence degree of the relative relation extraction results of each word. In order to address the aforementioned issues, we propose a relation extraction method based on self-attention mechanism (SPCNN-VAE) to solve the above problems. First, we use a multi-head self-attention mechanism to process word vectors and generate sentence feature vector representations, which can be used to extract semantic dependencies between words in sentences. Then, we introduce the word position to combine the sentence feature representation with the position feature representation of words to form the input representation of piecewise convolutional neural network (PCNN). Furthermore, to identify the word feature information that is most useful for relation extraction, an attention-based pooling operation is employed to capture key convolutional features and classify the feature vectors. Finally, regularization is performed by a variational autoencoder (VAE) to enhance the encoding ability of model word information features. The performance analysis is performed on SemEval 2010 task 8, and the experimental results show that the proposed relation extraction model is effective and outperforms some competitive baselines.

关键词： Relation extraction Multi-head attention PCNN variational autoencoder

来源：评论

学校读者我要写书评

暂无评论

Integrated damage detection and time-series data augmentation for floating offshore mooring systems via variational semi-supervised learning

引用

OCEAN ENGINEERING 2025年 330卷

作者： Tamuly, Pranjal Sharma, Smriti Nava, Vincenzo Basque Ctr Appl Math Alameda Mazarredo 14 Bilbao 48009 Spain Politecn Torino Corso Duca Abruzzi 24 Turin Italy

The dynamics and stability of the semi-submersible offshore platforms are significantly impacted by the degradation of the mooring system. Identifying structural integrity issues in mooring systems through a data-driven approach is challenging due to the infrequency of damage events and the difficulties in recording them. To address these challenges, this study proposes the Time-Series variational Semi-Supervised Learning (TSVSSL) framework, which effectively bridges the gap between supervised and unsupervised learning by leveraging unlabelled data for damage detection. The proposed framework features a distinctive training procedure in which the encoder-decoder and classifier components are trained concurrently. This process produces a well-clustered latent representation that enhances damage detection and supports class-specific artificial data generation. A numerical study using simulated responses of a 5 MW semi-submersible FOWT under varying metocean conditions demonstrated that the proposed framework outperformed existing deep learning methods in damage detection, achieving superior accuracy, precision, recall, and F1 score. Further, a rejection sampling technique is also introduced to effectively generates artificial data that closely aligns with actual time series displacement response. The novelty of the proposed framework lies in its dual focus on damage detection and artificial data generation marking a significant advancement in the data-driven assessment of mooring systems.

关键词： Offshore structures Damage diagnosis Floating wind turbines Mooring systems Semi-supervised learning variational autoencoder

来源：评论

学校读者我要写书评

暂无评论

An End-to-end Framework for Graph Clustering: Semantic Fusion of Graph Convolutional Network and variational Auto-encoder 3

An End-to-end Framework for Graph Clustering: Semantic Fusio...

引用

3rd International Conference on Image Processing, Computer Vision and Machine Learning, ICICML 2024

作者： Peng, Yongxin Sun Yat-Sen University School of Computer Science and Engineering Guangzhou China

ISBN: (纸本)9798350355413

Attribute graph clustering is a fundamental and challenging task in graph data mining, requiring the adequate utilization of both node attributes and graph structure. Recently, a series of graph clustering methods have been proposed, integrating both graph convolutional networks (GCNs) and autoencoders to capture structural information and node attributes, respectively. However, most existing methods either employ a complicated summation mechanism in the unaligned representation space from GCN and auto-encoder, or select a certain module (e.g. results from GCN) as a biased target selftraining signal. To address this, we propose a new end-to-end graph clustering framework that integrates a GCN and a variational autoencoder (VAE) with a more efficient and reasonable fusion mechanism in the semantic level. To better supervise the clustering, we select a high-confidence set of nodes based on the consensus of two encoders, further boosting the performances. Extensive experiments on multiple benchmark datasets demonstrate that our framework significantly outperforms state-of-the-art baselines, highlighting its efficacy in graph clustering. © 2024 IEEE.

关键词： Clustering Graph Data Mining variational autoencoder

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：