Search Results - Inner Mongolia University Library

Variational Autoencoder for Image-Based Augmentation of Eye-Tracking Data

JOURNAL OF IMAGING, 2021, vol. 7, no. 5, p. 83

Authors: Elbattah, Mahmoud; Loughnane, Colm; Guerin, Jean-Luc; Carette, Romuald; Cilia, Federica; Dequen, Gilles

Affiliations: Univ Picardie Jules Verne Lab Modelisat Informat Syst MIS F-80080 Amiens France; Univ Limerick Fac Sci & Engn Limerick V94 T9PX Ireland; Evolucare Technol F-80800 Villers Bretonneux France; Univ Picardie Jules Verne Lab CRP CPO F-80000 Amiens France

Over the past decade, deep learning has achieved unprecedented successes in a diversity of application domains, given large-scale datasets. However, particular domains, such as healthcare, inherently suffer from data paucity and imbalance. Moreover, datasets could be largely inaccessible due to privacy concerns or a lack of data-sharing incentives. Such challenges have attached significance to the application of generative modeling and data augmentation in that domain. In this context, this study explores a machine learning-based approach for generating synthetic eye-tracking data. We explore a novel application of variational autoencoders (VAEs) in this regard. More specifically, a VAE model is trained to generate an image-based representation of the eye-tracking output, so-called scanpaths. Overall, our results validate that the VAE model could generate a plausible output from a limited dataset. Finally, it is empirically demonstrated that such an approach could be employed as a mechanism for data augmentation to improve the performance in classification tasks.

Keywords: deep learning variational autoencoder data augmentation eye-tracking

Whisper Speech Enhancement Using Joint Variational Autoencoder for Improved Speech Recognition

Interspeech Conference

Authors: Agrawal, Vikas; Kumar, Shashi; Rath, Shakti P.

Affiliations: Samsung R&D Inst India Bangalore Karnataka India; Reverie Language Technol Bangalore Karnataka India

ISBN: (print) 9781713836902

Whispering is the natural choice of communication when one wants to interact quietly and privately. Due to vast differences in the acoustic characteristics of whisper and natural speech, recognition performance degrades drastically when whisper speech is decoded by an Automatic Speech Recognition (ASR) system trained on neutral speech. Recently, denoising autoencoders (DA) have been used to handle this mismatched train and test scenario, which gives some improvement. To improve over DA performance, we propose another method to map speech from the whisper domain to the neutral speech domain via a Joint Variational Auto-Encoder (JVAE). The proposed method requires time-aligned parallel data, which is not available, so we developed an algorithm to convert parallel data to time-aligned parallel data. JVAE jointly learns the characteristics of whisper and neutral speech in a common latent space, which significantly improves whisper recognition accuracy and outperforms traditional autoencoder-based techniques. We benchmarked our method against two baselines, the first being ASR trained on neutral speech and tested on the whisper dataset, and the second being the whisper test set mapped using DA and tested on the same neutral ASR. We achieved an absolute improvement of 22.31% in Word Error Rate (WER) over the first baseline and an absolute 5.52% improvement over DA.

Keywords: whisper speech recognition autoencoder wTIMIT variational autoencoder jointVAE

Multi-scale spatial-spectral attention network for multispectral image compression based on variational autoencoder

SIGNAL PROCESSING, 2022, vol. 198

Authors: Kong, Fanqiang; Cao, Tongbo; Li, Yunsong; Li, Dan; Hu, Kedi

Affiliations: Nanjing Univ Aeronaut & Astronaut Coll Astronaut Nanjing 210016 Peoples R China; Xidian Univ State Key Lab Integrated Serv Networks Xian 710071 Peoples R China

Based upon the fact that multispectral image compression needs to remove both spatial and spectral redundancy, recent learnt models trained in an end-to-end manner have shown promising performance. However, most of them ignore the characteristics of multispectral images, i.e., the non-stationarity of spectral correlation and the scale-diversity of spatial features. Meanwhile, they directly utilize a fully factorized entropy model, rendering compression performance suboptimal. This paper proposes a Multi-Scale Spatial-Spectral Attention Network (MSSSA-Net) based on the variational autoencoder (VAE). Our MSSSA-Net (1) incorporates a simple neuroscience-based non-local attention module into the attention mechanism to capture the tiny features in adjacent pixels and large-scale features in the spatial domain simultaneously, and (2) proposes a multi-scale spectral attention block to extract the non-stationary correlation of adjacent spectra at different scales. We demonstrate that our MSSSA-Net offers state-of-the-art performance in comparison with classical algorithms, including JPEG2000 and 3D-SPIHT, and recent learnt image compression models, on 7-band and 8-band datasets from the Landsat-8 and WorldView-3 satellites, when measured by PSNR, MS-SSIM and Mean Spectral Angle. Extensive ablation experiments have verified the effectiveness of each component, and have demonstrated that, for multispectral image compression, the Scale-only Hyperprior can make a better trade-off between compression performance and complexity compared with the Mean & Scale Hyperprior and the Joint Autoregressive model.

Keywords: Hierarchical prior model Learnt multispectral image compression Spatial-spectral attention variational autoencoder

An Improved Semi-supervised Variational Autoencoder with Gate Mechanism for Text Classification

INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, vol. 36, no. 10, p. 2253006

Authors: Ye, Haiming; Zhang, Weiwen; Nie, Mengna

Affiliation: Guangdong Univ Technol Sch Comp Sci & Technol Guangzhou 510006 Peoples R China

In recent years, semi-supervised learning has been investigated to take full advantage of increasing unlabeled data. Although pretrained deep learning models are successfully adopted on a massive amount of unlabeled data, they may not be applicable in specific domains where the data is limited. In this paper, we propose a model, termed Semi-supervised Variational Autoencoder (SVAE), which uses Gated Convolutional Neural Networks (GCNN) as both the encoder and the decoder. Since the canonical VAE suffers from the Kullback-Leibler (KL) vanishing problem, we attach a layer named Scalar after Batch Normalization (BN) to scale the output of the BN. We conduct experiments on two domain-specific datasets with a small amount of data. The results show that SVAE outperforms alternative baselines for language modeling and semi-supervised learning. In particular, the language modeling results validate the effect of combining BN and Scalar for tackling the KL vanishing problem. Moreover, the visualization of the latent representations verifies the performance of SVAE on less data.

Keywords: Semi-supervised learning variational autoencoder text classification gated convolutional neural networks Kullback-Leibler vanishing problem
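
To make the KL vanishing problem named in the abstract above concrete, the following minimal sketch computes the standard closed-form KL divergence between a diagonal Gaussian posterior and a standard normal prior, the regularization term of a canonical VAE. It is not the authors' SVAE, and the function name `gaussian_kl` is a hypothetical choice made here for illustration. When the encoder outputs a zero mean and unit variance for every input, the term is exactly zero and the latent code carries no information, which is the collapse the abstract refers to.

```python
import math


def gaussian_kl(mu, log_var):
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ), summed over latent dimensions.

    Hypothetical helper for illustration only; mu and log_var are
    equal-length sequences of floats (posterior mean and log-variance).
    """
    return 0.5 * sum(m * m + math.exp(lv) - 1.0 - lv for m, lv in zip(mu, log_var))


# Posterior identical to the prior: the KL term vanishes.
collapsed = gaussian_kl([0.0, 0.0], [0.0, 0.0])

# Posterior mean shifted away from the prior: the KL term is positive.
informative = gaussian_kl([1.0, -1.0], [0.0, 0.0])

print(collapsed, informative)
```

A KL term that stays near zero throughout training is the usual symptom of this problem; the abstract's BN-plus-Scalar layer is one way of keeping it away from zero.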

E-VAN: Enhanced Variational AutoEncoder Network for Mitigating Gender Bias in Static Word Embeddings

Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval

Authors: Swati Tyagi; Jiaheng Xie; Rick Andrews

Affiliation: University of Delaware, Lerner College of Business and Economics, USA

ISBN: (print) 9781450397629

Recent research has shown that pre-trained context-independent word embeddings display biases such as racial bias, gender bias, etc. Using a novel, tunable algorithm, this study attempts to mitigate the hidden gender bias in static embeddings. In order to train the model, an enhanced variational autoencoder (E-VAN) is used to learn the latent space of the embedding. Then the latent distributions are used while adaptively resampling and re-weighting the rare/under-represented data. While the word embeddings retain semantic information, E-VAN effectively mitigates unwanted biased gendered associations. Our method E-VAN outperforms previous state-of-the-art methods in both quantitative and human evaluation.

Keywords: Discriminator Gender Bias Natural Language Processing Semi-Supervised Learning variational autoencoder Word Embedding

Text Generation with Syntax-Enhanced Variational Autoencoder

International Joint Conference on Neural Networks (IJCNN)

Authors: Yuan, Weijie; Ding, Linyi; Meng, Kui; Liu, Gongshen

Affiliation: Shanghai Jiao Tong Univ Sch Elect Informat & Elect Engn Shanghai Peoples R China

ISBN: (print) 9780738133669

Text generation is one of the essential yet challenging tasks in natural language processing. However, the input text alone usually cannot provide enough information to generate the desired output. Previous work attempts to incorporate syntactic information into generative models based on the variational autoencoder (VAE), but these methods have difficulty in adequately modeling the tree structure of syntactic data. In this paper, we formulate the syntactic structure as a graph and introduce a syntax encoder based on a graph neural network (GNN) to model the syntactic information of sentences. Based on the syntax encoder, we propose a novel syntax-enhanced variational autoencoder (SEVAE) with two variants. The variant SEVAE-m merges sentence information and syntactic information into one latent space to enrich the fine-grained syntactic information of latent representations. The variant SEVAE-s, with two separate latent spaces, allows the sentence decoder to dynamically attend to semantic and syntactic information from two latent variables. Experiments on two benchmark datasets show that our methods achieve significant and consistent improvements compared with previous work.

Keywords: text generation variational autoencoder syntactic modeling attention mechanism

Early Detection of Rotor Faults in Large Hydrogenerators using vibration measurements, variational autoencoder, and Euclidean distance

IEEE Transactions on Industry Applications, 2025

Authors: Ibrahim, Rony; Zemouri, Ryad; Tahan, Antoine; Kedjar, Bachir; Merkhouf, Arezki; Al-Haddad, Kamal

Affiliations: École de Technologie Supérieure Montréal H3C 1K3 QC Canada; Centre de Recherche d'Hydro-Québec (CRHQ) Varennes J3X 1S1 QC Canada

In this paper, the authors present an Artificial Intelligence (AI) based variational autoencoder (VAE) technique for detecting rotor faults in a large hydrogenerator. The proposed technique is applied to assess health monitoring and classification capabilities on an existing 74 MW, 76-pole hydrogenerator. The study uses real vibratory data collected in situ from a healthy machine and faulty signals generated using a Finite Element Model (FEM), which are combined with the healthy signals to constitute a database used for the AI model. The model is then trained and validated using only a single severity pattern of each fault. The proposed method is tested using a third dataset, which revealed the technique's proficiency in clustering various health cases within the model's latent space, a user-friendly 2D space. Furthermore, a novel health monitoring metric based on the squared Euclidean distance in the latent space, as well as the statistical law (χ²₂), is presented. The obtained results show the model's ability to detect rotor anomalies at early stages of their occurrence, thus underscoring its advantages in health monitoring. © 1972-2012 IEEE.

Keywords: Broken Damper Bar Diagnosis Eccentricity Fault Detection Health Monitoring Large Hydrogenerators Rotor Inter-Turn Short Circuit variational autoencoder
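
The health metric mentioned in the abstract above pairs a squared Euclidean distance in a 2D latent space with a chi-squared law with 2 degrees of freedom. The sketch below shows one way such a pairing can work, under the assumption (made here, not stated in the record) that latent codes of a healthy machine are approximately standard normal around a known centre. In that case the squared distance follows a chi-squared distribution with 2 degrees of freedom, whose quantile has the closed form -2 ln(1 - p). All function names and the default confidence level are hypothetical.

```python
import math


def chi2_2dof_threshold(confidence):
    """Quantile of the chi-squared distribution with 2 degrees of freedom.

    For 2 degrees of freedom the CDF is 1 - exp(-x / 2), so the quantile
    at probability p is -2 * ln(1 - p).
    """
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must lie strictly between 0 and 1")
    return -2.0 * math.log(1.0 - confidence)


def squared_distance(z, center=(0.0, 0.0)):
    """Squared Euclidean distance of a 2D latent point from the healthy centre."""
    return (z[0] - center[0]) ** 2 + (z[1] - center[1]) ** 2


def is_anomalous(z, confidence=0.99, center=(0.0, 0.0)):
    """Flag a latent point whose squared distance exceeds the chi-squared quantile.

    Assumes healthy latent codes are roughly N(center, I); this is an
    illustrative assumption, not the published method.
    """
    return squared_distance(z, center) > chi2_2dof_threshold(confidence)


print(chi2_2dof_threshold(0.95))   # 95% quantile of the 2-dof chi-squared law
print(is_anomalous((0.5, 0.5)))    # point close to the healthy centre
print(is_anomalous((4.0, 4.0)))    # point far from the healthy centre
```

The confidence level trades false alarms against detection delay: a higher value enlarges the healthy region in the latent plane.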

Improvement of thermal comfort for underground space: Data enhancement using variational autoencoder

BUILDING AND ENVIRONMENT, 2022, vol. 207, Part A, p. 108457

Authors: Qiao, Renlu; Li, Xiangyu; Gao, Shuo; Ma, Xiwen

Affiliations: Tongji Univ 1239 Siping Rd Shanghai Peoples R China; Beijing Univ Technol Fac Architecture Civil & Transportat Engn Beijing Peoples R China; Univ Oxford 11a Mansfield Rd Oxford OX1 3SZ England; Jackson & Ryan Architects 2370 Rice Blvd Ste 210 Houston TX 77005 USA

The proportion of buildings occupying underground space has increased with three-dimensional urban development. Thermal comfort is crucial to the design of underground spaces and plays an important role in the optimization of building environment controls. Owing to limitations in recording various practical environmental parameters, it is difficult to access large data and further to establish an accurate forecasting model for the thermal comfort of an underground space. This paper addresses the problem from the perspective of data enhancement. A model for generating underground space data based on a variational autoencoder is proposed. The model maps data of the thermal comfort of an underground space to a highly compressed latent layer space and generates data in an unsupervised manner. The forecasting models were trained using the generated data, resulting in accuracy improvements of 41.34%-45.31%. Hence, the proposed generative model can learn effective real data features. The results also demonstrate that the adjustment of ventilation is more effective than the adjustment of the temperature and relative humidity in improving the thermal comfort of an underground space. The findings of this research will provide better thermal comfort evaluation for the operational management of building environment in underground spaces.

Keywords: Underground space Thermal comfort Data augmentation variational autoencoder Forecasting model

An Application of Geometric Aspects of Variational Autoencoder Model to Forgery Detection of Scanned Documents

13th International Conference on Machine Vision

Authors: Janiszewski, Igor; Slugin, Dmitry; Andreeva, Elena

Affiliations: Russian Acad Sci Fed Res Ctr Comp Sci & Control Moscow Russia; Smart Engines Serv LLC Moscow Russia

ISBN: (print) 9781510640412

The paper proposes an approach for matching digitized copies of business documents. This task arises when comparing two versions of the same document - genuine and forgery - to find possible modifications, for example in the banking sector, where contracts concluded in paper form must be checked to avoid possible fraud. The method matches two documents by comparing images of text lines using a variational autoencoder (VAE) trained on genuine images and by calculating the Fisher information metric to find modifications. Experiments were conducted on the public Payslips dataset (in French). The results show the high quality and reliability of finding document forgeries and are compared to the results of a method that applies OCR and image matching.

Keywords: Forgery detection documents matching variational autoencoder Fisher information metric

Application of domain-adaptive convolutional variational autoencoder for stress-state prediction

KNOWLEDGE-BASED SYSTEMS, 2022, vol. 248, p. 1

Authors: Lee, Sang Min; Park, Sang-Youn; Choi, Byoung-Ho

Affiliation: Korea Univ Coll Engn Sch Mech Engn Seoul South Korea

Applying data-driven methods such as deep learning in material mechanics is challenging because producing a sufficiently large, labeled dataset is costly resource-wise. This paper outlines a new approach to overcoming this difficulty by transferring knowledge from a source domain of finite-element-analysis data to a target domain of real-world test-specimen images so that a model capable of accurate and robust predictions in both domains may be constructed. To achieve this transfer of knowledge, discrepancy-based unsupervised domain adaptation is adopted into a convolutional variational autoencoder structure. To evaluate the proposed approach, a four-point bending experiment was conducted on 6061 aluminum alloy and 316 stainless steel to produce 550 unlabeled target-domain data images. The same bending situation was analyzed using the finite-element method implemented in the commercial software package ABAQUS to produce 6000 labeled source-domain data images. The proposed domain-adaptive convolutional variational autoencoder was trained using the maximum mean discrepancy method on the target- and source-domain data. The predictions using the domain-adapted convolutional variational autoencoder were relatively more accurate than those using the model trained only on the source domain. It is expected that the proposed approach can address the scarcity of labeled data in various applications of material mechanics and provide a base technology for the development of various data-driven approaches. (C) 2022 Elsevier B.V. All rights reserved.

Keywords: Unsupervised domain adaptation Stress analysis Four-point bending variational autoencoder Deep learning Convolutional neural network
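
The abstract above names maximum mean discrepancy (MMD) as the quantity used to align source-domain and target-domain data. As a rough illustration of what that quantity measures, the sketch below implements the textbook biased estimator of squared MMD with a Gaussian (RBF) kernel in plain Python. The kernel choice, the `gamma` bandwidth, the toy feature vectors and the function names are assumptions made here; the record does not describe the authors' actual implementation.

```python
import math


def rbf_kernel(a, b, gamma=1.0):
    """Gaussian (RBF) kernel exp(-gamma * ||a - b||^2) between two vectors."""
    return math.exp(-gamma * sum((p - q) ** 2 for p, q in zip(a, b)))


def mmd_squared(xs, ys, gamma=1.0):
    """Biased estimator of squared MMD between two samples of feature vectors.

    MMD^2 = mean k(x, x') + mean k(y, y') - 2 * mean k(x, y).
    It is zero when both samples coincide and grows as they separate.
    """
    k_xx = sum(rbf_kernel(a, b, gamma) for a in xs for b in xs) / (len(xs) ** 2)
    k_yy = sum(rbf_kernel(a, b, gamma) for a in ys for b in ys) / (len(ys) ** 2)
    k_xy = sum(rbf_kernel(a, b, gamma) for a in xs for b in ys) / (len(xs) * len(ys))
    return k_xx + k_yy - 2.0 * k_xy


# Toy latent features standing in for source-domain and target-domain encodings.
source = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
target_near = [(0.1, 0.0), (1.0, 0.1), (0.0, 0.9)]
target_far = [(5.0, 5.0), (6.0, 5.0), (5.0, 6.0)]

print(mmd_squared(source, source))       # identical samples
print(mmd_squared(source, target_near))  # small shift between domains
print(mmd_squared(source, target_far))   # large shift between domains
```

In discrepancy-based domain adaptation, a term of this kind is typically added to the training loss so that minimizing it pulls the latent features of the two domains together.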
