检索结果-内蒙古大学图书馆

siamese autoencoder Architecture for the Imputation of Data Missing Not at Random

JOURNAL OF COMPUTATIONAL SCIENCE 2024年 78卷

作者： Pereira, Ricardo Cardoso Abreu, Pedro Henriques Rodrigues, Pedro Pereira Univ Coimbra Ctr Informat & Syst Dept Informat Engn P-3030290 Coimbra Portugal Univ Porto Fac Med MEDCIDS Ctr Hlth Technol & Serv Res P-4200319 Porto Portugal

Missing data is an issue that can negatively impact any task performed with the available data and it is often found in real -world domains such as healthcare. One of the most common strategies to address this issue is to perform imputation, where the missing values are replaced by estimates. Several approaches based on statistics and machine learning techniques have been proposed for this purpose, including deep learning architectures such as generative adversarial networks and autoencoders. In this work, we propose a novel siamese neural network suitable for missing data imputation, which we call siamese autoencoder-based Approach for Imputation (SAEI). Besides having a deep autoencoder architecture, SAEI also has a custom loss function and triplet mining strategy that are tailored for the missing data issue. The proposed SAEI approach is compared to seven state-of-the-art imputation methods in an experimental setup that comprises 14 heterogeneous datasets of the healthcare domain injected with Missing Not At Random values at a rate between 10% and 60%. The results show that SAEI significantly outperforms all the remaining imputation methods for all experimented settings, achieving an average improvement of 35%. This work is an extension of the article siamese autoencoder-Based Approach for Missing Data Imputation [1] presented at the International Conference on Computational Science 2023. It includes new experiments focused on runtime, generalization capabilities, and the impact of the imputation in classification tasks, where the results show that SAEI is the imputation method that induces the best classification results, improving the F1 scores for 50% of the used datasets.

关键词： Missing data Imputation siamese autoencoder Missing Not at Random

来源：评论

学校读者我要写书评

暂无评论

Generalizable Sample-Efficient siamese autoencoder for Tinnitus Diagnosis in Listeners With Subjective Tinnitus

引用

IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING 2021年 29卷 1452-1461页

作者： Liu, Zhe Yao, Lina Wang, Xianzhi Monaghan, Jessica J. M. Schaette, Roland He, Zihuai McAlpine, David Univ New South Wales Sch Comp Sci & Engn Sydney NSW 2052 Australia Univ Technol Sydney Sch Comp Sci Ultimo NSW 2007 Australia Natl Acoust Labs Macquarie Pk NSW 2113 Australia Macquarie Univ Dept Linguist Sydney NSW 2109 Australia Univ Coll London UCL Ear Inst London WC1X 8EE England Stanford Univ Sch Med Stanford CA 94305 USA

Electroencephalogram (EEG)-based neurofeedback has been widely studied for tinnitus therapy in recent years. Most existing research relies on experts' cognitive prediction, and studies based on machine learning and deep learning are either data-hungry or not well generalizable to new subjects. In this paper, we propose a robust, data-efficient model for distinguishing tinnitus from the healthy state based on EEG-based tinnitus neurofeedback. We propose trend descriptor, a feature extractor with lower fineness, to reduce the effect of electrode noises on EEG signals, and a siamese encoder-decoder network boosted in a supervised manner to learn accurate alignment and to acquire high-quality transferable mappings across subjects and EEG signal channels. Our experiments show the proposed method significantly outperforms state-of-the-art algorithms when analyzing subjects' EEG neurofeedback to 90dB and 100dB sound, achieving an accuracy of 91.67%-94.44% in predicting tinnitus and control subjects in a subject-independent setting. Our ablation studies on mixed subjects and parameters show the method's stability in performance.

关键词： Electroencephalography Brain modeling Market research Training Neurofeedback Medical treatment Auditory system EEG subject-independent siamese autoencoder domain alignment trend descriptor tinnitus

来源：评论

学校读者我要写书评

暂无评论

Industrial Process Fault Detection Based on siamese Recurrent autoencoder

引用

COMPUTERS & CHEMICAL ENGINEERING 2025年 192卷

作者： Ji, Cheng Ma, Fangyuan Wang, Jingde Sun, Wei Palazoglu, Ahmet Beijing Univ Chem Technol Coll Chem Engn North Third Ring Rd 15 Beijing 100029 Peoples R China Tsinghua Univ Wuxi Res Inst Appl Technol Ctr Proc Monitoring & Data Anal Wuxi 214072 Peoples R China Univ Calif Davis Dept Chem Engn Davis CA 95616 USA

Although deep autoencoders excel at extracting intricate features, their application in process monitoring is limited by the requirement for large sample sizes and interpretability of latent representations. This work presents a special deep learning structure named siamese network to detect abnormal deviations in nonlinear dynamic processes. By leveraging the capability of siamese architecture to process multiple inputs simultaneously, the training sample size expands exponentially, which enhances the learning potential of the model. Furthermore, a long short-term memory unit is integrated to enable the capture of long-term process dynamics. To refine the distribution of latent features extracted from diverse data types, a contrastive loss function is proposed, which strengthens the model's fault detection capabilities and enhances its interpretation of latent representations. Then T2 statistic is established on the latent space to perform fault detection. The effectiveness of the method is demonstrated through case studies on simulation processes and an industrial process.

关键词： Chemical process monitoring Process safety siamese autoencoder Long short-term memory unit Contrastive loss Wax oil hydrogenation reactor

来源：评论

学校读者我要写书评

暂无评论

Semantic Preserving siamese autoencoder for Binary Quantization of Word Embeddings 21

Semantic Preserving Siamese Autoencoder for Binary Quantizat...

引用

Proceedings of the 2021 5th International Conference on Natural Language Processing and Information Retrieval

作者： Wouter Mostard Lambert Schomaker Marco Wiering Bernoulli Institute for Mathematics and Computer Science University of Groningen Netherlands

ISBN: (纸本)9781450387354

Word embeddings are used as building blocks for a wide range of natural language processing and information retrieval tasks. These embeddings are usually represented as continuous vectors, requiring significant memory capacity and computationally expensive similarity measures. In this study, we introduce a novel method for semantic hashing continuous vector representations into lower-dimensional Hamming space while explicitly preserving semantic information between words. This is achieved by introducing a siamese autoencoder combined with a novel semantic preserving loss function. We show that our quantization model induces only a 4% loss of semantic information over continuous representations and outperforms the baseline models on several word similarity and sentence classification tasks. Finally, we show through cluster analysis that our method learns binary representations where individual bits hold interpretable semantic information. In conclusion, binary quantization of word embeddings significantly decreases time and space requirements while offering new possibilities through exploiting semantic information of individual bits in downstream information retrieval tasks.

关键词： Representation learning Semantic hashing siamese autoencoder

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：