arXiv

OPTIMIZED TWO-STAGE AI-BASED NEURAL DECODING FOR ENHANCED VISUAL STIMULUS RECONSTRUCTION FROM FMRI DATA

Authors: Veronese, Lorenzo; Moglia, Andrea; Mainardi, Luca; Cerveri, Pietro

Author affiliations: Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milan, Italy; Università di Pavia, Pavia, Italy

Publication: arXiv

Year: 2024

Subject: Sensitivity analysis

Abstract: AI-based neural decoding reconstructs visual perception by using generative models to map brain activity, measured through functional MRI (fMRI), into latent hierarchical representations. Traditionally, ridge linear models transform fMRI into a latent space, which is then decoded using latent diffusion models (LDM) via a pre-trained variational autoencoder (VAE). Because fMRI data are complex and noisy, newer approaches split the reconstruction into two sequential steps: the first provides a rough visual approximation, and the second improves the stimulus prediction via an LDM endowed with CLIP embeddings. This work proposes a non-linear deep network to improve the fMRI latent space representation while also optimizing its dimensionality. Experiments on the Natural Scenes Dataset showed that the proposed architecture improved the structural similarity of the reconstructed image by about 2% with respect to the state-of-the-art model, which is based on a ridge linear transform. The semantics of the reconstructed image, measured by perceptual similarity, improved by about 4% with respect to the state of the art. The noise sensitivity analysis of the LDM showed that the first stage was fundamental to predicting a stimulus with high structural similarity. Conversely, providing a large noise stimulus affected the semantics of the predicted stimulus less, while the structural similarity between the ground truth and the predicted stimulus was very poor. The findings underscore the importance of leveraging non-linear relationships between the BOLD signal and the latent representation, together with two-stage generative AI, for optimizing the fidelity of visual stimuli reconstructed from noisy fMRI data. © 2024, CC BY-SA.
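The ridge linear baseline mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the array sizes, the regularization strength `alpha`, the synthetic data, and the helper name `fit_ridge` are all assumptions chosen for illustration, and the model omits an intercept term for brevity.

```python
import numpy as np

def fit_ridge(X, Y, alpha=1.0):
    """Closed-form ridge regression (no intercept): W = (X^T X + alpha I)^-1 X^T Y."""
    n_features = X.shape[1]
    gram = X.T @ X + alpha * np.eye(n_features)
    # Solve the regularized normal equations instead of forming an explicit inverse.
    return np.linalg.solve(gram, X.T @ Y)

rng = np.random.default_rng(0)

# Toy sizes, hypothetical and far smaller than real fMRI or latent dimensions.
n_trials, n_voxels, latent_dim = 200, 50, 8

# Synthetic stand-ins: X plays the role of fMRI responses, Y of target latent vectors.
true_W = rng.normal(size=(n_voxels, latent_dim))
X = rng.normal(size=(n_trials, n_voxels))
Y = X @ true_W + 0.1 * rng.normal(size=(n_trials, latent_dim))

W = fit_ridge(X, Y, alpha=1.0)   # linear map from voxel space to latent space
Y_pred = X @ W                   # predicted latents, which a decoder would consume
```

In a pipeline of the kind the abstract describes, the predicted latents would be passed to a generative decoder; the proposed work replaces this linear map with a non-linear deep network.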
