Search Results - Inner Mongolia University Library

Object sequences: encoding categorical and spatial information for a yes/no visual question answering task

IET Computer Vision, 2018, vol. 12, no. 8, pp. 1141-1150

Authors: Garg, Shivam; Srivastava, Rajeev. Affiliation: Indian Inst Technol BHU, Dept Comp Sci & Engn, Varanasi 221005, Uttar Pradesh, India

The task of visual question answering (VQA) has gained wide popularity in recent times. Effectively solving the VQA task requires the understanding of both the visual content in the image and the language information associated with the text-based question. In this study, the authors propose a novel method of encoding the visual information (categorical and spatial object information) of all the objects present in the image into a sequential format, which is called an object sequence. These object sequences can then be suitably processed by a neural network. They experiment with multiple techniques for obtaining a joint embedding from the visual features (in the form of object sequences) and language-based features obtained from the question. They also provide a detailed analysis on the performance of a neural network architecture using object sequences, on the Oracle task of GuessWhat dataset (a Yes/No VQA task) and benchmark it against the baseline.

Keywords: image sequences; question answering (information retrieval); text analysis; image coding; neural net architecture; object sequences; spatial object information encoding; categorical object information encoding; yes-no visual question answering task; VQA task; language information; text-based question; visual information encoding; visual features; language-based features; neural network architecture; GuessWhat dataset; Oracle task

Local receptive field constrained deep networks

Information Sciences, 2016, vol. 349, pp. 229-247

Authors: Turcsany, Diana; Bargiela, Andrzej; Maul, Tomas. Affiliations: Univ Nottingham, Sch Comp Sci, Nottingham NG8 1BB, England; Infohub Ltd, Nottingham Sci & Technol Pk, Nottingham NG7 2QJ, England; Univ Nottingham, Sch Comp Sci, Malaysia Campus, Semenyih 43500, Malaysia

Automatic extraction of distinctive features from a visual information stream is challenging due to the large amount of information contained in most image data. In recent years deep neural networks (DNNs) have gained outstanding popularity for solving visual information processing tasks. This study reports novel contributions, including a new DNN architecture and training method, which increase the fidelity of DNN-based representations to encodings extracted by visual processing neurons. Our local receptive field constrained DNNs (LRF-DNNs) are pre-trained with a modified restricted Boltzmann machine, the LRF-RBM, which utilizes biologically inspired Gaussian receptive field constraints to encourage the emergence of local features. Moreover, we propose a method for concurrently finding advantageous receptive field centers, while training the LRF-RBM. By utilizing LRF-RBMs with gradually increasing receptive field sizes on each layer, our LRF-DNN learns features of increasing complexity and demonstrates hierarchical part-based compositionality. We show superior face completion and reconstruction results on the challenging LFW face dataset. (C) 2016 Elsevier Inc. All rights reserved.

Keywords: visual information encoding; Local receptive field learning; Feature hub; Deep autoencoder neural network; Self-adaptive structure; Face completion

Object Categories Specific Brain Activity Classification with Simultaneous EEG-fMRI

37th Annual International Conference of the IEEE-Engineering-in-Medicine-and-Biology-Society (EMBC)

Authors: Ahmad, Rana Fayyaz; Malik, Aamir Saeed; Kamel, Nidal; Reza, Faruque. Affiliations: Univ Teknol Petronas, CISIR, Dept Elect & Elect Engn, Tronoh 31750, Malaysia; Univ Sains Malaysia, Dept Neurosci, Kota Baharu 16150, Malaysia

ISBN: (print) 9781424492701

Any kind of visual information is encoded as patterns of neural activity in the brain. Decoding or classifying these neural patterns is a challenging task. Functional magnetic resonance imaging (fMRI) and electroencephalography (EEG) are non-invasive neuroimaging modalities that capture brain activity patterns in terms of images and electric potentials, respectively. Simultaneous EEG-fMRI can help obtain higher spatiotemporal resolution of the human brain from these two complementary neuroimaging modalities. In this paper, we propose a framework for classifying brain activity patterns with simultaneous EEG-fMRI. We acquired data from five human participants with simultaneous EEG-fMRI while showing them different object categories. A combined analysis of the EEG and fMRI data was then carried out. The information extracted through the combined analysis is passed to a support vector machine (SVM) classifier for classification. We achieved better classification accuracy using simultaneous EEG-fMRI, i.e., 81.8%, than with fMRI data alone. This shows that multimodal neuroimaging can improve the classification accuracy of brain activity patterns compared with the individual modalities reported in the literature.

Keywords: bioelectric potentials; biomedical MRI; electroencephalography; image classification; image coding; image resolution; neurophysiology; spatiotemporal phenomena; support vector machines; electric potential; functional magnetic resonance imaging; multimodal neuroimaging; neural activity pattern decoding; spatiotemporal resolution; specific brain activity classification; support vector machine classifier; visual information encoding; Accuracy; Brain; Data acquisition; Electroencephalography; Neuroimaging; Support vector machines; Visualization
