We present two-layered neural network models with Q (≥ 2)-state neurons for a system of middle temporal (MT) neurons and medial superior temporal (MST) neurons, using the wake-sleep algorithm proposed by Hinton et al.; we note that the wake-sleep algorithm consists of local learning rules. We first investigate a model with binary neurons for the response properties of the MST neurons to optical flows for various types of motion. We next extend the model with binary neurons to a model with Q (≥ 3)-state neurons and investigate the response properties of the MST neurons for various values of Q (≥ 3). We obtain better response properties for the model with Q (≥ 3)-state neurons than for the one with binary neurons. (C) 2003 Elsevier Ltd. All rights reserved.
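For the binary (Q = 2) case, the locality of the learning rules can be made concrete with a minimal sketch of a wake-sleep update for a two-layer network of binary stochastic neurons: a hidden ("MST-like") layer driven by a visible ("MT-like") layer. All names, sizes, and the uniform sleep prior below are illustrative assumptions, not the paper's exact model.

```python
# Minimal wake-sleep sketch for a two-layer binary stochastic network.
# Each update uses only pre- and post-synaptic activities (a local delta rule).
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sample_binary(p):
    return (rng.random(p.shape) < p).astype(float)

n_vis, n_hid, lr = 32, 8, 0.05
W_gen = rng.normal(0, 0.1, (n_hid, n_vis))   # generative weights (hidden -> visible)
W_rec = rng.normal(0, 0.1, (n_vis, n_hid))   # recognition weights (visible -> hidden)

def wake_sleep_step(v_data):
    global W_gen, W_rec
    # Wake phase: drive hidden units with the recognition model,
    # then update the generative weights with a local delta rule.
    h = sample_binary(sigmoid(v_data @ W_rec))
    v_pred = sigmoid(h @ W_gen)
    W_gen += lr * np.outer(h, v_data - v_pred)

    # Sleep phase: "dream" a visible pattern from the generative model,
    # then update the recognition weights with the analogous local rule.
    h_dream = sample_binary(np.full(n_hid, 0.5))   # assumed uniform prior over hidden units
    v_dream = sample_binary(sigmoid(h_dream @ W_gen))
    h_pred = sigmoid(v_dream @ W_rec)
    W_rec += lr * np.outer(v_dream, h_dream - h_pred)

for _ in range(1000):
    wake_sleep_step(sample_binary(np.full(n_vis, 0.3)))
```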
We employ statistical dynamics to study the convergence of the wake-sleep (W-S) algorithm, a learning algorithm for neural network models with hidden units. Although there have been several experimental reports on the effectiveness of the W-S algorithm, its theoretical behavior is not clear even for simple networks. In this paper, we investigate the dynamic characteristics of the W-S algorithm applied to a single-factor analysis problem, the simplest such setting. The advantage of our approach is that it allows a quantitative evaluation of the effect the learning coefficients have on convergence, which is difficult with other methods. We find that the settings of the learning coefficients, particularly in the sleep step, have a substantial effect on the convergence of the algorithm. (C) 2001 Scripta Technica.
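The single-factor setting can be probed numerically with a sketch like the one below, which runs wake-sleep on a one-dimensional factor-analysis model with separate learning coefficients for the wake and sleep steps. The teacher weight, noise level, and coefficient values are assumptions chosen for illustration, not the paper's analytical setting.

```python
# Wake-sleep dynamics for a single-factor (1-D) factor-analysis model.
# Varying eta_sleep relative to eta_wake lets one observe how the sleep-step
# coefficient influences convergence of the recognition weight.
import numpy as np

rng = np.random.default_rng(1)

g_true, sigma = 2.0, 0.5          # "teacher" generative weight and noise level
g, r = 0.1, 0.1                   # student generative and recognition weights
eta_wake, eta_sleep = 0.01, 0.01  # learning coefficients for the two steps

for t in range(20000):
    # Wake step: observe data, infer the factor with the recognition model,
    # and move the generative weight toward reconstructing the observation.
    x = g_true * rng.normal() + sigma * rng.normal()
    y_hat = r * x + sigma * rng.normal()
    g += eta_wake * y_hat * (x - g * y_hat)

    # Sleep step: dream (y, x) from the generative model and move the
    # recognition weight toward recovering the dreamed factor.
    y = rng.normal()
    x_dream = g * y + sigma * rng.normal()
    r += eta_sleep * x_dream * (y - r * x_dream)

print(f"learned g = {g:.2f} (teacher {g_true}), learned r = {r:.2f}")
```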
We study the natural gradient method for learning in deep Bayesian networks, including neural networks. There are two natural geometries associated with such learning systems consisting of visible and hidden units. One geometry is related to the full system, the other to the visible sub-system. These two geometries imply different natural gradients. As a first step, we demonstrate a great simplification of the natural gradient with respect to the first geometry, due to locality properties of the Fisher information matrix. This simplification does not directly translate to a corresponding simplification with respect to the second geometry. We develop the theory for studying the relation between the two versions of the natural gradient and outline a method for simplifying the natural gradient with respect to the second geometry based on the first one. This method suggests incorporating a recognition model as an auxiliary model for the efficient application of the natural gradient method in deep networks.
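For orientation, the generic natural-gradient step that both geometries specialize is a Fisher-preconditioned gradient. The sketch below shows it for a small categorical softmax model; the model, loss, and damping constant are illustrative choices, not the paper's deep-network construction.

```python
# Natural-gradient descent on a categorical softmax model:
# theta <- theta - lr * F(theta)^{-1} grad, with F the Fisher information.
import numpy as np

def softmax(theta):
    e = np.exp(theta - theta.max())
    return e / e.sum()

def fisher(theta):
    # Fisher information of the categorical softmax model:
    # F = diag(p) - p p^T, plus a small damping term for invertibility.
    p = softmax(theta)
    return np.diag(p) - np.outer(p, p) + 1e-6 * np.eye(theta.size)

def nat_grad_step(theta, target, lr=0.5):
    # Ordinary gradient of the cross-entropy to a target distribution,
    # preconditioned with the inverse Fisher matrix.
    grad = softmax(theta) - target
    return theta - lr * np.linalg.solve(fisher(theta), grad)

theta = np.zeros(4)
target = np.array([0.7, 0.1, 0.1, 0.1])
for _ in range(50):
    theta = nat_grad_step(theta, target)
print(softmax(theta))
```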
ISBN (Print): 9783319700878; 9783319700861
Variational Autoencoders (VAEs) are known to easily suffer from the KL-vanishing problem when combined with powerful autoregressive models such as recurrent neural networks (RNNs), which limits their application in natural language processing. In this paper, we tackle this problem by splitting the training procedure into two steps: learning effective mechanisms to encode and decode discrete tokens (wake step) and learning meaningful latent variables by reconstructing dreamed encodings (sleep step). The training pattern is similar to the wake-sleep algorithm: the two steps are trained alternately until an equilibrium is reached. We test our model on a language modeling task. The results demonstrate significant improvement over current state-of-the-art latent variable models.
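A rough sketch of this alternating wake/sleep training scheme is given below, using toy MLP encoder/decoder modules on continuous "encodings" in place of the paper's RNN components over discrete tokens. All module names, sizes, losses, and the shared optimizer are assumptions for illustration.

```python
# Alternating wake/sleep training: the wake step fits the encoder/decoder to
# real encodings; the sleep step samples latents from the prior, "dreams"
# encodings with the decoder, and trains the encoder to recover the latents.
import torch
import torch.nn as nn

enc_dim, lat_dim = 16, 4
encoder = nn.Sequential(nn.Linear(enc_dim, 32), nn.Tanh(), nn.Linear(32, lat_dim))
decoder = nn.Sequential(nn.Linear(lat_dim, 32), nn.Tanh(), nn.Linear(32, enc_dim))
opt = torch.optim.Adam(list(encoder.parameters()) + list(decoder.parameters()), lr=1e-3)

def wake_step(x_enc):
    # Wake step: reconstruct real encodings so the latent variable stays useful.
    z = encoder(x_enc)
    loss = ((decoder(z) - x_enc) ** 2).mean()
    opt.zero_grad(); loss.backward(); opt.step()

def sleep_step(batch_size=32):
    # Sleep step: dreamed encodings are detached, so only the encoder is
    # updated toward recovering the sampled latent variables.
    z = torch.randn(batch_size, lat_dim)
    dreamed = decoder(z).detach()
    loss = ((encoder(dreamed) - z) ** 2).mean()
    opt.zero_grad(); loss.backward(); opt.step()

for epoch in range(200):          # alternate until an approximate equilibrium
    wake_step(torch.randn(32, enc_dim))
    sleep_step()
```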