检索结果-内蒙古大学图书馆

arXiv 2018年

作者： Ke, Nan Rosemary Zolna, Konrad Sordoni, Alessandro Lin, Zhouhan Trischler, Adam Bengio, Yoshua Pineau, Joelle Charlin, Laurent Pal, Chris Montreal Institute for Learning Algorithms Montreal Canada Polytechnique Montreal MontréalQuébec Canada Microsoft Research Montreal Canada Jagiellonian University Cracow Poland AdeptMind Scholar University of Montreal MontréalQuébec Canada Senior Cifar Member McGill University MontréalQuébec Canada Facebook AI Research Montreal HEC Montreal Canada

Recurrent Neural Networks (RNNs) with attention mechanisms have obtained state-of-the-art results for many sequence processing tasks. Most of these models use a simple form of encoder with attention that looks over the entire sequence and assigns a weight to each token independently. We present a mechanism for focusing RNN encoders for sequence modelling tasks which allows them to attend to key parts of the input as needed. We formulate this using a multilayer conditional sequence encoder that reads in one token at a time and makes a discrete decision on whether the token is relevant to the context or question being asked. The discrete gating mechanism takes in the context embedding and the current hidden state as inputs and controls information flow into the layer above. We train it using policy gradient methods. We evaluate this method on several types of tasks with different attributes. First, we evaluate the method on synthetic tasks which allow us to evaluate the model for its generalization ability and probe the behavior of the gates in more controlled settings. We then evaluate this approach on large scale Question Answering tasks including the challenging MS MARCO and SearchQA tasks. Our models shows consistent improvements for both tasks over prior work and our baselines. It has also shown to generalize significantly better on synthetic tasks as compared to the baselines. Copyright © 2018, The Authors. All rights reserved.

关键词： Recurrent neural networks

来源：评论

学校读者我要写书评

暂无评论

MovieGraphs: Towards understanding human-centric situations from videos

arXiv

引用

arXiv 2017年

作者： Vicol, Paul Tapaswi, Makarand Castrejón, Lluís Fidler, Sanja University of Toronto Vector Institute Montreal Institute for Learning Algorithms

There is growing interest in artificial intelligence to build socially intelligent robots. This requires machines to have the ability to "read" people's emotions, motivations, and other factors that affect behavior. Towards this goal, we introduce a novel dataset called MovieGraphs which provides detailed, graph-based annotations of social situations depicted in movie clips. Each graph consists of several types of nodes, to capture who is present in the clip, their emotional and physical attributes, their relationships (i.e., parent/child), and the interactions between them. Most interactions are associated with topics that provide additional details, and reasons that give motivations for actions. In addition, most interactions and many attributes are grounded in the video with time stamps. We provide a thorough analysis of our dataset, showing interesting common-sense correlations between different social aspects of scenes, as well as across scenes over time. We propose a method for querying videos and text with graphs, and show that: 1) our graphs contain rich and sufficient information to summarize and localize each scene;and 2) subgraphs allow us to describe situations at an abstract level and retrieve multiple semantically relevant situations. We also propose methods for interaction understanding via ordering, and reason understanding. MovieGraphs is the first benchmark to focus on inferred properties of human-centric situations, and opens up an exciting avenue towards socially-intelligent AI agents. Copyright © 2017, The Authors. All rights reserved.

关键词： Graphic methods

来源：评论

学校读者我要写书评

暂无评论

Image segmentation by iterative inference from conditional score estimation

arXiv

引用

arXiv 2017年

作者： Romero, Adriana Drozdzal, Michal Erraqabi, Akram Jégou, Simon Bengio, Yoshua Montreal Institute for Learning Algorithms MontrealQC Canada Imagia Cybernetics MontrealQC Canada

Inspired by the combination of feedforward and iterative computations in the visual cortex, and taking advantage of the ability of denoising autoencoders to estimate the score of a joint distribution, we propose a novel approach to iterative inference for capturing and exploiting the complex joint distribution of output variables conditioned on some input variables. This approach is applied to image pixel-wise segmentation, with the estimated conditional score used to perform gradient ascent towards a mode of the estimated conditional distribution. This extends previous work on score estimation by denoising autoencoders to the case of a conditional distribution, with a novel use of a corrupted feedforward predictor replacing Gaussian corruption. An advantage of this approach over more classical ways to perform iterative inference for structured outputs, like conditional random fields (CRFs), is that it is not any more necessary to define an explicit energy function linking the output variables. To keep computations tractable, such energy function parametrizations are typically fairly constrained, involving only a few neighbors of each of the output variables in each clique. We experimentally find that the proposed iterative inference from conditional score estimation by conditional denoising autoencoders performs better than comparable models based on CRFs or those not using any explicit modeling of the conditional joint distribution of outputs. Copyright © 2017, The Authors. All rights reserved.

关键词： Image segmentation

来源：评论

学校读者我要写书评

暂无评论

Plug and play generative networks: Conditional iterative generation of images in latent space 30

Plug and play generative networks: Conditional iterative gen...

引用

30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017

作者： Nguyen, Anh Clune, Jeff Bengio, Yoshua Dosovitskiy, Alexey Yosinski, Jason University of Wyoming United States Uber AI Labs University of Wyoming United States Montreal Institute for Learning Algorithms Canada University of Freiburg Germany Uber AI Labs Canada

ISBN: (纸本)9781538604571

Generating high-resolution, photo-realistic images has been a long-standing goal in machine learning. Recently, Nguyen et al. [37] showed one interesting way to synthesize novel images by performing gradient ascent in the latent space of a generator network to maximize the activations of one or multiple neurons in a separate classifier network. In this paper we extend this method by introducing an additional prior on the latent code, improving both sample quality and sample diversity, leading to a state-of-the-art generative model that produces high quality images at higher resolutions (227 × 227) than previous generative models, and does so for all 1000 ImageNet categories. In addition, we provide a unified probabilistic interpretation of related activation maximization methods and call the general class of models "Plug and Play Generative Networks." PPGNs are composed of 1) a generator network G that is capable of drawing a wide range of image types and 2) a replaceable "condition" network C that tells the generator what to draw. We demonstrate the generation of images conditioned on a class (when C is an ImageNet or MIT Places classification network) and also conditioned on a caption (when C is an image captioning network). Our method also improves the state of the art of Multifaceted Feature Visualization [40], which generates the set of synthetic inputs that activate a neuron in order to better understand how deep neural networks operate. Finally, we show that our model performs reasonably well at the task of image inpainting. While image models are used in this paper, the approach is modality-agnostic and can be applied to many types of data. © 2017 IEEE.

关键词： Chemical activation

来源：评论

学校读者我要写书评

暂无评论

Improved training of wasserstein GANs

arXiv

引用

arXiv 2017年

作者： Gulrajani, Ishaan Ahmed, Faruk Arjovsky, Martin Dumoulin, Vincent Courville, Aaron Montreal Institute for Learning Algorithms Courant Institute of Mathematical Sciences CIFAR Google Brain

Generative Adversarial Networks (GANs) are powerful generative models, but suffer from training instability. The recently proposed Wasserstein GAN (WGAN) makes progress toward stable training of GANs, but sometimes can still generate only poor samples or fail to converge. We find that these problems are often due to the use of weight clipping in WGAN to enforce a Lipschitz constraint on the critic, which can lead to undesired behavior. We propose an alternative to clipping weights: penalize the norm of gradient of the critic with respect to its input. Our proposed method performs better than standard WGAN and enables stable training of a wide variety of GAN architectures with almost no hyperparameter tuning, including 101-layer ResNets and language models with continuous generators. We also achieve high quality generations on CIFAR-10 and LSUN bedrooms. Copyright © 2017, The Authors. All rights reserved.

关键词： Generative adversarial networks

来源：评论

学校读者我要写书评

暂无评论

Machine comprehension by text-to-text neural question generation

arXiv

引用

arXiv 2017年

作者： Yuan, Xingdi Wang, Tong Gulcehre, Caglar Sordoni, Alessandro Bachman, Philip Subramanian, Sandeep Zhang, Saizheng Trischler, Adam Microsoft Maluuba Montreal Institute for Learning Algorithms Université de Montréal

We propose a recurrent neural model that generates natural-language questions from documents, conditioned on answers. We show how to train the model using a combination of supervised and reinforcement learning. After teacher forcing for standard maximum likelihood training, we fine-tune the model using policy gradient techniques to maximize several rewards that measure question quality. Most notably, one of these rewards is the performance of a question-answering system. Our model is trained and evaluated on the recent question-answering dataset SQuAD. Copyright © 2017, The Authors. All rights reserved.

关键词： Maximum likelihood

来源：评论

学校读者我要写书评

暂无评论

*** - Reproducing intuition

arXiv

引用

arXiv 2017年

作者： Cohen, Joseph Paul Lo, Henry Z. Institute for Reproducible Research Montreal Institute for Learning Algorithms Université of Montréal Institute for Reproducible Research

We present ***, a platform for post-publication discussion of research papers. On ***, the research community can read and write summaries of papers in order to increase accessible and reproducibility. Summaries contain the perspective and insight of other readers, why they liked or disliked it, and their attempt to demystify complicated sections. *** has over 600 paper summaries, all of which are searchable and organized by paper, conference, and year. Many regular contributors are expert machine learning researchers. We present statistics from the last year of operation, user demographics, and responses from a usage survey. Results indicate that ShortScience benefits students most, by providing short, understandable summaries reflecting expert opinions. Copyright © 2017, The Authors. All rights reserved.

关键词： Paper

来源：评论

学校读者我要写书评

暂无评论

Multi-Region bilinear convolutional neural networks for person re-identification

Multi-Region bilinear convolutional neural networks for pers...

引用

IEEE Conference on Advanced Video and Signal Based Surveillance (AVSS)

作者： Evgeniya Ustinova Yaroslav Ganin Victor Lempitsky Skolkovo Institute of Science and Technology Moscow Skolkovo Institute of Science and Technology Moscow Montreal Institute for Learning Algorithms Montreal Quebec

In this work we propose a new architecture for person re-identification. As the task of re-identification is inherently associated with embedding learning and non-rigid appearance description, our architecture is based on the deep bilinear convolutional network (Bilinear-CNN) that has been proposed recently for fine-grained classification of highly non-rigid objects. While the last stages of the original Bilinear-CNN architecture completely removes the geometric information from consideration by performing orderless pooling, we observe that a better embedding can be learned by performing bilinear pooling in a more local way, where each pooling is confined to a predefined region. Our architecture thus represents a compromise between traditional convolutional networks and bilinear CNNs and strikes a balance between rigid matching and completely ignoring spatial information. We perform the experimental validation of the new architecture on the three popular benchmark datasets (Market-1501, CUHK01, CUHK03), comparing it to baselines that include Bilinear-CNN as well as prior art. The new architecture outperforms the baseline on all three datasets, while performing better than state-of-the-art on two out of three. The code and the pretrained models of the approach will be made available at the time of publication.

关键词： Feature extraction Computer architecture Convolutional codes Measurement Streaming media Neural networks

来源：评论

学校读者我要写书评

暂无评论

A deep reinforcement learning chatbot

arXiv

引用

arXiv 2017年

作者： Serban, Iulian V. Sankar, Chinnadhurai Germain, Mathieu Zhang, Saizheng Lin, Zhouhan Subramanian, Sandeep Kim, Taesup Pieper, Michael Chandar, Sarath Ke, Nan Rosemary Rajeshwar, Sai de Brebisson, Alexandre Sotelo, Jose M.R. Suhubdy, Dendi Michalski, Vincent Nguyen, Alexandre Pineau, Joelle Bengio, Yoshua Montreal Institute for Learning Algorithms MontrealQC Canada School of Computer Science McGill University CIFAR

We present MILABOT: a deep reinforcement learning chatbot developed by the montreal institute for learning algorithms (MILA) for the Amazon Alexa Prize competition. MILABOT is capable of conversing with humans on popular small talk topics through both speech and text. The system consists of an ensemble of natural language generation and retrieval models, including template-based models, bag-of-words models, sequence-to-sequence neural network and latent variable neural network models. By applying reinforcement learning to crowdsourced data and real-world user interactions, the system has been trained to select an appropriate response from the models in its ensemble. The system has been evaluated through A/B testing with real-world users, where it performed significantly better than many competing systems. Due to its machine learning architecture, the system is likely to improve with additional data. Copyright © 2017, The Authors. All rights reserved.

关键词： Deep learning

来源：评论

学校读者我要写书评

暂无评论

On random weights for texture generation in one layer CNNS

On random weights for texture generation in one layer CNNS

引用

IEEE International Conference on Acoustics, Speech and Signal Processing

作者： Mihir Mongia Kundan Kumar Akram Erraqabi Yoshua Bengio Stanford University United States of America IIT Kanpur India Montreal Institute for Learning Algorithms Canada

ISBN: (纸本)9781509041183

Recent work in the literature has shown experimentally that one can use the lower layers of a trained convolutional neural network (CNN) to model natural textures. More interestingly, it has also been experimentally shown that only one layer with random filters can also model textures although with less variability. In this paper we ask the question as to why one layer CNNs with random filters are so effective in generating textures? We theoretically show that one layer convolutional architectures (without a non-linearity) paired with the an energy function used in previous literature, can in fact preserve and modulate frequency coefficients in a manner so that random weights and pretrained weights will generate the same type of images. Based on the results of this analysis we question whether similar properties hold in the case where one uses one convolution layer with a non-linearity. We show that in the case of ReLu non-linearity there are situations where only one input will give the minimum possible energy whereas in the case of no nonlinearity, there are always infinite solutions that will give the minimum possible energy. Thus we can show that in certain situations adding a ReLu non-linearity generates less variable images.

关键词： Texture Generation CNN Random Weights nonlinearity one layer textures Random

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：