检索结果-内蒙古大学图书馆

25th ACM International Conference on Multimedia (MM)

作者： Phan, Sang Miyao, Yusuke Satoh, Shin'ichi Natl Inst Informat Tokyo Japan

ISBN: (纸本)9781450349062

Exploiting multimodal features has become a standard approach towards many video applications, including the video captioning task. One problem with the existing work is that it models the relevance of each type of features evenly, which neutralizes the impact of each individual modality to the word to be generated. In this paper, we propose a novel Modal Attention Network (MANet) to account for this issue. Our MANet extends the standard encoder-decoder network by adapting the attention mechanism to video modalities. As a result, MANet emphasizes the impact of each modality with respect to the word to be generated. Experimental results show that our MANet effectively utilizes multimodal features to generate better video descriptions. Especially, our MANet system was ranked among the top three systems at the 2nd Video to Language Challenge in both automatic metrics and human evaluations.

关键词： video captioning attention mechanism multimodal encoder-decoder

来源：评论

学校读者我要写书评

暂无评论

Question Answering with Character-Level LSTM encoders and Model-Based Data Augmentation

Question Answering with Character-Level LSTM Encoders and Mo...

引用

第十六届全国计算语言学学术会议暨第五届基于自然标注大数据的自然语言处理国际学术研讨会

作者： Run-Ze Wang Chen-Di Zhan Zhen-Hua Ling National Engineering Laboratory for Speech and Language Information Processing University of Science and Technology of ChinaHefeiChina

ISBN: (纸本)9783319690049

This paper presents a character-level encoder-decoder mod-eling method for question answering(QA)from large-scale knowledge bases(KB).This method improves the existing approach [9] from three ***,long short-term memory(LSTM)structures are adopted to replace the convolutional neural networks(CNN)for encoding the can-didate entities and ***,a new strategy of generating neg-ative samples for model training is ***,a data augmentation strategy is applied to increase the size of the training set by generating factoid questions using another trained encoder-decoder ***-mental results on the SimpleQuestions dataset and the Freebase5M KB demonstrates the effectiveness of the proposed method,which improves the state-of-the-art accuracy from 70.3%to 78.8%when augmenting the training set with 70,000 generated triple-question pairs.

关键词： Question Answering Knowledge Base Long Short-TermMemory encoder-decoder

来源：评论

学校读者我要写书评

暂无评论

面向事件的社交媒体文本自动摘要研究

面向事件的社交媒体文本自动摘要研究

引用

作者：官宸宇武汉大学

学位级别：硕士

自动文本摘要技术是自然语言处理领域重要的一个分支,初期主要广泛应用于长文本摘要任务中,例如科技论文、新闻领域等。近年来微博、Twitter等短文本形式的社交媒体快速广泛地流行起来,其方便和快捷的使用方式以及平台上的海量信息资源... 详细信息

自动文本摘要技术是自然语言处理领域重要的一个分支,初期主要广泛应用于长文本摘要任务中,例如科技论文、新闻领域等。近年来微博、Twitter等短文本形式的社交媒体快速广泛地流行起来,其方便和快捷的使用方式以及平台上的海量信息资源,使得人们开始通过社交媒体平台来实时地获取各种信息资源,尤其是真实的社会热点事件信息。然而社交媒体文本具有篇幅短小、内容碎片化等特点,且海量数据伴随着巨大的冗余,给用户识别和理解带来了很大的困难。因此,以社交媒体文本为目标数据集的自动摘要任务受到重视。已有的自动摘要方法大多是基于关键句子抽取的方式来组合形成摘要,但是该类方法由于忽略了对文本结构和语言特征的分析和理解,生成的摘要可读性较差,也不可避免地存在冗余问题。随着深度学习技术的不断进步,其在自动摘要领域的表现很好地弥补了抽取式摘要方法的不足,然而当前的研究对象仅在句子和段落级别,对于实际任务缺乏应用性。本文针对社交媒体上引起广泛讨论的真实社会事件,应用自动摘要技术生成一段可以较为全面地概括该事件的摘要文本来提供给用户,从而节省用户获取事件信息的时间和精力。第一,通过结合抽取式和抽象式摘要方法各自的优势,提出了将事件摘要任务划分为两步走的策略。第二,通过使用Canopy和K-means相结合的聚类技术和时间戳技术对事件的关键方面或其发展过程进行识别,形成了事件下的多个子主题簇。第三,受到人工摘要产生过程的启发,提出了一种改进的基于注意力模型的encoder-decoder框架模型MEOD作为本文使用的摘要生成模型,将第一步产生的子主题文本作为模型输入来生成子摘要,进而组合形成最终的事件摘要。通过对实验结果进行自动评测和人工评测,均显示本文摘要方法优于对比方法,有效证明了本文摘要方法的有效性。其中,子主题识别环节添加的社会特征和时间戳等信息有效地提高了子主题划分的准确性和完整性,基于encoder-decoder框架的摘要生成模型显著地提升了摘要质量,尤其是可读性方面。另外,本文提出的结合抽取式和抽象式两种摘要方法的思路,为面向短文本的多文档事件摘要研究提供了新的思考方向。

关键词：自动摘要 encoder-decoder 聚类算法社交媒体事件

来源：评论

学校读者我要写书评

暂无评论

Perceptual Loss with Fully Convolutional for Image Residual Denoising

Perceptual Loss with Fully Convolutional for Image Residual ...

引用

7th Chinese Conference on Pattern Recognition (CCPR)

作者： Tao Pan Fu Zhongliang Wang Lili Zhu Kai Chinese Acad Sci Chengdu Inst Comp Applicat Chengdu 610041 Sichuan Peoples R China Univ Chinese Acad Sci Beijing 100049 Peoples R China

ISBN: (纸本)9789811030055;9789811030048

In this paper we propose a fully convolutional encoder-decoder framework for image residual transformation tasks. Instead of only using per-pixel loss function, the proposed framework learn end-to-end mapping combined with perceptual loss function that depend on low-level features from a pre-trained network. Pointing out the mapping function in order to handle noise-free image by introduce identity mapping. And through an analysis of the interplay between the neural networks and the underlying noisy distribution which they seeking to learn. We also show how to construct a uniform transform, which is then used to make a single deep neural network work well across different levels of noise. Comparing with previous approaches, ours achieves better performance. The experimental results indicate the efficiency of the proposed algorithm to cope with image denoising tasks.

关键词： Residual denoising encoder-decoder Perceptual loss

来源：评论

学校读者我要写书评

暂无评论

JOINTLY LEARNING TO ALIGN AND CONVERT GRAPHEMES TO PHONEMES WITH NEURAL ATTENTION MODELS

JOINTLY LEARNING TO ALIGN AND CONVERT GRAPHEMES TO PHONEMES ...

引用

IEEE Workshop on Spoken Language Technology (SLT)

作者： Toshniwal, Shubham Livescu, Karen Toyota Technol Inst Chicago IL 60637 USA

ISBN: (纸本)9781509049035

We propose an attention-enabled encoder-decoder model for the problem of grapheme-to-phoneme conversion. Most previous work has tackled the problem via joint sequence models that require explicit alignments for training. In contrast, the attention-enabled encoder-decoder model allows for jointly learning to align and convert characters to phonemes. We explore different types of attention models, including global and local attention, and our best models achieve state-of-the-art results on three standard data sets (CMU-Dict, Pronlex, and NetTalk).

关键词： grapheme-to-phoneme LSTM encoder-decoder attention

来源：评论

学校读者我要写书评

暂无评论

ANALYSIS OF SEQUENCE TO SEQUENCE NEURAL NETWORKS ON GRAPHEME TO PHONEME CONVERSION TASK

ANALYSIS OF SEQUENCE TO SEQUENCE NEURAL NETWORKS ON GRAPHEME...

引用

International Joint Conference on Neural Networks (IJCNN)

作者： Achanta, Sivanand Pandey, Ayushi Gangashetty, Suryakanth V. IIIT Speech & Vis Lab Hyderabad Andhra Pradesh India

ISBN: (纸本)9781509006199

In this paper, we analyze the performance of various sequence to sequence neural networks on the task of grapheme to phoneme (G2P) conversion. G2P is a very important component in applications like text-to-speech, automatic speech recognition etc,. Because the number of graphemes that a word consists of and the corresponding number of phonemes are different, they are first aligned and then mapped. With the recent advent of sequence to sequence neural networks, the alignment step can be skipped allowing us to directly map the input and output sequences. Although the sequence to sequence neural nets have been applied for this task very recently, there are some questions concerning the architecture that need to be addressed. We show in this paper that, complex recurrent neural network units (like long-short term memory cells) may not be required to achieve good performance on this task. Instead simple recurrent neural networks (RNN) will suffice. We also show that the encoder can be a uni-directional RNN as opposed to the usually preferred bi-directional RNN. Further, our experiments reveal that encoder-decoder models with soft-alignment outperforms fixed vector context counterpart. The results demonstrate that with very few parameters we can indeed achieve comparable performance to much more complicated architectures.

关键词： Grapheme to phoneme recurrent neural networks LSTM encoder-decoder

来源：评论

学校读者我要写书评

暂无评论

DIALOG STATE TRACKING WITH ATTENTION-BASED SEQUENCE-TO-SEQUENCE LEARNING

DIALOG STATE TRACKING WITH ATTENTION-BASED SEQUENCE-TO-SEQUE...

引用

IEEE Workshop on Spoken Language Technology (SLT)

作者： Hori, Takaaki Wang, Hai Hori, Chiori Watanabe, Shinji Harsham, Bret Le Roux, Jonathan Hershey, John R. Koji, Yusuke Jing, Yi Zhu, Zhaocheng Aikawa, Takeyuki Mitsubishi Elect Res Labs Cambridge MA 02139 USA Toyota Technol Inst Chicago IL USA Mitsubishi Electr Corp Informat Technol R&D Ctr Tokyo Tokyo Japan Peking Univ Beijing Peoples R China

ISBN: (纸本)9781509049035

We present an advanced dialog state tracking system designed for the 5th Dialog State Tracking Challenge (DSTC5). The main task of DSTC5 is to track the dialog state in a human-human dialog. For each utterance, the tracker emits a frame of slot-value pairs considering the full history of the dialog up to the current turn. Our system includes an encoder-decoder architecture with an attention mechanism to map an input word sequence to a set of semantic labels, i.e., slot-value pairs. This handles the problem of the unknown alignment between the utterances and the labels. By combining the attention-based tracker with rule-based trackers elaborated for English and Chinese, the F-score for the development set improved from 0.475 to 0.507 compared to the rule-only trackers. Moreover, we achieved 0.517 F-score by refining the combination strategy based on the topic and slot level performance of each tracker. In this paper, we also validate the efficacy of each technique and report the test set results submitted to the challenge.

关键词： Dialog state tracking attention model sequence-to-sequence learning encoder-decoder long short-term memory

来源：评论

学校读者我要写书评

暂无评论

Topic-specific Image Caption Generation

Topic-specific Image Caption Generation

引用

第十六届全国计算语言学学术会议暨第五届基于自然标注大数据的自然语言处理国际学术研讨会

作者： Chang Zhou Yuzhao Mao Xiao jie Wang School of Computer Beijing University of Posts and TelecommunicationsBeijingChina

ISBN: (纸本)9783319690049

Recently,image caption which aims to generate a textual description for an image automatically has attracted researchers from various *** performance has been achieved by applying deep neural *** of these works aim at generating a single caption which may be incomprehensive,especially for complex *** paper proposes a topic-specific multi-caption generator,which in-fer topics from image first and then generate a variety of topic-specific captions,each of which depicts the image from a particular *** per-form experiments on flickr8k,flickr30k and *** results show that the proposed model performs better than single-caption generator when generating topic-specific *** proposed model effectively generates diversity of captions under reasonable topics and they differ from each other in topic level.

关键词： Image caption Topic model encoder-decoder

来源：评论

学校读者我要写书评

暂无评论

A linear approach for sparse coding by a two-layer neural network

引用

NEUROCOMPUTING 2015年第PartC期149卷 1315-1323页

作者： Montalto, Alessandro Tessitore, Giovanni Prevete, Roberto Univ Ghent Data Anal Dept Ghent Belgium Univ Naples Federico II Dept Phys Sci Naples Italy Univ Naples Federico II DIETI Naples Italy

Many approaches to transform classification problems from non-linear to linear by feature transformation have been recently presented in the literature. These notably include sparse coding methods and deep neural networks. However, many of these approaches require the repeated application of a learning process upon the presentation of unseen data input vectors, or else involve the use of large numbers of parameters and hyper-parameters, which must be chosen through cross-validation, thus increasing running time dramatically. In this paper, we propose and experimentally investigate a new approach for the purpose of overcoming limitations of both kinds. The proposed approach makes use of a linear auto-associative network (called SCNN) with just one hidden layer. The combination of this architecture with a specific error function to be minimized enables one to learn a linear encoder computing a sparse code which turns out to be as similar as possible to the sparse coding that one obtains by re-training the neural network. Importantly, the linearity of SCNN and the choice of the error function allow one to achieve reduced running time in the learning phase. The proposed architecture is evaluated on the basis of two standard machine learning tasks. Its performances are compared with those of recently proposed non-linear auto-associative neural networks. The overall results suggest that linear encoders can be profitably used to obtain sparse data representations in the context of machine learning problems, provided that an appropriate error function is used during the learning phase. (c) 2014 Elsevier B.V. All rights reserved.

关键词： Neural networks Sparse coding Linear approach encoder-decoder

来源：评论

学校读者我要写书评

暂无评论

Design of Digital Functional Blocks using Hybrid Memristor Structures 35

Design of Digital Functional Blocks using Hybrid Memristor S...

引用

IEEE Region 10 Conference

作者： Kumar, Gaurav Datta, Kamalika Natl Inst Technol Dept Comp Sci & Engn Shillong Meghalaya India

ISBN: (纸本)9781479986415

The memristor device has emerged as the missing fourth fundamental circuit element after resistor, inductor and capacitor. Various implementations of memristors have been reported, with the one using a TiO2 layer sandwiched between two platinum electrodes considered to be most promising. Because of its very small feature sizes and low power consumption, it is projected to replace CMOS technology in several application areas. Various memory and logic design styles using memristors have been reported. A hybrid technology that combines memristors with CMOS gates is promising, and can be fabricated on the same silicon wafer. The present paper proposes the designs of various functional blocks like multiplexers, encoders and decoders using the hybrid memristor structure, with analyses regarding their design complexities. The design methodology is general, and can be used to synthesize arbitrary functional blocks as well.

关键词： Memristor Hybrid CMOS encoder-decoder multiplexer

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：