Details
ISBN:
(Print) 9783319690056; 9783319690049
Recently, image captioning, which aims to generate a textual description for an image automatically, has attracted researchers from various fields. Encouraging performance has been achieved by applying deep neural networks. Most of these works aim at generating a single caption, which may be incomprehensive, especially for complex images. This paper proposes a topic-specific multi-caption generator, which first infers topics from the image and then generates a variety of topic-specific captions, each of which depicts the image from a particular topic. We perform experiments on Flickr8k, Flickr30k and MSCOCO. The results show that the proposed model performs better than a single-caption generator when generating topic-specific captions. The proposed model effectively generates diverse captions under reasonable topics, and the captions differ from each other at the topic level.
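The two-stage pipeline the abstract describes can be sketched as follows. This is a minimal illustration, not the authors' implementation: the topic posterior and the per-topic decoder stubs (`infer_topics`, `decoders`) are hypothetical stand-ins for the model's learned components.

```python
# Sketch: infer topics from an image, then run one topic-conditioned
# caption decoder per inferred topic.

def infer_topics(topic_scores, k=2):
    """Return the k most probable topic labels for an image."""
    ranked = sorted(topic_scores.items(), key=lambda kv: kv[1], reverse=True)
    return [topic for topic, _ in ranked[:k]]

def generate_captions(topic_scores, decoders, k=2):
    """Run one topic-conditioned decoder per inferred topic."""
    return {t: decoders[t]() for t in infer_topics(topic_scores, k)}

# Toy topic posterior and per-topic decoder stubs.
scores = {"sports": 0.6, "animals": 0.3, "food": 0.1}
decoders = {
    "sports": lambda: "a man playing frisbee on a field",
    "animals": lambda: "a dog running across the grass",
    "food": lambda: "a plate of food on a table",
}
captions = generate_captions(scores, decoders, k=2)
```

With `k=2`, the two highest-scoring topics each yield one caption, so the output differs at the topic level by construction.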
Details
ISBN:
(Print) 9783319690056; 9783319690049
This paper presents a character-level encoder-decoder modeling method for question answering (QA) from large-scale knowledge bases (KB). This method improves the existing approach [9] in three aspects. First, long short-term memory (LSTM) structures are adopted to replace the convolutional neural networks (CNN) for encoding the candidate entities and predicates. Second, a new strategy of generating negative samples for model training is adopted. Third, a data augmentation strategy is applied to increase the size of the training set by generating factoid questions using another trained encoder-decoder model. Experimental results on the SimpleQuestions dataset and the Freebase5M KB demonstrate the effectiveness of the proposed method, which improves the state-of-the-art accuracy from 70.3% to 78.8% when augmenting the training set with 70,000 generated triple-question pairs.
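The negative-sampling step mentioned above can be illustrated with a simple sketch. The abstract does not detail the selection criteria, so this assumes plain random sampling over candidate (entity, predicate) pairs that differ from the gold answer; the names and KB identifiers are illustrative only.

```python
import random

def sample_negatives(positive, candidates, k, seed=0):
    """Draw k negative (entity, predicate) candidates distinct from the
    gold answer. A simplified stand-in for the paper's strategy."""
    pool = [c for c in candidates if c != positive]
    rng = random.Random(seed)  # fixed seed for reproducibility
    return rng.sample(pool, min(k, len(pool)))

negatives = sample_negatives(
    positive=("barack_obama", "people.person.place_of_birth"),
    candidates=[
        ("barack_obama", "people.person.place_of_birth"),
        ("barack_obama", "people.person.profession"),
        ("michelle_obama", "people.person.place_of_birth"),
        ("barack_obama_sr", "people.person.children"),
    ],
    k=2,
)
```

Mining negatives that share an entity or predicate with the gold pair (as the candidate list here does) tends to make training harder and more informative than uniform sampling over the whole KB.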
Details
ISBN:
(Print) 9781510848764
Recently, end-to-end speech recognition has attracted much attention. One of the popular models for end-to-end speech recognition is the attention-based encoder-decoder model, which usually generates output sequences iteratively by attending to the whole representation of the input sequence. However, delaying predictions until the whole input sequence has been received is not practical for online or low-latency speech recognition. In this paper, we present a simple but effective attention mechanism that allows the encoder-decoder model to generate outputs without attending to the entire input sequence and that can be applied to online speech recognition. At each prediction step, the attention is assumed to be a time-moving Gaussian window with variable size, which can be predicted from previous input and output information instead of content-based computation over the whole input sequence. To further improve the online performance of the model, we employ deep convolutional neural networks as the encoder. Experiments show that the Gaussian-prediction-based attention works well and, with the help of deep convolutional neural networks, the online model achieves a 19.5% phoneme error rate on the TIMIT ASR task.
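The core of the mechanism, a Gaussian window over encoder time steps, is easy to sketch. In the paper's model the window's center and width would be predicted from previous decoder state; here they are passed in directly, so this shows only how the window turns into attention weights, not how the prediction network works.

```python
import math

def gaussian_attention(center, width, length):
    """Attention weights as a normalized Gaussian window over encoder
    steps 0..length-1. `center` moves forward in time as decoding
    proceeds; `width` controls how many frames are attended to."""
    w = [math.exp(-0.5 * ((t - center) / width) ** 2) for t in range(length)]
    s = sum(w)
    return [x / s for x in w]

weights = gaussian_attention(center=4.0, width=1.5, length=10)
```

Because the weights depend only on the predicted `center` and `width`, no pass over the full input representation is needed, which is what makes the model usable online.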
Details
ISBN:
(Print) 9783319686127; 9783319686110
Using sentiment analysis methods to retrieve useful information from the documents accumulated on the Internet has become an important research subject. In this paper, we propose a semi-supervised framework which uses unlabeled data to promote the learning ability of the long short-term memory (LSTM) network. It is composed of an unsupervised attention-aware LSTM encoder-decoder and a single LSTM model used for feature extraction and classification. Experimental study on commonly used datasets demonstrates our framework's good potential for sentiment classification tasks and shows that the unsupervised learning part can improve the LSTM network's learning ability.
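The semi-supervised structure can be mirrored in a toy sketch: unlabeled text shapes the representation (here, merely the vocabulary of a bag-of-words "encoder"), and a simple classifier is then fit on the few labeled examples. The paper uses an attention-aware LSTM encoder-decoder and an LSTM classifier instead; this sketch, with its invented nearest-centroid classifier, only mirrors the two-part structure.

```python
def build_vocab(texts):
    """Vocabulary learned from both unlabeled and labeled text."""
    return sorted({w for t in texts for w in t.split()})

def encode(text, vocab):
    """Bag-of-words vector, standing in for a learned encoder."""
    words = text.split()
    return [words.count(w) for w in vocab]

def centroid(vectors):
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]

def classify(text, vocab, centroids):
    """Nearest-centroid classification in the encoded space."""
    v = encode(text, vocab)
    dist = lambda c: sum((a - b) ** 2 for a, b in zip(v, c))
    return min(centroids, key=lambda label: dist(centroids[label]))

unlabeled = ["great plot and great acting", "dull plot and dull acting"]
labeled = [("great movie", "pos"), ("dull movie", "neg")]

vocab = build_vocab(unlabeled + [t for t, _ in labeled])
centroids = {lab: centroid([encode(t, vocab) for t, l in labeled if l == lab])
             for lab in ("pos", "neg")}
pred = classify("great acting", vocab, centroids)
```

The point of the structure is that the representation benefits from all the data while the supervised part needs only the labeled subset.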
Details
ISBN:
(Print) 9783319700960; 9783319700953
Recently, the encoder-decoder framework for sequence-to-sequence (seq2seq) tasks has been widely used in open-domain generation-based conversation systems. One of the most difficult challenges in encoder-decoder based open-domain conversation systems is the unknown-words issue: numerous words become out-of-vocabulary words (OOVs) due to the restriction of the vocabulary size, while a conversation system always tries to avoid their appearance. This paper proposes a novel approach named Low Frequency Words Compression (LFWC) to address this problem by selectively using K-Components shared symbols for the representations of low-frequency words. Whereas the standard encoder-decoder works at the word level, our LFWC encoder-decoder works at the symbol level; we propose a Sequence Transform to convert a word-level sequence into a symbol-level sequence, and an LFWC-Predictor to decode a symbol-level sequence back into a word-level sequence. To measure the interference of OOVs in neural conversation systems, besides log-perplexity (LP), we apply two more suitable metrics, UP-LP and UP-Delta. Experiments show that decoding from compressed symbol-level sequences to word-level sequences achieves a recall@1 score of 60.9%, far above the baseline's 16.7%, under the strongest compression ratio. They also show that our approach outperforms the standard encoder-decoder model in reducing the interference of OOVs, achieving almost half the UP-Delta score in most configurations.
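A word-to-symbol Sequence Transform in the spirit of the abstract can be sketched as follows. The hash-bucket assignment of low-frequency words to K shared symbols is an assumption for illustration; the paper's actual K-Components scheme may assign symbols differently.

```python
from collections import Counter

def build_compressor(corpus, vocab_size, k):
    """Keep the vocab_size most frequent words; map every other word
    to one of k shared symbols (here via a simple character-sum hash)."""
    freq = Counter(w for sent in corpus for w in sent)
    keep = {w for w, _ in freq.most_common(vocab_size)}

    def transform(sentence):
        return [w if w in keep else f"<lfw_{sum(map(ord, w)) % k}>"
                for w in sentence]

    return transform

corpus = [["i", "like", "cats"], ["i", "like", "dogs"],
          ["i", "saw", "a", "quokka"]]
transform = build_compressor(corpus, vocab_size=3, k=4)
out = transform(["i", "like", "quokka"])
```

Sharing a small set of symbols among all rare words keeps the model's vocabulary bounded; a separate predictor (the paper's LFWC-Predictor) is then responsible for recovering the word-level sequence.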
Details
ISBN:
(Print) 9781450349062
Exploiting multimodal features has become a standard approach in many video applications, including the video captioning task. One problem with the existing work is that it models the relevance of each type of feature evenly, which neutralizes the impact of each individual modality on the word to be generated. In this paper, we propose a novel Modal Attention Network (MANet) to account for this issue. Our MANet extends the standard encoder-decoder network by adapting the attention mechanism to video modalities. As a result, MANet emphasizes the impact of each modality with respect to the word to be generated. Experimental results show that our MANet effectively utilizes multimodal features to generate better video descriptions. Notably, our MANet system ranked among the top three systems at the 2nd Video to Language Challenge in both automatic metrics and human evaluations.
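The modality-level attention idea reduces to a softmax over per-modality relevance scores followed by a weighted fusion of the modality features. In MANet the scores would come from the decoder state at each generated word; in this sketch they are fixed constants, and the modality names and feature vectors are illustrative.

```python
import math

def modal_attention(scores, features):
    """Softmax over per-modality scores, then a weighted sum of the
    per-modality feature vectors."""
    m = max(scores.values())  # subtract max for numerical stability
    exp = {k: math.exp(v - m) for k, v in scores.items()}
    z = sum(exp.values())
    weights = {k: v / z for k, v in exp.items()}
    dim = len(next(iter(features.values())))
    fused = [sum(weights[k] * features[k][i] for k in features)
             for i in range(dim)]
    return weights, fused

scores = {"visual": 2.0, "audio": 0.5, "motion": 1.0}
features = {"visual": [1.0, 0.0], "audio": [0.0, 1.0], "motion": [0.5, 0.5]}
weights, fused = modal_attention(scores, features)
```

Because the weights are recomputed per generated word, a word like "singing" can lean on the audio modality while "red" leans on the visual one, which is exactly the unevenness the paper argues standard fusion lacks.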