Word posterior probabilities are a common approach for confidence estimation in automatic speech recognition and machine translation. We will generalize this idea and introduce n-gram posterior probabilities and show ...
We propose and study three different novel approaches for tackling the problem of development set selection in Statistical Machine Translation. We focus on a scenario where a machine translation system is leveraged fo...
ISBN:
(print) 9781479927579
We show that most search errors can be identified by aligning the results of a symmetric forward and a backward decoding pass. Based on this observation, we introduce an efficient high-level decoding architecture that yields virtually no search errors and requires virtually no manual tuning. We perform initial forward and backward decodings with tight beams, identify search errors, and then recursively increase the beam sizes and re-decode the erroneous intervals forward and backward until no more search errors are detected. Consequently, each utterance, and even each individual word, is decoded with the smallest beam size required to decode it correctly. On all tested systems we achieve an error rate equal to or very close to that of classical decoding with an ideally tuned beam size, but without any specific tuning and at roughly twice the speed. A further speedup by a factor of 2 can be achieved by running the forward and backward passes in separate threads.
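The decode-align-widen loop described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the toy `decode` stub and its per-word, per-direction "difficulties" are hypothetical stand-ins for a real forward/backward decoder, and for brevity the whole utterance is re-decoded instead of only the erroneous intervals.

```python
def decode(utt, beam, direction):
    """Toy decoder: a word is hypothesized correctly iff the beam is at
    least as wide as its (made-up) difficulty in that search direction;
    otherwise the two passes emit different wrong labels, so a search
    error shows up as a forward/backward disagreement."""
    out = []
    for word, fwd_diff, bwd_diff in utt:
        diff = fwd_diff if direction == "fwd" else bwd_diff
        out.append(word if beam >= diff else "<%s-err:%s>" % (direction, word))
    return out

def align_and_widen(utt, beam=1, max_beam=32):
    """Double the beam (here for the whole utterance; the paper widens
    only the erroneous intervals) until both passes agree."""
    while True:
        fwd = decode(utt, beam, "fwd")
        bwd = decode(utt, beam, "bwd")
        if fwd == bwd or beam >= max_beam:
            return fwd, beam
        beam *= 2

# each entry: (word, forward difficulty, backward difficulty) -- made up
utt = [("the", 1, 1), ("quick", 4, 2), ("fox", 2, 8)]
hyp, final_beam = align_and_widen(utt)
# agreement is reached once the beam covers the hardest word in either pass
```

The key property mirrored here is that symmetric forward and backward searches rarely make the *same* error, so agreement between the two passes is a usable proxy for the absence of search errors.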
We present discriminative reordering models for phrase-based statistical machine translation. The models are trained using the maximum entropy principle. We use several types of features: based on words, based on word...
ISBN:
(digital) 9781509066315
ISBN:
(print) 9781509066322
Training deep neural networks is often challenging in terms of stability; convergence frequently requires careful hyperparameter tuning or a pretraining scheme. Layer normalization (LN) has been shown to be a crucial ingredient in training deep encoder-decoder models. We explore several LN variants of long short-term memory (LSTM) recurrent neural networks (RNNs) by applying LN to different parts of the internal recurrency of the LSTM; no previous work investigates this. We carry out experiments on the Switchboard 300h task for both hybrid and end-to-end ASR models and show that LN improves the final word error rate (WER), stabilizes training, allows training even deeper models, requires less hyperparameter tuning, and works well even without pretraining. We find that applying LN jointly to the forward and recurrent inputs, a variant we denote the Global Joined Norm, gives a 10% relative improvement in WER.
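The "joined" LN placement described above can be illustrated with a single numpy LSTM step. This is a minimal sketch under assumptions: the LN gain/bias parameters are omitted, and the paper's exact parametrization of the recurrency may differ.

```python
import numpy as np

def layer_norm(x, eps=1e-5):
    # normalize over the feature dimension (learned gain/bias omitted)
    return (x - x.mean()) / np.sqrt(x.var() + eps)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def ln_lstm_step(x, h, c, W, R, b):
    # LN applied once to the *summed* forward (W x) and recurrent (R h)
    # pre-activations of all four gates -- the joint placement
    z = layer_norm(W @ x + R @ h) + b
    i, f, o, g = np.split(z, 4)          # input, forget, output, cell gates
    c = sigmoid(f) * c + sigmoid(i) * np.tanh(g)
    h = sigmoid(o) * np.tanh(c)
    return h, c

rng = np.random.default_rng(0)
n = 4                                    # toy hidden size
W = rng.normal(size=(4 * n, n))          # forward weights
R = rng.normal(size=(4 * n, n))          # recurrent weights
b = np.zeros(4 * n)
h = c = np.zeros(n)
h, c = ln_lstm_step(rng.normal(size=n), h, c, W, R, b)
```

Other variants studied in this line of work would move the `layer_norm` call, e.g. normalizing `W @ x` and `R @ h` separately before summing.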
We propose several tracking adaptation approaches to recover from early tracking errors in sign language recognition by optimizing the obtained tracking paths w.r.t. the hypothesized word sequences of an automatic sign language recognition system. Most state-of-the-art systems treat tracking as a preprocessing feature-extraction step, so hand or head tracking is usually optimized only according to a tracking criterion; as a consequence, methods that depend on accurate detection and tracking of body parts lead to recognition errors in gesture and sign language processing. We analyze an integrated tracking and recognition approach addressing these problems and propose approximations over multiple hand hypotheses to reduce the time complexity of the integrated approach. Experiments on a publicly available benchmark database show that the proposed methods strongly improve the recognition accuracy of the system.
ISBN:
(digital) 9781509066315
ISBN:
(print) 9781509066322
Long short-term memory (LSTM) networks are the dominant architecture for large vocabulary continuous speech recognition (LVCSR) acoustic modeling due to their good performance. However, LSTMs are hard to tune and computationally expensive. To build a system with lower computational cost that also supports online streaming applications, we explore convolutional neural networks (CNNs). To the best of our knowledge there is no overview of CNN hyperparameter tuning for LVCSR in the literature, so we report our results explicitly. Apart from recognition performance, we focus on training and evaluation speed and provide a time-efficient setup for CNNs. We encountered overfitting during training and solved it with data augmentation, namely SpecAugment. The system achieves results competitive with the best LSTM results, and we significantly increased the training and decoding speed of the CNN, approaching that of the offline LSTM.
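The SpecAugment-style masking mentioned above can be sketched in a few lines of numpy. This is a simplified illustration (one frequency mask and one time mask, no time warping); the mask widths `max_f` and `max_t` are hypothetical values, not the paper's settings.

```python
import numpy as np

def spec_augment(spec, max_f=8, max_t=20, rng=None):
    """Toy SpecAugment: zero out one random frequency band and one
    random time band of a (time x freq) log-mel spectrogram."""
    if rng is None:
        rng = np.random.default_rng()
    spec = spec.copy()                       # leave the input untouched
    T, F = spec.shape
    f = rng.integers(0, max_f + 1)           # frequency mask width
    f0 = rng.integers(0, F - f + 1)
    spec[:, f0:f0 + f] = 0.0
    t = rng.integers(0, max_t + 1)           # time mask width
    t0 = rng.integers(0, T - t + 1)
    spec[t0:t0 + t, :] = 0.0
    return spec

spec = np.ones((100, 40))                    # dummy 100-frame, 40-bin input
masked = spec_augment(spec, rng=np.random.default_rng(0))
```

Because the masks are sampled anew each call, the same utterance yields a different augmented view every epoch, which is what combats the overfitting described above.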
We give an overview of the RWTH phrase-based statistical machine translation system that was used in the evaluation campaign of the International Workshop on Spoken language Translation 2005. We use a two pass approac...
One of the most difficult challenges in face recognition is the large variation in pose. One approach to handle this problem is to use a 2D-warping algorithm within a nearest-neighbor classifier. The 2D-warping algorithm optimizes an energy function that captures the cost of matching pixels between two images while respecting the 2D dependencies defined by local pixel neighborhoods. Optimizing this energy function is an NP-complete problem and is therefore approached with algorithms that approximate the optimal solution. In this paper we compare two such algorithms that do so without discarding any 2D dependencies, and we study the effect of the quality of the approximate solutions on the classification performance. Additionally, we propose a new algorithm that finds solutions with lower (better) energies than the other methods. The experimental evaluation on the CMU-MultiPIE database shows that the proposed algorithm also achieves state-of-the-art recognition accuracies.
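A typical energy function of the kind described above (pixel-matching cost plus a penalty on displacement differences of neighboring pixels) can be evaluated as follows. This is a generic sketch under assumptions, not the paper's exact cost terms; the squared-difference data term and quadratic smoothness penalty are illustrative choices.

```python
import numpy as np

def warping_energy(src, dst, warp, lam=1.0):
    """Energy of a candidate 2D warping from src to dst.
    warp[y, x] = (wy, wx) is the dst pixel matched to src pixel (y, x)."""
    H, W = src.shape
    data = 0.0                                   # pixel-matching cost
    for y in range(H):
        for x in range(W):
            wy, wx = warp[y, x]
            data += float((src[y, x] - dst[wy, wx]) ** 2)
    smooth = 0.0                                 # 2D neighborhood penalty
    for y in range(H):
        for x in range(W):
            for dy, dx in ((0, 1), (1, 0)):      # right and down neighbors
                ny, nx = y + dy, x + dx
                if ny < H and nx < W:
                    # difference of the two pixels' displacements
                    d = warp[ny, nx] - warp[y, x] - np.array((dy, dx))
                    smooth += float(d @ d)
    return data + lam * smooth

img = np.arange(9.0).reshape(3, 3)
# identity warp: every pixel maps to itself -> zero energy against itself
identity = np.dstack(np.meshgrid(np.arange(3), np.arange(3), indexing="ij"))
```

Evaluating this energy is cheap; the NP-complete part discussed in the abstract is *minimizing* it over all warpings while keeping both the horizontal and vertical dependency terms.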
ISBN:
(digital) 9781728165530
ISBN:
(print) 9781728165547
The task of fine-grained visual classification (FGVC) deals with classification problems that exhibit a small inter-class variance, such as distinguishing between different bird species or car models. State-of-the-art approaches typically tackle this problem by integrating an elaborate attention mechanism or (part-)localization method into a standard convolutional neural network (CNN). In this work we likewise aim to enhance the performance of a backbone CNN, such as ResNet, by adding three efficient and lightweight components specifically designed for FGVC: global k-max pooling, a discriminative embedding layer trained by optimizing class means, and an efficient localization module that estimates bounding boxes using only class labels for training. The resulting model achieves state-of-the-art recognition accuracies on multiple FGVC benchmark datasets.
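Of the three components above, global k-max pooling is the simplest to sketch: per channel, average the k largest spatial activations, interpolating between global max pooling (k = 1) and global average pooling (k = H·W). A minimal numpy version, with an illustrative k that is not necessarily the paper's setting:

```python
import numpy as np

def global_k_max_pool(fmap, k=4):
    """Global k-max pooling over a (C x H x W) feature map: for each
    channel, return the mean of its k largest spatial activations."""
    C = fmap.shape[0]
    flat = fmap.reshape(C, -1)              # flatten spatial dims
    topk = np.sort(flat, axis=1)[:, -k:]    # k largest values per channel
    return topk.mean(axis=1)                # (C,) pooled descriptor

fmap = np.arange(2 * 3 * 3, dtype=float).reshape(2, 3, 3)
pooled = global_k_max_pool(fmap, k=2)
# channel 0 holds 0..8 -> mean of {7, 8} = 7.5; channel 1 -> 16.5
```

Compared to plain max pooling, averaging the top k activations keeps the focus on the most discriminative regions (useful for FGVC's small inter-class differences) while being less sensitive to a single noisy peak.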