ISBN: (Print) 9781457705380
Log-linear acoustic models have been shown to be competitive with Gaussian mixture models in speech recognition. Their high training time can be reduced by feature selection. We compare a simple univariate feature selection algorithm with ReliefF, an efficient multivariate algorithm. An alternative to feature selection is ℓ1-regularized training, which leads to sparse models. We observe that this gives no speedup when sparse features are used, hence feature selection methods are preferable. For dense features, ℓ1-regularization can reduce training and recognition time. We generalize the well-known Rprop algorithm for the optimization of ℓ1-regularized functions. Experiments on the Wall Street Journal corpus showed that a large number of sparse features could be discarded without loss of performance. Strong regularization led to slight performance degradation, but can be useful on large tasks where training the full model is not tractable.
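The abstract names a generalization of Rprop to ℓ1-regularized objectives without spelling out the update rule here. As a minimal sketch under that gap, the snippet below runs plain sign-based Rprop (the variant without weight backtracking) on a smooth loss plus an ℓ1 subgradient; the function name, hyperparameters, and the toy least-squares usage are hypothetical, and the paper's actual ℓ1 handling, which would be needed to obtain exact zeros, is not reproduced.

import numpy as np

def rprop_l1(grad_fn, w0, lam=1e-4, eta_plus=1.2, eta_minus=0.5,
             delta0=0.1, delta_min=1e-6, delta_max=1.0, iters=100):
    # Sign-based Rprop (no backtracking) on a smooth loss plus an l1 subgradient.
    # Illustrative sketch only; not the paper's generalized Rprop.
    w = w0.astype(float).copy()
    delta = np.full_like(w, delta0)
    prev_sign = np.zeros_like(w)
    for _ in range(iters):
        g = grad_fn(w) + lam * np.sign(w)        # subgradient of lam * ||w||_1
        s = np.sign(g)
        same = prev_sign * s                     # >0: same sign as last step, <0: sign flip
        delta = np.where(same > 0, np.minimum(delta * eta_plus, delta_max), delta)
        delta = np.where(same < 0, np.maximum(delta * eta_minus, delta_min), delta)
        w -= s * delta                           # step size depends only on the gradient sign
        prev_sign = s
    return w

# toy usage: l1-regularized least squares on random data (hypothetical)
rng = np.random.default_rng(0)
A, b = rng.normal(size=(50, 10)), rng.normal(size=50)
w_hat = rprop_l1(lambda w: A.T @ (A @ w - b) / len(b), np.zeros(10))

The sign-only update is what makes Rprop insensitive to gradient magnitudes, which is part of its appeal for the high-dimensional log-linear models discussed above.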
ISBN: (Print) 9781457705380
The use of statically compiled search networks for ASR systems using huge vocabularies and complex language models often becomes challenging in terms of memory requirements. Dynamic network decoders introduce additional computations in favor of significantly lower memory consumption. In this paper we investigate the properties of two well-known search strategies for dynamic network decoding, namely history conditioned tree search and WFST-based search using dynamic transducer composition. We analyze the impact of the differences in search graph representation, search space structure, and language model look-ahead techniques. Experiments on an LVCSR task illustrate the influence of the compared properties.
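As a rough illustration of the dynamic transducer composition mentioned above, the sketch below expands the outgoing arcs of a composed state (A ∘ B) only when that state is reached during search, rather than building the full static network in advance. The dict-based transducer representation, the arc tuple layout, and the function name are assumptions; epsilon arcs, composition filters, and language model look-ahead, all essential in a real decoder, are omitted.

def compose_arcs(state_a, state_b, arcs_a, arcs_b):
    # On-demand expansion of one composed WFST state: match A's output labels
    # against B's input labels. Arcs are (in_label, out_label, weight, next_state).
    expanded = []
    for in_a, out_a, w_a, next_a in arcs_a.get(state_a, []):
        for in_b, out_b, w_b, next_b in arcs_b.get(state_b, []):
            if out_a == in_b:                    # labels must match to compose
                expanded.append((in_a, out_b, w_a + w_b, (next_a, next_b)))
    return expanded

# toy usage with two tiny transducers (hypothetical labels and weights)
arcs_a = {0: [("a", "x", 0.5, 1)]}
arcs_b = {0: [("x", "A", 0.2, 1), ("y", "B", 0.1, 2)]}
print(compose_arcs(0, 0, arcs_a, arcs_b))        # [('a', 'A', 0.7, (1, 1))]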
We have recently proposed an EM-style algorithm to optimize log-linear models with hidden variables. In this paper, we use this algorithm to optimize a hidden conditional random field, i.e., a conditional random field with hidden variables. Similar to hidden Markov models, the alignments are the hidden variables in the examples considered. Here, EM-style algorithms are iterative optimization algorithms which are guaranteed to improve the training criterion in each iteration without the need for tuning step sizes, sophisticated update schemes or numerical line optimization (with hardly predictable complexity). This is a rather strong property which conventional gradient-based optimization algorithms do not have. We present experimental results for a grapheme-to-phoneme conversion task and compare the convergence behavior of the EM-style algorithm with L-BFGS and Rprop.
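As a sketch of the training criterion such an algorithm optimizes (not of the EM-style update itself, which the abstract does not detail), the snippet below evaluates the marginal log-likelihood log p(y|x) of a log-linear model whose hidden variable is the alignment. The feature function, alignment enumerator, and class set are hypothetical placeholders, and the brute-force enumeration stands in for the dynamic programming a real implementation would use.

import numpy as np
from scipy.special import logsumexp

def hidden_loglinear_loglik(theta, feats, alignments, x, y, classes):
    # Hidden CRF criterion: log p(y|x) = log sum_a exp(theta . f(x, y, a)) - log Z(x),
    # where the alignment a is the hidden variable and Z(x) sums over all classes
    # and their alignments. Enumeration is only feasible for tiny toy examples.
    def score(label, align):
        return float(theta @ feats(x, label, align))
    numerator = logsumexp([score(y, a) for a in alignments(x, y)])
    log_z = logsumexp([score(c, a) for c in classes for a in alignments(x, c)])
    return numerator - log_z

# toy usage: two classes, two candidate alignments each (all values hypothetical)
feats = lambda x, c, a: np.array([len(x), float(a), 1.0 if c == "A" else 0.0])
alignments = lambda x, c: [0, 1]
theta = np.array([0.1, 0.2, 0.3])
print(hidden_loglinear_loglik(theta, feats, alignments, "abc", "A", ["A", "B"]))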
Conditional Random Fields (CRFs) are a state-of-the-art approach to natural language processing tasks like grapheme-to-phoneme (g2p) conversion, which is used to produce pronunciations or pronunciation variants for almost all ASR pronunciation lexica. One drawback of CRFs is that training requires an alignment between graphemes and phonemes, usually even a 1-to-1 alignment. The quality of the g2p result heavily depends on this alignment. Since these alignments are usually not annotated within the corpora, external models have to be used to produce such an alignment in a preprocessing step. In this work, we propose two approaches to integrate the alignment generation directly and efficiently into the CRF training process. Whereas the first approach relies on linear segmentation as a starting point, the second approach considers all possible alignments given certain constraints. Both methods have been evaluated on two English g2p tasks, namely NETtalk and Celex, on which state-of-the-art results have been reported in the literature. The proposed approaches lead to results comparable to the state of the art.
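To make the first approach's starting point concrete, the sketch below produces a linear segmentation of a phoneme sequence over the grapheme positions, i.e. a crude monotone initial alignment obtained without any external alignment model. The function name and the example word are made up for illustration.

def linear_segmentation(graphemes, phonemes):
    # Distribute the phonemes over the grapheme positions proportionally,
    # giving each grapheme a contiguous (possibly empty) phoneme span.
    n, m = len(graphemes), len(phonemes)
    alignment = []
    for i, g in enumerate(graphemes):
        lo, hi = i * m // n, (i + 1) * m // n    # phoneme span for grapheme i
        alignment.append((g, phonemes[lo:hi]))
    return alignment

# toy usage (hypothetical pronunciation)
print(linear_segmentation(list("mixing"), ["m", "ih", "k", "s", "ih", "ng"]))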
Does using our hands help us to add the value of a set of coins? We test the benefits and costs of direct interaction with a mental arithmetic task in a computerized yoked design in which groups of participants vary i...
We propose to improve speech recognition performance on speaker-independent, mixed-language speech by asymmetric acoustic modeling. Mixed language is either inter-sentential code switching from the source matrix language to a foreign language, or intra-sentential code mixing between the matrix language and embedded foreign words or phrases. In either case, the foreign phrases are pronounced by the matrix-language speaker with varying degrees of accent. Our proposed system, using selective decision tree merging between a bilingual model and an accented embedded speech model, outperforms the previous approaches of using a bilingual model with model retraining by 21.51% and of using adaptation by 15.88%. It outperforms all models on both code-mixing and code-switching cases. We successfully improved recognition of embedded foreign speech without degrading performance on the matrix-language speech.
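The abstract does not spell out the merging criterion. As one hedged illustration of selective merging, the sketch below keeps the bilingual model's tied-state Gaussian unless the corresponding accented-model Gaussian is close under a symmetric KL divergence, in which case the parameters are averaged; the threshold, the per-state dictionaries, and the averaging rule are all assumptions rather than the paper's method.

import numpy as np

def sym_kl_diag_gauss(mu1, var1, mu2, var2):
    # Symmetric KL divergence between two diagonal Gaussians.
    kl12 = 0.5 * np.sum(np.log(var2 / var1) + (var1 + (mu1 - mu2) ** 2) / var2 - 1.0)
    kl21 = 0.5 * np.sum(np.log(var1 / var2) + (var2 + (mu1 - mu2) ** 2) / var1 - 1.0)
    return kl12 + kl21

def selective_merge(bilingual_leaves, accented_leaves, threshold=5.0):
    # For each tied state keep the bilingual Gaussian, unless the accented
    # embedded-speech Gaussian is close enough, in which case average the two
    # (hypothetical merge rule for illustration only).
    merged = {}
    for state, (mu_b, var_b) in bilingual_leaves.items():
        if state in accented_leaves:
            mu_a, var_a = accented_leaves[state]
            if sym_kl_diag_gauss(mu_b, var_b, mu_a, var_a) < threshold:
                merged[state] = ((mu_b + mu_a) / 2, (var_b + var_a) / 2)
                continue
        merged[state] = (mu_b, var_b)
    return merged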
The length of the test speech greatly influences the performance of a GMM-UBM based text-independent speaker recognition system: when the length of valid speech is as short as 1–5 seconds, performance decreases significantly, because the GMM-UBM approach is a statistical method whose foundation is sufficient data. Considering that the use of text information can help speaker recognition, a multi-model method is proposed to improve short-utterance speaker recognition (SUSR) in Chinese. We build several phoneme-class models for each speaker to represent different parts of the characteristic space and fuse the scores of the test data on these models, with the aim of increasing the matching degree between the training models and the test utterance. Experimental results showed that the proposed method achieved a relative EER reduction of about 26% compared with the traditional GMM-UBM method.
Author:
Thomas Fang Zheng, Center for Speech and Language Technologies
Division of Technical Innovation and Development, Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University
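As a rough sketch of the multi-model scoring step described above, the snippet below fuses per-phoneme-class GMM-UBM log-likelihood-ratio scores of a short test utterance, weighting each class by how many test frames fall into it. The class inventory, the frame-to-class assignment, and the duration-weighted fusion are assumptions; the models are only assumed to expose a .score(X) method returning the mean frame log-likelihood (as, e.g., a trained sklearn GaussianMixture does).

import numpy as np

def fuse_phone_class_scores(test_feats, class_frames, speaker_models, ubm_models):
    # Score one speaker on a short utterance using per-phoneme-class models.
    # class_frames[c] holds the indices of the test frames assigned to class c.
    scores, weights = [], []
    for cls, idx in class_frames.items():
        if len(idx) == 0 or cls not in speaker_models:
            continue                             # skip classes unseen in this utterance
        x = test_feats[idx]
        llr = speaker_models[cls].score(x) - ubm_models[cls].score(x)
        scores.append(llr)
        weights.append(len(idx))                 # weight each class by its duration
    return np.average(scores, weights=weights)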
Speaker clustering is an important step in multi-speaker detection tasks, and its performance directly affects speaker detection performance. It is observed that the shorter the average length of single-speaker speech segments after segmentation, the worse the performance of the subsequent speaker recognition. A reasonable path to better multi-speaker detection performance is therefore to enlarge the average length of post-segmentation single-speaker segments, which is equivalent to clustering as many segments from the same speaker into one as possible. In other words, the average class purity of each speaker segment should be as high as possible. Accordingly, a speaker-clustering algorithm based on the class-purity criterion is proposed, in which a Reference Speaker Model (RSM) scheme is adopted to calculate the distance between speech segments, and maximal class purity, or equivalently minimal within-class dispersion, is taken as the criterion. Experiments on the NIST SRE 2006 database showed that, compared with the conventional Hierarchical Agglomerative Clustering (HAC) algorithm, for speech segments with average lengths of 2, 5, and 8 seconds, the proposed algorithm increased the valid class speech length by 2.7%, 3.8%, and 4.6%, respectively, and increased the target speaker detection recall rate by 7.6%, 6.2%, and 5.1%, respectively.
Author:
Thomas Fang Zheng, Center for Speech and Language Technologies
Division of Technical Innovation and Development, Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University
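To make the RSM distance and the within-class-dispersion criterion concrete, the sketch below represents each segment as its vector of scores against a set of reference speaker models and then merges clusters greedily so that total within-class dispersion grows as little as possible. The greedy procedure, the score normalization, and the fixed target number of clusters are illustrative assumptions rather than the paper's exact algorithm.

import numpy as np

def rsm_embedding(segment_feats, reference_models):
    # Represent a segment by its scores against reference speaker models
    # (models assumed to expose a .score(X) method), length-normalized.
    v = np.array([m.score(segment_feats) for m in reference_models])
    return v / np.linalg.norm(v)

def cluster_by_purity(embeddings, n_clusters):
    # Greedy agglomeration: always merge the pair of clusters whose union adds
    # the least within-class dispersion (a proxy for maximal class purity).
    clusters = [[i] for i in range(len(embeddings))]
    def dispersion(idx):
        pts = embeddings[idx]
        return float(np.sum((pts - pts.mean(axis=0)) ** 2))
    while len(clusters) > n_clusters:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                cost = (dispersion(clusters[a] + clusters[b])
                        - dispersion(clusters[a]) - dispersion(clusters[b]))
                if best is None or cost < best[0]:
                    best = (cost, a, b)
        _, a, b = best
        clusters[a] += clusters[b]
        del clusters[b]
    return clusters

# toy usage: four segment embeddings, clustered into two groups (hypothetical values)
emb = np.array([[0.9, 0.1], [0.85, 0.15], [0.1, 0.9], [0.2, 0.8]])
print(cluster_by_purity(emb, 2))                 # [[0, 1], [2, 3]]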
Performance degradation over time is a generally acknowledged phenomenon in speaker recognition, and it is widely assumed that speaker models should be updated from time to time to maintain representativeness. However, this is costly, user-unfriendly, and sometimes unrealistic, which hinders the technology in practical applications. From a pattern recognition point of view, the time-varying issue in speaker recognition calls for features that are speaker-specific and as stable as possible across sessions recorded at different times. Therefore, after searching for and analyzing the most stable parts of the feature space, a Discrimination-emphasized Mel-frequency-warping method is proposed. In implementation, each frequency band is assigned a discrimination score that takes into account both speaker and session information, and Mel-frequency warping is applied in feature extraction to emphasize bands with higher scores. Experimental results show that on the time-varying voiceprint database, this method not only improves speaker recognition performance, with an EER reduction of 19.1%, but also alleviates the performance degradation caused by time variation, with a reduction of 8.9%.
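As one way to picture the warping step, the sketch below reallocates the Mel axis so that bands with higher discrimination scores occupy a wider share of it, and returns a piecewise-linear warping function. The band layout, the score values, and this particular reallocation rule are assumptions; the abstract does not specify how the scores are turned into a warp.

import numpy as np

def discrimination_warp(band_edges_mel, band_scores):
    # Reallocate the Mel range in proportion to score-weighted band widths and
    # return a piecewise-linear warp from the original to the emphasized axis.
    scores = np.asarray(band_scores, dtype=float)
    widths = np.diff(band_edges_mel)
    weighted = widths * scores
    new_widths = weighted / weighted.sum() * (band_edges_mel[-1] - band_edges_mel[0])
    new_edges = band_edges_mel[0] + np.concatenate(([0.0], np.cumsum(new_widths)))
    return lambda mel: np.interp(mel, band_edges_mel, new_edges)

# toy usage: 24 Mel bands, pretending the middle bands score highest (hypothetical)
edges = np.linspace(0.0, 2840.0, 25)
scores = np.ones(24); scores[8:16] = 2.0
warp = discrimination_warp(edges, scores)
print(warp(edges[8]), warp(edges[16]))           # ~710 and ~2130: the high-score region widens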