Speech communication involves several steps: production (encoding), transmission, and hearing (decoding). At every step, acoustic and static distortions are inevitably introduced by differences of gender, age, microphone, room, line, auditory characteristics, etc. Despite these variations, human listeners extract linguistic information from speech as easily as if the variations did not disturb the communication at all. One may hypothesize that listeners modify their internal acoustic models whenever a speaker, room, microphone, or line changes. Alternatively, one may hypothesize that the linguistic information in speech can be represented separately from the extra-linguistic factors. In this study, inspired by the behaviors of infants and animals, our solution to these intrinsic and inevitable variations in speech is described [1,2,3]. Speech structures, invariant to these variations, are derived as completely transform-invariant features [4], and their linguistic and psychological validity is discussed here. Further, speech applications of ASR [3] and CALL [5] using the structures are shown, in which extremely robust performance against speaker variability is obtained with speech structures.
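The transform-invariance idea can be illustrated with a small sketch: if each speech event is modeled as a Gaussian in cepstral space, the matrix of pairwise Bhattacharyya distances between events is unchanged by any invertible affine transform of the feature space (a common model of speaker- and channel-induced distortion). The Gaussian modeling and this particular distance are illustrative assumptions, not necessarily the exact formulation of the cited work.

```python
import numpy as np

def bhattacharyya(mu1, cov1, mu2, cov2):
    """Bhattacharyya distance between two Gaussians; invariant under
    any invertible affine map x -> A x + b applied to both."""
    cov = 0.5 * (cov1 + cov2)
    diff = mu1 - mu2
    term1 = 0.125 * diff @ np.linalg.solve(cov, diff)
    term2 = 0.5 * np.log(np.linalg.det(cov) /
                         np.sqrt(np.linalg.det(cov1) * np.linalg.det(cov2)))
    return term1 + term2

def structure(means, covs):
    """Speech 'structure': symmetric matrix of pairwise event distances."""
    n = len(means)
    D = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            D[i, j] = D[j, i] = bhattacharyya(means[i], covs[i],
                                              means[j], covs[j])
    return D
```

Because every entry of the matrix survives an affine distortion of the whole feature space, the matrix itself serves as a speaker- and channel-invariant representation.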
A three-dimensional (3D) physiological articulatory model has been developed to account for effects of the biomechanical properties of the speech organs in speech production [1]. To control the model and investigate the mechanism of speech production, an efficient control module is necessary to estimate the muscle activation patterns that drive the 3D physiological articulatory model toward a desired articulatory posture. For this purpose, a feedforward control strategy is elaborated that maps an articulatory target to its corresponding muscle activation pattern via an intrinsic representation of vowel articulation. In this process, the articulatory postures are first mapped to their corresponding intrinsic representations; second, the articulatory postures are clustered in the space of intrinsic representations; third, for each cluster, a nonlinear function mapping the intrinsic representation of vowel articulation to a muscle activation pattern is approximated using a General Regression Neural Network (GRNN). The results show that the proposed feedforward control module can drive the 3D physiological articulatory model for vowel production with high accuracy, both acoustically and articulatorily.
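A GRNN is, in essence, Nadaraya-Watson kernel regression: the output for a query is a Gaussian-weighted average of the stored training outputs. The minimal sketch below shows such a mapping from an intrinsic representation to a muscle activation vector; the dimensionalities and the bandwidth are made-up illustration values, not the published model's.

```python
import numpy as np

class GRNN:
    """General Regression Neural Network (Nadaraya-Watson kernel
    regression) with an isotropic Gaussian kernel of width sigma."""
    def __init__(self, sigma=0.1):
        self.sigma = sigma

    def fit(self, X, Y):
        # X: (n, d_in) intrinsic representations
        # Y: (n, d_out) muscle activation patterns
        self.X = np.asarray(X, float)
        self.Y = np.asarray(Y, float)
        return self

    def predict(self, x):
        d2 = np.sum((self.X - x) ** 2, axis=1)
        w = np.exp(-d2 / (2 * self.sigma ** 2))
        return w @ self.Y / w.sum()
```

With a small bandwidth the network reproduces the training pairs almost exactly; a larger bandwidth smooths across neighboring postures, which is why the abstract's per-cluster training keeps each mapping locally simple.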
This paper introduces a speech-to-singing synthesis system, called SingBySpeaking, which can synthesize a singing voice given a speaking voice reading the lyrics of a song and its musical score. The system is based on the speech manipulation system STRAIGHT and comprises four models controlling three acoustic parameters: the fundamental frequency (F0), phoneme duration, and spectrum. Given the musical score and its tempo, the F0 control model generates the F0 contour of the singing voice by controlling four types of F0 fluctuations: overshoot, vibrato, preparation, and fine fluctuation. The duration control model lengthens the duration of each phoneme in the speaking voice by taking into consideration the duration of its musical note. The spectral control model converts the spectral envelope of the speaking voice into that of the singing voice by controlling both the singing formant and the amplitude modulation of formants in synchronization with vibrato. SingBySpeaking enables us to synthesize natural singing voices merely by reading the lyrics of a song and to better understand differences between speaking and singing voices.
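Overshoot-style F0 fluctuations are commonly approximated by a damped second-order system responding to the step-wise note target, with vibrato added as a sinusoidal modulation. The sketch below uses that textbook approximation with illustrative parameter values (damping, natural frequency, vibrato depth/rate), not the published model's:

```python
import numpy as np

def f0_contour(target_cents, dur_s, fs=200, zeta=0.6, omega=40.0,
               vib_depth=30.0, vib_rate=6.0):
    """F0 contour (in cents, relative to the previous note) for one note:
    underdamped second-order response to the note target (overshoot),
    plus sinusoidal vibrato. All parameter values are illustrative."""
    n = int(dur_s * fs)
    t = np.arange(n) / fs
    y = np.zeros(n)
    v = 0.0
    for i in range(1, n):                      # semi-implicit Euler
        a = omega ** 2 * (target_cents - y[i - 1]) - 2 * zeta * omega * v
        v += a / fs
        y[i] = y[i - 1] + v / fs
    return t, y + vib_depth * np.sin(2 * np.pi * vib_rate * t)
```

With zeta < 1 the contour first shoots past the target before settling, which is the "overshoot" behavior the F0 control model reproduces at note transitions.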
Our goal is to represent commonsense knowledge as computational models, which are applied to spoken dialogue systems that realize smart man-machine communication by correctly understanding speakers' intentions and emotions. For this purpose, we have constructed a multimodal speech behavior corpus that includes metadata annotated from various viewpoints, such as utterances, actions, emotions, and thinking, for analyzing behavioral factors in thinking processes from various perspectives in everyday life. This paper describes a methodology for modeling thinking processes in problem solving based on child development, by analyzing the multimodal interaction data stored in the corpus. We especially focus on demonstrative-expression behavior, which serves as a signal when communicating with other people. We formulated a hypothesis on the developmental process in children that links physical expression skills with mental situations such as attentive ability and sociality. Based on this hypothesis, we constructed a demonstrative expression model that is used to visualize actual scenes. The results of our analysis show that the proposed method enables in-depth analysis of thinking processes in demonstrative-expression behavior.
Thai language processing has been investigated since the early 1980s; however, the performance of current applications that employ human language technologies is still far from market expectations. A shortage of shared resources and the lack of standard evaluation guidelines are among the causes that hinder progress in this field. As a national research organization of Thailand, the National Electronics and Computer Technology Center (NECTEC) has taken responsibility for developing and sharing language resources for research and education purposes since 1997. For speech processing research, a variety of speech corpora are necessary, as the acoustic characteristics of speech signals, the conditions of input channels, and the application domains are diverse. As with other major languages, our development started from a read-speech corpus collected in a controlled environment. The subsequent distributions, however, aim more toward spontaneous speech collected in real environments, such as telephone speech and broadcast-news speech. In addition, we design and construct speech corpora that cover extensive acoustic events, such as phone sequences and intonation patterns, suitable for speech analysis and speech synthesis research. A larger collection of speech corpora will help drive speech technology research in Thailand toward real-world applications. NECTEC has also taken the initiative in setting up standards for various issues in Thai language processing. Together with experts from various universities and organizations, we have organized BEST (Benchmark for Enhancing the Standard of Thai language processing), a series of contests on Thai language processing tasks such as word segmentation and named-entity recognition. With a standard evaluation protocol and annotation guidelines, along with a large amount of annotated data, the BEST events can help accelerate the progress of Thai language processing technologies through knowledge and resource sharing and benchmarking.
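Word segmentation contests are typically scored by word-level precision, recall, and F1 over aligned character spans: a predicted word counts as correct only if both its boundaries match the gold segmentation. A minimal scorer under that common convention (the exact BEST protocol may differ in detail):

```python
def seg_spans(words):
    """Character (start, end) spans implied by a segmentation."""
    spans, pos = set(), 0
    for w in words:
        spans.add((pos, pos + len(w)))
        pos += len(w)
    return spans

def seg_f1(gold, pred):
    """Word-level F1: a predicted word is correct iff its span
    exactly matches a gold-word span."""
    g, p = seg_spans(gold), seg_spans(pred)
    tp = len(g & p)
    if tp == 0:
        return 0.0
    prec, rec = tp / len(p), tp / len(g)
    return 2 * prec * rec / (prec + rec)
```

Both segmentations must cover the same character string; comparing spans rather than word strings makes repeated words unambiguous.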
In China, there are many different dialects and sub-dialects. Because there are grammatical, lexical, phonological, and phonetic differences among them in varying degrees, people from different dialect regions often have difficulty in oral communication. Since 1956, standard Mandarin has been popularized across the country as the official language, and almost every dialect speaker has begun to learn Mandarin as a second language. But affected by their native dialects, many of them speak Mandarin with regional accents. In modern speech processing technologies, speech is represented by its spectrum, which contains not only the dialectal linguistic information but also extra-linguistic information such as the gender and age of the speaker. To focus exclusively on the linguistic features of dialectal utterances, a speaker-invariant structural representation of speech, originally proposed by the second author inspired by infants' language acquisition [1, 2], is proposed to represent the pronunciation of Chinese dialect speakers. Since purely dialectal information can be extracted by removing the extra-linguistic information from dialect speech, this pronunciation structure can be applied to estimate which dialect or sub-dialect region a speaker belongs to and to assess pronunciation. To verify the validity of our approach, speaker classification based on dialectal utterances of 38 Chinese finals is investigated, especially in terms of robustness to speaker variability. The result is linguistically reasonable and highly independent of age and gender. After that, a sub-dialect corpus is developed with a list of characters as reading materials, originally used for linguists' investigation of dialect speakers' pronunciation. Then, after a sub-dialect pronunciation structure is built for every speaker, their pronunciations are classified based on the distances among their structures. The result shows that the sub-dialect ...
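Classifying speakers "based on the distances among their structures" can be sketched simply: each pronunciation structure is a symmetric matrix of pairwise event distances, its upper triangle is vectorized, and a speaker is assigned the label of the nearest reference structure. The labels, matrix sizes, and values below are toy illustrations, not data from the study:

```python
import numpy as np

def structure_vector(D):
    """Upper-triangle entries of a structure matrix as a feature vector."""
    iu = np.triu_indices_from(D, k=1)
    return D[iu]

def structure_distance(D1, D2):
    """Euclidean distance between two speakers' structures."""
    return np.linalg.norm(structure_vector(D1) - structure_vector(D2))

def classify(query, refs):
    """Nearest-structure classification.
    refs: dict mapping a (sub-)dialect label to a reference structure."""
    return min(refs, key=lambda label: structure_distance(query, refs[label]))
```

Because the structure matrices themselves carry no absolute spectral values, this comparison operates only on the relative geometry of the speech events, which is what makes it robust to speaker variability.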