We discuss the evaluation of the RPI language identification system by the National Institute of Standards and Technologies (NIST). This system is based on an acousto-phonetic approach where the phonemes present in a ...
详细信息
We discuss the evaluation of the RPI language identification system by the National Institute of Standards and Technologies (NIST). This system is based on an acousto-phonetic approach where the phonemes present in a language are identified by a hidden semi-Markov model (HSMM). The HSMM was also developed at RPI. knowledge of these phonemes provides us with the necessary probabilistic framework for classifier design. The classifier used in this system is designed in such a way that language specific scores generated during an evaluation form a random walk. Random walk theory has extensive applications in ecology, metallurgy, chemistry and physics. Until recently random walk theory has been primarily used as a tool for the measurement of the territory covered by a diffusing particle. We now show that random walk theory can be used to effectively design a language identification system.
VIENA Classroom is a distance education system in which the teaching material is prepared as hypermedia documents and presented to the students within a CSCW environment. By applying languageengineering techniques to...
详细信息
ISBN:
(纸本)354061656X
VIENA Classroom is a distance education system in which the teaching material is prepared as hypermedia documents and presented to the students within a CSCW environment. By applying languageengineering techniques to the question support facilities of the system we create a multimodal naturallanguage interface so that the students can formulate their questions directly in Japanese. Based on the computed semantic representations the questions are either answered by accessing a FAQ knowledge base or collected and transferred to the teacher for later processing. As valuable assistance for formulating questions we provide the possibility to browse through automatically generated FAQ lists. languageengineering is performed in an integrated framework by utilizing deductive object-oriented database technology.
A new automatic speech segmentation procedure, called the "Blind" speech segmentation, is presented. This procedure allows a speech sample to be segmented into sub-word units without the knowledge of any lin...
详细信息
A new automatic speech segmentation procedure, called the "Blind" speech segmentation, is presented. This procedure allows a speech sample to be segmented into sub-word units without the knowledge of any linguistic information (such as, orthographic or phonetic transcription). Hence, this procedure involves finding the optimal number of sub-word segments in the given speech sample, before locating the sub-word segment boundaries.
The authors analyzed the relationship between semantics of utterances and movements of the head in a natural dialogue and a task oriented dialogue in Japanese. They show that visual information such as head movement w...
详细信息
The authors analyzed the relationship between semantics of utterances and movements of the head in a natural dialogue and a task oriented dialogue in Japanese. They show that visual information such as head movement will be useful for managing a dialogue and reducing the vagueness of semantics. First they extract the head movements calculated automatically in a natural conversation and indicate the role of this. After this they present an analysis of head movements during a cooperative problem solving task to construct a natural dialogue system in which the initiative of the conversation moves. They show the effectiveness of using visual information in a multimodal dialogue system.
The SoftDocs project at Concordia University investigates knowledge acquisition from software documents and the analysis of that knowledge for reverse engineering of legacy systems. It focusses on the recognition and ...
详细信息
The paper presents an approach to using prosodic information for the integration of acoustic and linguistic knowledge in continuous Mandarin speech with very large vocabulary. Since the overhead computation incurred f...
详细信息
ISBN:
(纸本)0780335554
The paper presents an approach to using prosodic information for the integration of acoustic and linguistic knowledge in continuous Mandarin speech with very large vocabulary. Since the overhead computation incurred from unification of search space is confined to the syllable boundaries, the use of prosodic information to reduce the syllable boundary hypotheses as well as the syllable matching length, is shown to be effective. The inherent complexity with the very large vocabulary is also reduced by the use of phrase boundary hypotheses conjectured via the phrase-final lengthening. Experimental results show a 47.2% recognition time save with only a 5.67% error rate increase using the syllable and phrase boundary hypotheses conjectured from prosodic information.
This paper presents a fast and accurate recognition of continuous Mandarin speech with very large vocabulary using an improved segmental probability model (SPM) approach. In order to extensively utilize the acoustic a...
详细信息
ISBN:
(纸本)0780331923
This paper presents a fast and accurate recognition of continuous Mandarin speech with very large vocabulary using an improved segmental probability model (SPM) approach. In order to extensively utilize the acoustic and linguistic knowledge to further improve the recognition performance, a few special techniques are thus developed. Preliminary simulation results show that the final achievable rate for the base syllable recognition with the improved segmental probability modeling is as high as 91.62%, which indicates a 18.48% error rate reduction and more than 3 times faster than the well-studied sub-syllable-based CHMM. Also, a tone recognizer and a word-based Chinese language model are included and the achieved recognition accuracy for the final decoded Chinese characters is 92.10%.
In the work described, we automatically deduce dialogue structures from a corpus with probabilistic methods. Each utterance in the corpus is annotated with a speaker label and an utterance type called IFT (illocutiona...
详细信息
In the work described, we automatically deduce dialogue structures from a corpus with probabilistic methods. Each utterance in the corpus is annotated with a speaker label and an utterance type called IFT (illocutionary force type). We use an ergodic HMM (hidden Markov model) and the ALERGIA algorithm, an algorithm for learning probabilistic automata by means of state merging, to model the speaker-IFT sequences. Our experiments successfully extract typical dialogue structures such as turn-taking and speech act sequencing.
Fundamentals of fuzzy knowledge base for image understanding are dealt with. It consists of the data-word transformation part (to transform numerical data, derived from image processing, into words) and answer generat...
详细信息
ISBN:
(纸本)0780324625
Fundamentals of fuzzy knowledge base for image understanding are dealt with. It consists of the data-word transformation part (to transform numerical data, derived from image processing, into words) and answer generation part (to answer about objects and phenomena in the image world). Ambiguous recognition results are handled by a fuzzy matching and a fuzzy frame knowledge-based system. By applying fuzzy IF - THEN rules, the process of image understanding can be made independent of weather conditions and daylight changing. This system also employs user friendly information retrieval system by applying naturallanguage instructions.
naturallanguage and diagrams of various types are commonly used for specifications on digital systems containing descriptions and requirements. The objective of the research reported here is to develop an algorithm f...
详细信息
ISBN:
(纸本)3540601619
naturallanguage and diagrams of various types are commonly used for specifications on digital systems containing descriptions and requirements. The objective of the research reported here is to develop an algorithm for the automatic integration of requirements using schemata to aid in the automatic detection and joining of common references (coreferences) to objects in naturallanguage and other specifications. This paper describes a rule-based algorithm for the integration of requirements which are expressed as conceptual graphs. The algorithm uses design knowledge in the form of schemas to detect coreferences and perform joins. The algorithm is demonstrated by a small example.
暂无评论