Pronunciation errors are often made by language learners. Especially, systematic mispronunciations, consisting of substitutions of native sounds for sounds of the target language that do not exist in the native langua...
详细信息
ISBN:
(纸本)9781538634226
Pronunciation errors are often made by language learners. Especially, systematic mispronunciations, consisting of substitutions of native sounds for sounds of the target language that do not exist in the native language, are considered a big problem for language leaners. Therefore, automatic detection of this kind of errors is essential to building a computer-assistedlanguagelearning (CALL) system supporting language learners to improve their pronunciation. In this research, we focused on detecting systematic pronunciation errors made by Vietnamese learners of English. To this end, we used SVM classifiers, which are trained by a native corpuses (TIMIT) and a non-native corpus (V.E Corpus). The non-native corpus, constructed by the researchers and annotated by two Vietnamese trained professionals, includes 1550 utterances from 31 Vietnamese students. Each of the students was asked to read 50 English sentences designed to contain English phonemes frequently mispronounced by Vietnamese speakers. The experimental results showed that the detectors can achieve at least 79% SAR and 10% FAR.
Speaking coherently in a second language requires knowledge and execution of pronunciation. Often, pronunciation training is seldom emphasized in languagelearning. However, explicitly teaching pronunciation can incre...
详细信息
Speaking coherently in a second language requires knowledge and execution of pronunciation. Often, pronunciation training is seldom emphasized in languagelearning. However, explicitly teaching pronunciation can increase confidence and attitudes towards pronunciation in learners. We present a visual feedback system for vowels using a speaker calibrated vowel chart. The demo includes vowel chart calibration, an interactive tutorial for learning how to read a vowel chart, and a practice page for three vowel minimal pairs.
In spoken communication, intonation often conveys meaning of an utterance. Thus, incorrect intonation, typically made by second language (L2) learners, could result in miscommunication. We, in this work, consider the ...
详细信息
ISBN:
(纸本)9781538682357
In spoken communication, intonation often conveys meaning of an utterance. Thus, incorrect intonation, typically made by second language (L2) learners, could result in miscommunication. We, in this work, consider the problem of automatically detecting the intonation of British English (BE) utterances which could be useful for providing feedback to the L2 learners. Typically, in BE, the meaning is conveyed through four intonation classes Glide-up, Glide-down, Dive and Takeoff. We hypothesize that these classes could be discriminated using temporal structure in utterance-level pitch patterns. These patterns could be represented by either stylized pitch or tones from automatic tone and break indices (AuToBI) tool. We model these temporal structures for the intonation classification using three techniques, namely, n-gram, deep neural network and long short term memory recurrent networks. Experiments are conducted on the speech data collected from a spoken English training material for teaching intonation of BE. We obtain better unweighted average recall (UAR) with the proposed schemes compared to the baseline scheme, that does not exploit temporal structure in the utterance-level pitch patterns. Among different proposed schemes, the highest absolute improvement in the UAR is found to be 933% over the baseline scheme.
Processing children's speech is challenging due to high speaker variability arising from vocal tract size and scarce amounts of publicly available linguistic resources. In this work, we tackle such challenges by p...
详细信息
ISBN:
(纸本)9781713820697
Processing children's speech is challenging due to high speaker variability arising from vocal tract size and scarce amounts of publicly available linguistic resources. In this work, we tackle such challenges by proposing an unsupervised feature adaptation approach based on adversarial multi-task training in a neural framework. A front-end feature transformation module is positioned prior to an acoustic model trained on adult speech (1) to leverage on the readily available linguistic resources on adult speech or existing models, and (2) to reduce the acoustic mismatch between child and adult speech. Experimental results demonstrate that our proposed approach consistently outperforms established baselines trained on adult speech across a variety of tasks ranging from speech recognition to pronunciation assessment and fluency score prediction.
The Japanese language have a great number of onomatopoeic expressions, which is one of the key characteristics of this language as well as other East-Asian languages like Korean and Indonesian. When a foreigner learns...
详细信息
ISBN:
(纸本)9781467327435;9781467327428
The Japanese language have a great number of onomatopoeic expressions, which is one of the key characteristics of this language as well as other East-Asian languages like Korean and Indonesian. When a foreigner learns Japanese, thus, it is important to master them, but, due to its quite subjective nature, many feel great difficulty, and it is indeed the case with foreigners who wants to work in Japan. However they are often neglected in rapid Japanese teaching, and a supportive e-learning system for mastering Japanese onomatopoeic expressions is desirable. Based on linguistic reconsideration on Japanese onomatopoeic expressions, we are developing an online e-learning system of Japanese onomatopoeic expressions for foreign workers. Our system aims to present not only explanation of individual onomatopoeic expressions but various contextual information, and to offer a question-answer communication device that enables learners to have a quick, probable answer with relevant examples found in online data and instructors to explain the expression afterwards. Evaluation of our prototype system by foreign Japanese learners indicates that using this kind of e-learning system is suitable for complementing Japanese learning at school.
There has been a rapid increase in the availability of computerassisted instruction (CAI) software for teaching oral language skills. Despite the growing popularity of CAI in education, such an approach to language t...
详细信息
There has been a rapid increase in the availability of computerassisted instruction (CAI) software for teaching oral language skills. Despite the growing popularity of CAI in education, such an approach to language teaching fragments and isolates languagelearning from the context of its use and conflicts with current theory and research in language development and learning. The greatest potential for microcomputers in languagelearning may be as a medium for increasing student opportunities for using language by bringing students and teachers together around a shared activity. .overlined { text-decoration: overline; } .struck { text-decoration:line-through; } .underlined { text-decoration:underline; } .doubleUnderlined { text-decoration:underline;border-bottom:1px solid #000; } Enhanced Article (HTML) Get PDF (406K)Get PDF (406K) More content like thisFind more content: like this articleFind more content written by: Curt Dudley-Marling Dennis Searle All Authors
作者:
Yan, KeChina Acad Engn Phys
New Generat Informat Technol Ctr Inst Comp Applicat Mianyang Peoples R China
Posterior probability measure is widely accepted as the most promising feature for automatic pronunciation quality evaluation. However, this measure is not phonetically consistent. This work presents a novel trainable...
详细信息
ISBN:
(纸本)9781479945658
Posterior probability measure is widely accepted as the most promising feature for automatic pronunciation quality evaluation. However, this measure is not phonetically consistent. This work presents a novel trainable phone-dependent transformation of posterior probability to deal with the problem. Both linear and non-linear transforms are investigated. Close form solution is found for linear transformation and gradient-based method is derived for nonlinear transformation. Experimental results on the database of 3685 people showed significant improvement. The cross-correlation between human and machine scores increases from 0.582 to 0.760.
computer-based training can be effective in improving second language learners' perceptions and productions of segmental speech contrasts. However, because most previous studies have addressed specific theoretical...
详细信息
This paper considers the use of pre- and post-test results as formative evaluation tools and describes a small group evaluation of courseware dealing with French for banking. The results highlight the problems that oc...
详细信息
This paper considers the use of pre- and post-test results as formative evaluation tools and describes a small group evaluation of courseware dealing with French for banking. The results highlight the problems that occurred in the course of the evaluation process and suggest that test results should be interpreted with caution.
Sentence-level writing errors seem immune to many of the feedback forms devised over the years, apart from the slow accumulation of examples from the environment itself, which second language (L2) learners gradually n...
详细信息
暂无评论