版权所有:内蒙古大学图书馆 技术提供:维普资讯• 智图
内蒙古自治区呼和浩特市赛罕区大学西街235号 邮编: 010021
作者机构:Instituto de Telecomunicações-polo de Coimbra Coimbra Portugal Department of Electrical and Computer Engineering FCTUC Universidade de Coimbra Coimbra Portugal
出 版 物:《Journal of the Brazilian Computer Society》 (J. Braz. Comput. Soc.)
年 卷 期:2013年第19卷第2期
页 面:127-134页
核心收录:
学科分类:0711[理学-系统科学] 08[工学] 0714[理学-统计学(可授理学、经济学学位)] 0811[工学-控制科学与工程] 0701[理学-数学] 0812[工学-计算机科学与技术(可授工学、理学学位)]
基 金:The two first authors acknowledge Instituto de Telecomunicações (Arlindo Veiga) and Science and Technology Foundation-FCT (Sara Candeias SFRH/ BPD/36584/2007) for their scholarships. This work was also fundedby FCT under the Project (PTDC/CLE-LIN/11 2411/2009) and partially supported by FCT (Instituto de Telecomunicações multiannual funding PEst-OE/EEI/ LA0008/2011)
主 题:Stochastic models
摘 要:This paper addresses the problem of grapheme to phoneme conversion to create a pronunciation dictionary from a vocabulary of the most frequent words in European Portuguese. A system based on a mixed approach funded on a stochastic model with embedded rules for stressed vowel assignment is described. The implemented model can generate pronunciations from unrestricted words;however, a dictionary with the 40k most frequent words was constructed and corrected interactively. The dictionary includes homographs with multiplepronunciations. The vocabulary was defined using the CETEMPúblico corpus. The model and dictionary are publicly available. © 2012 The Brazilian Computer Society.