版权所有:内蒙古大学图书馆 技术提供:维普资讯• 智图
内蒙古自治区呼和浩特市赛罕区大学西街235号 邮编: 010021
作者机构:Division of Computer Science Department of Electrical Engineering & Computer Science Korea Terminology Research Center for Language and Knowledge Engineering (KORTERM) Advanced Information Technology Research Center (AITrc) Korea Advanced Institute of Science and Technology (KAIST) 373-1 Kusong-dong Yusong-gu Taejon 305-701 Korea
出 版 物:《International Journal of Computer Processing of Languages》
年 卷 期:2001年第14卷第2期
页 面:109-131页
学科分类:08[工学] 0812[工学-计算机科学与技术(可授工学、理学学位)]
主 题:Information Retrieval Korean Information Retrieval Transliteration Back-Transliteration Word Mismatch Problem Foreign Word
摘 要:In Korean text these days, the use of English words with or without phonetic translations are growing at a high speed. To make matters worse, the Korean transliteration of an English word may vary greatly. The mixed use of English words and their various transliterations in the same document or document collection may cause severe word mismatch problems in Korean information retrieval. There are two possible approaches to tackle this problem: transliteration and back-transliteration method. We argue that our newly proposed transliteration approach is more advantageous for the resolution of the word mismatch problem than the previously proposed back-transliteration approach. Our information retrieval experiment results support this argument.