咨询与建议

看过本文的还看了

相关文献

该作者的其他文献

文献详情 >Bitext Alignment 收藏

Bitext Alignment

双文本对齐

丛 书 名:Synthesis Lectures on Human Language Technologies

版本说明:1

作     者:Tiedemann, Jorg 

I S B N:(纸本) 9781608455102 

出 版 社:Morgan & Claypool Publishers 

出 版 年:2011年

主 题 词:language technologies human lectures alignment synthesis bitext 

学科分类:08[工学] 0812[工学-计算机科学与技术(可授工学、理学学位)] 

摘      要:This book provides an overview of various techniques for the alignment of bitexts. It describes general concepts and strategies that can be applied to map corresponding parts in parallel documents on various levels of granularity. Bitexts are valuable linguistic resources for many different research fields and practical applications. The most predominant application is machine translation, in particular, statistical machine translation. However, there are various other threads that can be followed which may be supported by the rich linguistic knowledge implicitly stored in parallel resources. Bitexts have been explored in lexicography, word sense disambiguation, terminology extraction, computer-aided language learning and translation studies to name just a few. The book covers the essential tasks that have to be carried out when building parallel corpora starting from the collection of translated documents up to sub-sentential alignments. In particular, it describes various approaches to document alignment, sentence alignment, word alignment and tree structure alignment. It also includes a list of resources and a comprehensive review of the literature on alignment techniques. Table of Contents: Introduction / Basic Concepts and Terminology / Building Parallel Corpora / Sentence Alignment / Word Alignment / Phrase and Tree Alignment / Concluding Remarks

读者评论 与其他读者分享你的观点

用户名:未登录
我的评分