咨询与建议

看过本文的还看了

相关文献

该作者的其他文献

文献详情 >Building and Using Comparable ... 收藏

Building and Using Comparable Corpora for Multilingual Natural Language Processing

丛 书 名:Synthesis Lectures on Human Language Technologies

版本说明:1

作     者:Serge Sharoff Reinhard Rapp Pierre Zweigenbaum 

I S B N:(纸本) 9783031313837;9783031313868 

出 版 社:Springer Cham 

出 版 年:1000年

页      数:VIII, 133页

主 题 词:Natural Language Processing (NLP) Artificial Intelligence Computer Applications Computer Science, general Computational Linguistics Machine Learning 

摘      要:This book provides a comprehensive overview of methods to build comparable corpora and of their applications, including machine translation, cross-lingual transfer, and various kinds of multilingual natural language processing. The authors begin with a brief history on the topic followed by a comparison to parallel resources and an explanation of why comparable corpora have become more widely used. In particular, they provide the basis for the multilingual capabilities of pre-trained models, such as BERT or GPT. The book then focuses on building comparable corpora, aligning their sentences to create a database of suitable translations, and using these sentence translations to produce dictionaries and term banks. Then, it is explained how comparable corpora can be used to build machine translation engines and to develop a wide variety of multilingual applications.

读者评论 与其他读者分享你的观点

用户名:未登录
我的评分