版权所有:内蒙古大学图书馆 技术提供:维普资讯• 智图
内蒙古自治区呼和浩特市赛罕区大学西街235号 邮编: 010021
丛 书 名:Synthesis Lectures on Human Language Technologies
I S B N:(纸本) 9781636390864;9781636390888
出 版 社:Morgan & Claypool
出 版 年:2021年
主 题 词:bisac category: computers \/ artificial intelligence \/ natural language processing \/ computers \/ data science \/ machine learning \/ language arts \u0026 disciplines \/ linguistics \/ general
学科分类:0501[文学-中国语言文学] 0303[法学-社会学] 050102[文学-语言学及应用语言学] 03[法学] 030303[法学-人类学] 05[文学] 08[工学] 0812[工学-计算机科学与技术(可授工学、理学学位)]
摘 要:Opportunity and Curiosity find similar rocks on Mars. One can generally understand this statement if one knows that Opportunity and Curiosity are instances of the class of Mars rovers, and recognizes that, as signalled by the word on, ROCKS are located on Mars. Two mental operations contribute to understanding: recognize how entities/concepts mentioned in a text interact and recall already known facts (which often themselves consist of relations between entities/concepts). Concept interactions one identifies in the text can be added to the repository of known facts, and aid the processing of future texts. The amassed knowledge can assist many advanced language-processing tasks, including summarization, question answering and machine translation. Semantic relations are the connections we perceive between things which interact. The book explores two, now intertwined, threads in semantic relations: how they are expressed in texts and what role they play in knowledge repositories. A historical perspective takes us back more than 2000 years to their beginnings, and then to developments much closer to our time: various attempts at producing lists of semantic relations, necessary and sufficient to express the interaction between entities/concepts. A look at relations outside context, then in general texts, and then in texts in specialized domains, has gradually brought new insights, and led to essential adjustments in how the relations are seen. At the same time, datasets which encompass these phenomena have become available. They started small, then grew somewhat, then became truly large. The large resources are inevitably noisy because they are constructed automatically. The available corpora—to