咨询与建议

看过本文的还看了

相关文献

该作者的其他文献

文献详情 >Identification of Boundaries i... 收藏

Identification of Boundaries in Parallel Noun Phrases: A Probabilistic Swapping Model

作     者:YONG-SEOK CHOI JI-AE SHIN KEY-SUN CHOI 

作者机构:Department of Computer Science Division of Electrical Engineering and Computer Science Korea Advanced Institute of Science and Technology (KAIST) 373-1 Guseong-dong Yuseong-gu Daejeon 305-701 Korea Division of Engineering Information and Communications University (ICU) 103-6 MunJi-dong Yuseong-gu Daejeon 305-714 Korea 

出 版 物:《International Journal of Computer Processing of Languages》 

年 卷 期:2006年第19卷第2N03期

页      面:109-132页

学科分类:08[工学] 0812[工学-计算机科学与技术(可授工学、理学学位)] 

主  题:Parallel structure Parallel noun phrase Symmetry Reversibility Sentence parsing Alignment 

摘      要:Parallel structure is a way to factor out common constituents in the expressions, which makes an effect of simplification of expressions. The complexity can be greatly reduced at the phase of sentence parsing by identifying such boundaries of parallel structure. In this paper, we propose a probabilistic model to identify parallel cores (corresponding constituents) as well as boundaries of parallel noun phrases conjoined by wa/gwa (conjunctive particle in Korean). It is based on the idea of swapping constituents, utilizing symmetry (two or more identical constituents are repeated) and reversibility (the order of constituents is changeable) in parallel structure. The probabilities are calculated from (unlabelled) corpus with parallel structures, which is an advantage over the approaches trained with labeled corpus. Our model, moreover, is not dependent on languages. It is also shown that the semantic features of the modifiers around parallel noun phrase and the patterns among words can be utilized further to correct the boundaries identified by the swapping model. Experiment shows that our probabilistic swapping model performs much better than symmetry-based model and machine learning based approaches.

读者评论 与其他读者分享你的观点

用户名:未登录
我的评分