The article discusses a knowledge management system developed for Chinese language arts teachers. Details about the system and the subject of Chinese language arts are provided. The complexity of Chinese language arts textbooks is examined, and the management program is presented as a way to organize class material effectively. A diagram illustrating the schema of the database program is presented, and several images illustrating the usefulness and functionality of the program are also provided.
This thesis addresses two important problems in Chinese information processing, namely Chinese chunk segmentation and the similarity measure of Chinese sentences. The three main contributions reported in this thesis are: (1) a novel Chinese chunk segmentation technique using a statistical model combined with correction rules generated by an error-correction mechanism; (2) a novel similarity measure of Chinese sentences using both word/chunk sequences and POS (Part of Speech) tag sequences of the sentences; and (3) the optimization of the parameters used in the combined similarity measure by applying a relevance feedback technique and a neural network model. In the first investigation, a statistical model combined with correction rules generated by an error-correction mechanism is proposed for Chinese chunk segmentation. The sentences in the training corpus were segmented into chunks manually to provide a ground truth for training the statistical model, from which preliminary chunk segmentation results are obtained. The segmentation result from the statistical model (both correctly and incorrectly segmented chunks) is then used to generate a set of correction rules for refining that result. The error-correction mechanism generates these rules by comparing the preliminary segmentation result with the manually segmented result. The statistical model and the learned correction rules can then be used to perform Chinese chunk segmentation of unseen sentences. In the second investigation, novel similarity measures of Chinese sentences are proposed that use the word/chunk sequences and POS tag sequences of the sentences. The sentence similarity measure is a very important component of example-based machine translation (EBMT). Unlike English sentences, Chinese sentences have no delimiter between words. Hence, Chinese word/chunk delimitation shoul
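As a rough illustration of the idea of combining word/chunk sequences with POS tag sequences, the sketch below blends two sequence similarities with a single weight. It is not the measure defined in the thesis, whose formula is not given in this abstract. The function names, the weight `alpha`, the use of Python's `difflib.SequenceMatcher` as the sequence comparison, and the sample sentences and tags are all assumptions made for the example. The sentences are assumed to be already segmented and tagged.

```python
from difflib import SequenceMatcher


def sequence_similarity(a, b):
    """Return a similarity in [0, 1] between two token sequences.

    Uses difflib's ratio (2 * matches / total length) as a stand-in;
    the thesis's own sequence comparison is not specified here.
    """
    return SequenceMatcher(None, a, b, autojunk=False).ratio()


def combined_similarity(words_a, tags_a, words_b, tags_b, alpha=0.5):
    """Blend word/chunk-sequence and POS-tag-sequence similarity.

    alpha is a hypothetical weight: 1.0 uses only the words/chunks,
    0.0 uses only the POS tags.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    word_sim = sequence_similarity(words_a, words_b)
    tag_sim = sequence_similarity(tags_a, tags_b)
    return alpha * word_sim + (1.0 - alpha) * tag_sim


if __name__ == "__main__":
    # Hypothetical pre-segmented, pre-tagged sentences (illustrative tags).
    words_a, tags_a = ["我", "喜欢", "苹果"], ["r", "v", "n"]
    words_b, tags_b = ["我", "喜欢", "香蕉"], ["r", "v", "n"]
    print(combined_similarity(words_a, tags_a, words_b, tags_b))
```

In a setup like this, the weight `alpha` plays the role of the parameter that the abstract says is tuned with relevance feedback and a neural network model; here it is simply fixed by hand.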