This paper presents a topic-driven framework for generating a generic summary from multi-documents. Our approach is based on the intuition that, from the statistical point of view, the summary's probability distri...
详细信息
Recent kernel-based PPI extraction systems achieve promising performance because of their capability to capture structural syntactic information, but at the expense of computational complexity. This paper incorporates...
详细信息
Recent kernel-based PPI extraction systems achieve promising performance because of their capability to capture structural syntactic information, but at the expense of computational complexity. This paper incorporates dependency information as well as other lexical and syntactic knowledge in a feature-based framework. Our motivation is that, considering the large amount of biomedical literature being archived daily, feature-based methods with comparable performance are more suitable for practical applications. Additionally, we explore the difference of lexical characteristics between biomedical and newswire domains. Experimental evaluation on the AIMed corpus shows that our system achieves comparable performance of 54.7 in F1-Score with other state-of-the-art PPI extraction systems, yet the best performance among all the feature-based ones.
This paper proposes a dependency-driven scheme to dynamically determine the syntactic parse tree structure for tree kernel- based anaphoricity determination in coreference resolution. Given a full syntactic parse tree...
详细信息
This paper proposes a dependency-driven scheme to dynamically determine the syntactic parse tree structure for tree kernel- based anaphoricity determination in coreference resolution. Given a full syntactic parse tree, it keeps the nodes and the paths related with current mention based on constituent dependencies from both syntactic and semantic perspectives, while removing the noisy information, eventually leading to a dependency-driven dynamic syntactic parse tree (D-DSPT). Evaluation on the ACE 2003 corpus shows that the D-DSPT outperforms all previous parse tree structures on anaphoricity determination, and that applying our anaphoricity determination module in coreference resolution achieves the so far best performance.
This paper proposes a new scheme to determine the tree span structure for tree kernel-based anaphoricity determination in coreference resolution. Given a sentence and current mention, it gets all the dependencies, enc...
详细信息
Side chain packing is a crucial step for protein structure prediction. And the quality of rotamer library has important implications in side chain packing. This paper assess dunbrack rotamer libraries and simple sampl...
详细信息
With the development of the 3G technology, M-learning enters a period of rapid development. But meanwhile 3G also restrains the development of M-Iearning due to several drawbacks in its early days. According to the ap...
详细信息
It is very important to find a criterion when to perform a resampling step. Aimed at this problem, an adaptive resampling algorithm in particle filter based on diversity measures is presented. Based on the analysis an...
详细信息
Sidechain prediction is an important subproblem of protein design and structure prediction. Construction of rotamer library is the basis for protein sidechain prediction because it provides the basic searching space f...
详细信息
Sidechain prediction is an important subproblem of protein design and structure prediction. Construction of rotamer library is the basis for protein sidechain prediction because it provides the basic searching space for prediction. However, the state-of-the-art rotamer libraries focus on the statistical information of individual amino acids, ignoring the direct affection of its adjacent amino acids. This article presents a sequence- and backbone-dependent rotamer library. Both the conformation information of adjacent amino acids and torsion angle of the current residue are taken into account to construct a sequence- and backbone-dependent library by HMM. Evaluation on all 13 free modeling targets of CASP8 based on our rotamer library is conducted. Comparing with side-chain prediction based on the state-of-the-art rotamer library, our library outperforms the sidechain prediction accuracy on all the test targets to a certain extent.
This paper presents a topic-driven framework for generating a generic summary from multi-documents. Our approach is based on the intuition that, from the statistical point of view, the summary's probability distri...
详细信息
This paper presents a topic-driven framework for generating a generic summary from multi-documents. Our approach is based on the intuition that, from the statistical point of view, the summary's probability distribution over the topics should be consistent with the multi-documents' probability distribution over the inherent topics. Here, the topics are defined as weighted “bag-of-words” and derived by Latent Dirichlet Allocation from a collection of documents, either the given multi-documents or a related large-scale corpus. In this sense, we could represent various kinds of text units, such as word, sentence, summary, document and multi-documents, using a single vector space model via their corresponding probability distributions over the derived topics. Therefore, we are able to extract a sentence or summary by calculating the similarity between a sentence/summary and the given multi-documents via their topic probability distributions. In particular, we propose two methods in similarity measurement: the static method and the dynamic method. While the former is employed to detect the salience of information in a static way, the later further controls redundancy in a dynamic way. In addition, we integrate various popular features to improve the performance. Evaluation on the TAC 2008 update summarization task shows encouraging results.
Aimed at the deficiency of the resampling algorithm in PF, diversity measures ESS (effective sample size) and PDF (population diversity factor) are evaluated respectively. Combined with the estimation result, diversit...
详细信息
暂无评论