A mathematical framework based on probability theory is presented that enables us to analyze one important aspect of SI algorithms: the population diversity. Firstly the population density degree is defined for the po...
详细信息
We describe for dependency parsing an annotation adaptation strategy, which can automatically transfer the knowledge from a source corpus with a different annotation standard to the desired target parser, with the sup...
详细信息
Thinning algorithms can be classified into two general types: serial and parallel algorithms. Several algorithms have been proposed, but they have limitations. A new thinning algorithm based on the centroid of the blo...
详细信息
This paper applied the Methods which based on GEP in compress multi-streams. The contributions of this paper include: 1) giving an introduction to data function finding based on GEP(DFF-GEP), defining the main concept...
详细信息
Tree-based statistical machine translation models have made significant progress in recent years, especially when replacing 1-best trees with packed forests. However, as the parsing accuracy usually goes down dramatic...
详细信息
Many researchers of swarm intelligence (SI) algorithms take their ideas from physical and biological systems. This approach, however, is mostly qualitative and many ideas remain vague and ill-defined. In this paper, a...
详细信息
Manually annotated corpora are valuable but scarce resources, yet for many annotation tasks such as treebanking and sequence lab.ling there exist multiple corpora with different and incompatible annotation guidelines ...
详细信息
Jointly parsing two languages has been shown to improve accuracies on either or both sides. However, its search space is much bigger than the monolingual case, forcing existing approaches to employ complicated modelin...
详细信息
Shot type is useful information for semantic sports video analysis. Most existing approaches utilize predefined rules and domain knowledge to derive shot types in sports video. Although these methods have achieved pro...
详细信息
ISBN:
(纸本)9781605588407
Shot type is useful information for semantic sports video analysis. Most existing approaches utilize predefined rules and domain knowledge to derive shot types in sports video. Although these methods have achieved promising results in some specific games, it is hard to extend them from one sport to another. To address this problem, we propose a generic approach to classify shots in sports video. Our approach utilizes bag of visual words model to represent key frame for each shot based on Scale Invariant Feature Transform (SIFT) feature points;either Support Vector Machine (SVM) or Probabilistic Latent Semantic Analysis (PLSA) are then employed to classify key frame to determine shot type. As our approach relies little on domain knowledge, it can be more easily extended to different sports. We have evaluated our shot classification approach over five types of sports video and have achieved promising results. To show the usefulness and effectiveness of our shot classification, we apply the results of shot type to detect events in basketball video via a generative-discriminative model. In addition, we have observed that some common visual parts frequently appear across various shots in the same sport or even different but relevant sports. For instance, soccer and basketball are relevant sports in the sense of field-ball game. Motivated by this observation, we attempt to alleviate the problem of insufficient sports video data in some applications by sharing these visual parts across different but relevant kinds of sports. Copyright 2009 ACM.
Manually annotated corpora are valuable but scarce resources, yet for many annotation tasks such as treebanking and sequence lab.ling there exist multiple corpora with different and incompatible annotation guidelines ...
ISBN:
(纸本)9781932432459
Manually annotated corpora are valuable but scarce resources, yet for many annotation tasks such as treebanking and sequence lab.ling there exist multiple corpora with different and incompatible annotation guidelines or standards. This seems to be a great waste of human efforts, and it would be nice to automatically adapt one annotation standard to another. We present a simple yet effective strategy that transfers knowledge from a differently annotated corpus to the corpus with desired annotation. We test the efficacy of this method in the context of Chinese word segmentation and part-of-speech tagging, where no segmentation and POS tagging standards are widely accepted due to the lack of morphology in Chinese. Experiments show that adaptation from the much larger People's Daily corpus to the smaller but more popular Penn Chinese Treebank results in significant improvements in both segmentation and tagging accuracies (with error reductions of 30.2% and 14%, respectively), which in turn helps improve Chinese parsing accuracy.
暂无评论