Multiword expressions (MWEs) have been proved useful for many natural language processing tasks. However, how to use them to improve performance of statistical machine translation (SMT) is not well studied. This paper...
详细信息
Based on the minimize spanning tree and Fiedler vector, a new feature matching algorithm is proposed in this paper. Firstly, a weighted complete graph is constructed with the feature points of each image respectively,...
详细信息
Many researchers of swarm intelligence (SI) algorithms take their ideas from physical and biological systems. This approach, however, is mostly qualitative and many ideas remain vague and ill-defined. In this paper, a...
详细信息
Based on the center of graph, a point pattern feature matching method is proposed here. Firstly, a weighted complete graph is constructed with the feature points of each image, and then the center of each graph is fou...
详细信息
Manually annotated corpora are valuable but scarce resources, yet for many annotation tasks such as treebanking and sequence lab.ling there exist multiple corpora with different and incompatible annotation guidelines ...
详细信息
Jointly parsing two languages has been shown to improve accuracies on either or both sides. However, its search space is much bigger than the monolingual case, forcing existing approaches to employ complicated modelin...
详细信息
This paper applied the Methods which based on GEP in compress multi-streams. The contributions of this paper include: 1) giving an introduction to data function finding based on GEP(DFF-GEP), defining the main concept...
详细信息
To improve the accuracy and efficiency for solving the EFIE (electric field integral equation) by the method of moment, a novel scheme is put forward to improve the calculation precision based on the EFIE. Firstly, th...
详细信息
To improve the accuracy and efficiency for solving the EFIE (electric field integral equation) by the method of moment, a novel scheme is put forward to improve the calculation precision based on the EFIE. Firstly, the inaccurate induced current is obtained by solving the EFIE under coarse mesh grids. Secondly, the scattered magnetic field at any given point on the surface of the object is calculated by the inaccurate induced current. Finally, the accurate induced current at any given point is determined through total magnetic field. The proposed approach is applied to the case of infinitely perfectly electric conducting circular cylinder and square cylinder. The numerical results are presented to demonstrate the high precision and efficiency when the size of mesh grid is large.
Shot type is useful information for semantic sports video analysis. Most existing approaches utilize predefined rules and domain knowledge to derive shot types in sports video. Although these methods have achieved pro...
详细信息
ISBN:
(纸本)9781605588407
Shot type is useful information for semantic sports video analysis. Most existing approaches utilize predefined rules and domain knowledge to derive shot types in sports video. Although these methods have achieved promising results in some specific games, it is hard to extend them from one sport to another. To address this problem, we propose a generic approach to classify shots in sports video. Our approach utilizes bag of visual words model to represent key frame for each shot based on Scale Invariant Feature Transform (SIFT) feature points;either Support Vector Machine (SVM) or Probabilistic Latent Semantic Analysis (PLSA) are then employed to classify key frame to determine shot type. As our approach relies little on domain knowledge, it can be more easily extended to different sports. We have evaluated our shot classification approach over five types of sports video and have achieved promising results. To show the usefulness and effectiveness of our shot classification, we apply the results of shot type to detect events in basketball video via a generative-discriminative model. In addition, we have observed that some common visual parts frequently appear across various shots in the same sport or even different but relevant sports. For instance, soccer and basketball are relevant sports in the sense of field-ball game. Motivated by this observation, we attempt to alleviate the problem of insufficient sports video data in some applications by sharing these visual parts across different but relevant kinds of sports. Copyright 2009 ACM.
Manually annotated corpora are valuable but scarce resources, yet for many annotation tasks such as treebanking and sequence lab.ling there exist multiple corpora with different and incompatible annotation guidelines ...
ISBN:
(纸本)9781932432459
Manually annotated corpora are valuable but scarce resources, yet for many annotation tasks such as treebanking and sequence lab.ling there exist multiple corpora with different and incompatible annotation guidelines or standards. This seems to be a great waste of human efforts, and it would be nice to automatically adapt one annotation standard to another. We present a simple yet effective strategy that transfers knowledge from a differently annotated corpus to the corpus with desired annotation. We test the efficacy of this method in the context of Chinese word segmentation and part-of-speech tagging, where no segmentation and POS tagging standards are widely accepted due to the lack of morphology in Chinese. Experiments show that adaptation from the much larger People's Daily corpus to the smaller but more popular Penn Chinese Treebank results in significant improvements in both segmentation and tagging accuracies (with error reductions of 30.2% and 14%, respectively), which in turn helps improve Chinese parsing accuracy.
暂无评论