This paper presents a novel filtration criterion to restrict the rule extraction for the hierarchical phrase-based translation model, where a bilingual but relaxed wellformed dependency restriction is used to filter o...
详细信息
In order to support mobile service of long-distance monitoring and controlling UPS based on Web, a kind of design and implementation solution of embedded UPS (EUPS) system is brought forward in this paper. The design ...
详细信息
With the expansion of the Web, automatically organizing large scale text resources, e.g. Web pages, becomes very important. Many Web sites, like Google and Yahoo, use hierarchical classification trees to organize text...
详细信息
A mathematical framework based on probability theory is presented that enables us to analyze one important aspect of SI algorithms: the population diversity. Firstly the population density degree is defined for the po...
详细信息
We describe for dependency parsing an annotation adaptation strategy, which can automatically transfer the knowledge from a source corpus with a different annotation standard to the desired target parser, with the sup...
详细信息
Thinning algorithms can be classified into two general types: serial and parallel algorithms. Several algorithms have been proposed, but they have limitations. A new thinning algorithm based on the centroid of the blo...
详细信息
This paper applied the Methods which based on GEP in compress multi-streams. The contributions of this paper include: 1) giving an introduction to data function finding based on GEP(DFF-GEP), defining the main concept...
详细信息
Tree-based statistical machine translation models have made significant progress in recent years, especially when replacing 1-best trees with packed forests. However, as the parsing accuracy usually goes down dramatic...
详细信息
Many researchers of swarm intelligence (SI) algorithms take their ideas from physical and biological systems. This approach, however, is mostly qualitative and many ideas remain vague and ill-defined. In this paper, a...
详细信息
Manually annotated corpora are valuable but scarce resources, yet for many annotation tasks such as treebanking and sequence lab.ling there exist multiple corpora with different and incompatible annotation guidelines ...
详细信息
暂无评论