Parallel corpus is an indispensable resource for translation model training in statistical machine translation (SMT). Instead of collecting more and more parallel training corpora, this paper aims to improve SMT perfo...
详细信息
In this paper, we propose forest-to-string rules to enhance the expressive power of tree-to-string translation models. A forest-to-string rule is capable of capturing non-syntactic phrase pairs by describing the corre...
详细信息
For Hyper Surface Classification (HSC), based on the concept of Minimal Consistent Subset for a disjoint Cover set (MCSC), a judgmental sampling method is proposed to select a representative subset from the original s...
详细信息
For Hyper Surface Classification (HSC), based on the concept of Minimal Consistent Subset for a disjoint Cover set (MCSC), a judgmental sampling method is proposed to select a representative subset from the original sample set in this *** sampling method depends on sample *** can directly solve the nonlinear multi-class classification problems and observe the sample *** sample distribution is obtained by adaptively dividing the sample space, and the classification model of hyper surface is directly used to classify large database based on Jordan Curve Theorem in Topology while sampling for *** number of MCSC is *** has the same classification model with the entire sample set and can totally reflect its classification *** any subset of the sample set that contains MCSC, the classification ability remains the ***, a formula is put forward that can predict the testing accuracy exactly when some samples are deleted from *** MCSC is the best way of sampling from the original sample set for Hyper Surface Classification method.
Structure alignment could help to find shape similarities between proteins and guide structure classification and fold recognition. Common substructure detection and extraction are especially important, for which coul...
详细信息
ISBN:
(纸本)9781424415786
Structure alignment could help to find shape similarities between proteins and guide structure classification and fold recognition. Common substructure detection and extraction are especially important, for which could guide the biologist to discover binding site or active site. We represent each segment of alpha-carbon backbone by using dihedral angles and curve moment invariants. Then, local and global structure alignment could be performed by iterative closest point algorithm. Maximum common substructures between a pair of proteins or within a protein could be found. Active sites also could be detected by the proposed algorithm.
An Immune Genetic Algorithm (IGA) is used to solve weapon-target assignment problem (WTA). The used immune system serves as a local search mechanism for genetic algorithm. Besides, in our implementation, a new crossov...
详细信息
Superimpose one protein tertiary structure to another can help to find similarity between them and further identify functional and evolutionary relationships. We first extract invariant features under rigid body trans...
详细信息
Predicting functional properties of proteins is needed in a number of applications. A protein is represented as an ordered list of amino acids, where each amino acid has a sequence and a structure component (the terms...
详细信息
One basic observation for pedestrian detection in video sequences is that both appearance and motion information are important to model the moving people. Based on this observation, we propose a new kind of features, ...
详细信息
In this paper, we give an overview of the ICT statistical machine translation systems for the evaluation campaign of the International Workshop on Spoken Language Translation (IWSLT) 2007. In this year’s evaluation, ...
详细信息
In this paper, a technique for the extraction of roads in a high resolution synthetic aperture radar (SAR) image is presented. And a three-step method is developed for the extraction of road network from space borne S...
详细信息
ISBN:
(纸本)9780819469540
In this paper, a technique for the extraction of roads in a high resolution synthetic aperture radar (SAR) image is presented. And a three-step method is developed for the extraction of road network from space borne SAR image: the process of the feature points, road candidate detection and connection. Roads in a high resolution SAR image can be modeled as a homogeneous dark area bounded by two parallel boundaries. Dark areas, which represent the candidate positions for roads, are extracted from the image by a Gaussian probability iteration segmentation. Possible road candidates are further processed using the morphological operators. And the roads are accurately detected by Hough Transform, and the extraction of lines is achieved by searching the peak values in Hough Space. In this process, to detect roads more accurately, post-processing, including noisy dark regions removal and false roads removal is performed. At last, Road candidate connection is carried out hierarchically according to road established models. Finally, the main road network is established from the SAR image successfully. As an example, using the ERS-2SAR image data, automatic detection of main road network in Shanghai Pudong area is presented.
暂无评论