In this paper,volume models are obtained from closed surface models by an accurate voxelization method which can handle the hidden cavities. This kind of 3D binary images is then converted to gray-level images by a fa...
详细信息
In this paper,volume models are obtained from closed surface models by an accurate voxelization method which can handle the hidden cavities. This kind of 3D binary images is then converted to gray-level images by a fast Euclidean distance transform (EDT).Moment invariants (MIs) which are invariant shape descriptors under similarity transformations,are then computed based on the gray images. Applications in shape analysis area such as principal axis determination,skeleton and medial axis extraction,and shape retrieval can be carried out base on EDT and MIs.
This paper proposed a motion vector error function to segment the objects in video sequence. The key point of the segmentation technique is how to set the appropriate threshold to distinguish the global motion region ...
详细信息
This paper proposed a motion vector error function to segment the objects in video sequence. The key point of the segmentation technique is how to set the appropriate threshold to distinguish the global motion region from the local motion region precisely. This paper also proposed a hierarchical threshold technique based on global motion estimation to solve this problem. Experimental results show that the proposed techniques are robust techniques which refine the set of global motion pixels hierarchically and segment the video objects effectively.
To solve the time-consuming problem of the fitness assignment in the multi-objective evolutionary algorithm, this paper proposes a novel fitness assignment-dominating tree. The dominating tree preserves the necessary ...
详细信息
To solve the time-consuming problem of the fitness assignment in the multi-objective evolutionary algorithm, this paper proposes a novel fitness assignment-dominating tree. The dominating tree preserves the necessary relationships among individuals, contains the density information implicitly, and reduces the comparisons among individuals distinctly. In addition, a smart eliminating strategy based on the dominating tree maintains the diversity of the population without extra expenses. A new multi-objective evolutionary algorithm based on dominating tree is proposed on these innovations. By examining three performance metrics on six test problems, the new algorithm is found to be competitive with SPEA2 and NSGA-II in terms of converging to the true Pareto front and maintaining the diversity of the population, moreover, it is much faster than other two algorithms.
In this paper, a 3D polar-radius surface moment is proposed, and is used for 3D model retrieval. 3D polar-radius surface moments are new moment invariants based on 3D polar-radius moments including the invariance on s...
详细信息
In this paper, a 3D polar-radius surface moment is proposed, and is used for 3D model retrieval. 3D polar-radius surface moments are new moment invariants based on 3D polar-radius moments including the invariance on shift, rotation and scale transforms. Compared to previous methods to compute such moments, the computational complexity for calculating 3D moments can be decreased considerably. With the help of these moment invariants, the 3D models are distinguished accurately.
Existing video research incorporates the use of relevance feedback based on user-dependent interpretations to improve the retrieval results. In this paper, we segregate the process of relevance feedback into 2 distinc...
详细信息
ISBN:
(纸本)9781595937025
Existing video research incorporates the use of relevance feedback based on user-dependent interpretations to improve the retrieval results. In this paper, we segregate the process of relevance feedback into 2 distinct facets: (a) recall-directed feedback;and (b) precision-directed feedback. The recall-directed facet employs general features such as text and high level features (HLFs) to maximize efficiency and recall during feedback, making it very suitable for large corpuses. The precision-directed facet on the other hand uses many other multimodal features in an active learning environment for improved accuracy. Combined with a performance-based adaptive sampling strategy, this process continuously re-ranks a subset of instances as the user annotates. Experiments done using TRECVID 2006 dataset show that our approach is efficient and effective. Copyright 2007 ACM.
The understanding of data is highly relevant to how one senses and perceives them. The existing approaches for classification have been developed mainly based on exploring the intrinsic structure of dataset itself les...
详细信息
The understanding of data is highly relevant to how one senses and perceives them. The existing approaches for classification have been developed mainly based on exploring the intrinsic structure of dataset itself less or no emphasis paid on simulating human visual cognition. A new hyper surface classification method (HSC) has been studied since 2002. HSC is a universal classification method, in which a model of hyper surface is obtained by adoptively dividing the sample space and then the hyper surface is directly used to classify large database based on Jordan curve theorem in topology. In this paper we point out that HSC is a cognitive data visualization method. Simulation results show the effectiveness of the proposed method on large test data with complex distribution and high density. In particular, we show that HSC can very often bring a significant reduction of computation effort without loss of prediction capability.
In this paper a novel direct clustering algorithm based on generalized information distance (GID) is put forward. Firstly, based on information theory, a basic concept of measure of diversity is given and an inequalit...
详细信息
In this paper a novel direct clustering algorithm based on generalized information distance (GID) is put forward. Firstly, based on information theory, a basic concept of measure of diversity is given and an inequality about measure of diversity is proved. Based on this inequality, a concept of increment of diversity is discussed and a defined. Secondly, by analyzing distance measure, two new concepts of generalized information distance (GID) and improved generalized information distance (IGID) are proposed, and a new direct clustering algorithm based on GID and IGID is designed. Finally this algorithm is applied to soil fertility data processing, and compared with hierarchical clustering algorithm (HCA). The results of simulation application show that the algorithm presented here is feasible and effective. Because of simplicity of algorithm and robustness. It provides a new research approach for studies of pattern recognition theory.
Topic-based language model has attracted much attention as the propounding of semantic retrieval in recent years. Especially for the ASR text with errors, the topic representation is more reasonable than the exact ter...
详细信息
Topic-based language model has attracted much attention as the propounding of semantic retrieval in recent years. Especially for the ASR text with errors, the topic representation is more reasonable than the exact term representation. Among these models, Latent Dirichlet Allocation(LDA) has been noted for its ability to discover the latent topic structure, and is broadly applied in many text-related tasks. But up to now its application in information retrieval(IR) is still limited to be a supplement to the standard document models, and furthermore, it has been pointed out that directly employing the basic LDA model will hurt retrieval performance. In this paper, we propose a lexicon-guided two-level LDA retrieval framework. It uses the HowNet to guide the first-level LDA model's parameter estimation, and further construct the second-level LDA models based on the first-level's inference results. We use TRECID 2005 ASR collection to evaluate it, and compare it with the vector space model(VSM) and latent semantic Indexing(LSI). Our experiments show the proposed method is very competitive.
In inter-picture coding, block-based frequency transform is usually carried out on the predicted errors for each inter-block to remove the spatial correlation among them. However, it can not always do well since the p...
详细信息
In inter-picture coding, block-based frequency transform is usually carried out on the predicted errors for each inter-block to remove the spatial correlation among them. However, it can not always do well since the predicted errors in some inter-blocks have marginal or diagonal correlation. A good solution is to omit transform operations for the predicted errors of those inter-blocks with low correlation before quantization operation. The same phenomenon also can be observed in Fine Grain Scalability (FGS) layer coding. In this paper, an adaptive prediction error coding method in spatial and frequency domain with lower complexity is considered for FGS layer coding. Transform operation is only needed when there are non-zero reconstructed coefficients in spatially co-located block in base layer. The experimental results show that compared with FGS coding in JSVM, higher coding efficiency can be achieved with lower computational complexity at decoder since inverse transform is no longer needed for those predicted errors coded in spatial domain at encoder.
暂无评论