After analyzing the disadvantages of traditional text clustering method based on keywords set, a novel approach for clustering of Chinese text based on concept hierarchy is presented. It introduces a Chinese topic cla...
详细信息
After analyzing the disadvantages of traditional text clustering method based on keywords set, a novel approach for clustering of Chinese text based on concept hierarchy is presented. It introduces a Chinese topic classify dictionary as background knowledge to clustering of Chinese text. It adopts a hierarchical coding system which reflects concept relevance among different words and uses vector space model based on concept hierarchy to represent Chinese text. The experimental results show this approach is more effective than traditional text clustering method based on keywords set
Let {vij}, i, j = 1, 2, …, be i.i.d, random variables with Ev11 = 0, Ev11^2 = 1 and a1 = (ai1,…, aiM) be random vectors with {aij} being i.i.d, random variables. Define XN =(x1,…, xk) and SN =XNXN^T,where xi=ai...
详细信息
Let {vij}, i, j = 1, 2, …, be i.i.d, random variables with Ev11 = 0, Ev11^2 = 1 and a1 = (ai1,…, aiM) be random vectors with {aij} being i.i.d, random variables. Define XN =(x1,…, xk) and SN =XNXN^T,where xi=ai×si and si=1/√N(v1i,…, vN,i)^T. The spectral distribution of SN is proven to converge, with probability one, to a nonrandom distribution function under mild conditions.
Barcode has been widely applied in the modern world. This paper presents a fast and robust recognition method of noisy code 39 barcode. The proposed method can be divided into two steps: search and decoding. In the fi...
详细信息
Barcode has been widely applied in the modern world. This paper presents a fast and robust recognition method of noisy code 39 barcode. The proposed method can be divided into two steps: search and decoding. In the first step, all asterisks in the image are found with evenly defined scan lines and then those with the same directions are matched together to get a valid barcode region. In the second step, a local denoise method is first applied to eliminate noise in the barcode region and then a middle band filter is used to decode the barcode. Our method is simple in comparison with former methods and experimental results show that it is efficient for fast barcode recognition on noisy images.
This paper aims to carry out granular analysis of time sequence based on quotient space. Granular methods have long before been adopted to analyze time sequence, but the granularity was based on time, for example, day...
详细信息
This paper aims to carry out granular analysis of time sequence based on quotient space. Granular methods have long before been adopted to analyze time sequence, but the granularity was based on time, for example, day mean, month mean, year mean and so on in finance forecast. In this paper, the granularity is based on space and some significant results are obtained: we can, in certain circumstances, get characteristics of time sequence in an original space when carrying out granular analysis of it in its coarser-grain space; granular analysis of a Markov chain is equivalent to an hidden Markov model (HMM), contrarily, any HMM is equivalent to granular analysis of a Markov chain. These results deepened our understanding of HMM from the perspective of granular analysis. We can not only use the methods of HMM to study time sequence, but also use the methods of granular analysis based on quotient space theory to solve the problems of HMM.
We propose an appearance-based image clustering approach called GGCI (global geometric clustering for image). For face images taken with varying pose, expression, eyes (wearing sunglasses or not) or object images unde...
详细信息
ISBN:
(纸本)0769525210
We propose an appearance-based image clustering approach called GGCI (global geometric clustering for image). For face images taken with varying pose, expression, eyes (wearing sunglasses or not) or object images under different viewing conditions, GGCI uses easily measured local metric information to learn the underlying global geometry of images space, then apply the extended nearest neighbor approach to cluster images. Different from the usual nearest neighbor approach, GGCI considers the density around the nearest points within clusters. Moreover, our approach clusters based on the geodesic distance measure instead of Euclidean distance measure, which better reflects the intrinsic geometric structure of manifold embedded in high dimensional image space. Experimental results suggest that the proposed GGCI approach achieves lower error rates in image clustering when manifolds are embedded in image space
The functional network was introduced by ***, which extended the neural network. Not only can it solve the problems solved, but also it can formulate the ones that cannot be solved by traditional network. This paper a...
详细信息
The functional network was introduced by ***, which extended the neural network. Not only can it solve the problems solved, but also it can formulate the ones that cannot be solved by traditional network. This paper applies functional network to approximate the multidimension function under the ridgelet theory. The method performs more stable and faster than the traditional neural network. The numerical examples demonstrate the performance.
Web services composition techniques are gaining momentum as the opportunity to establish reusable and versatile inter-operability applications. The purpose of semantic Web services is to use semantic specification to ...
详细信息
Web services composition techniques are gaining momentum as the opportunity to establish reusable and versatile inter-operability applications. The purpose of semantic Web services is to use semantic specification to automate the discovery, invocation, and composition Web services. Description logics is the formalized foundation of semantic Web services and provides well-defined semantics. And many researchers propose their composition approach based on planning techniques. We propose our service composition methods based on description logics and AI planning technologies. Our algorithm for services composition uses backward-chaining search method to find potential candidate services. And we propose a DAG-based method to generate the planning process and filtering the inappropriate services during the DAG generation process. We test our approach on a simple, yet realistic example, and the preliminary results demonstrate that our implementation provides a practical solution
A great variety of languages can be designed by different people for different purposes to operate resource spaces. Two fundamental issues are: can we design more operations in addition to existing operations? and, ho...
A great variety of languages can be designed by different people for different purposes to operate resource spaces. Two fundamental issues are: can we design more operations in addition to existing operations? and, how many operations are sufficient or necessary? This paper solves these problems by investigating the theoretical basis for determining how complete a selection capability is provided in a resource operation sublanguage independent of any host language. The result is very useful to the design and analysis of operating languages.
Feature compression is one of the most importmant steps in pattern recognition. In this paper, based on minimum squared error (MSE) rule, we first give discrete K-L transform (DKLT). According to idea of entropy funct...
详细信息
Effective document classification is a long-pursued goal in knowledge management. This paper proposes a novel hybrid approach of semantic representation and statistical measurements. Document is divided into content s...
Effective document classification is a long-pursued goal in knowledge management. This paper proposes a novel hybrid approach of semantic representation and statistical measurements. Document is divided into content segments first. By Formal Concept Analysis (FCA), their semantic links with standard concept identifiers are built up whose weights are calculated statistically. In this way, effective concept fusing and document classification can be achieved. In addition, a semantic overlay for specific documents will be constructed via concept fusing. Experiments show our approach is feasible and effective.
暂无评论