Several cost-sensitive boosting algorithms have been reported as effective methods for dealing with the class imbalance problem. Misclassification costs, which reflect the different levels of class identification importance...
This paper presents a Bayes document classifier using phrases as [...]. The phrases are extracted using a grammar that iteratively applies the rules to the sequence of words in the document. This grammar is generated from ...
ISBN: (print) 9781605581309
Extracting natural groups of the unlabeled data is known as clustering. To improve the stability and robustness of the clustering outputs, clustering ensembles have emerged recently. In this paper, an ensemble of particle swarm clustering algorithms is proposed. That is, the members of the ensemble are based on the cooperative swarms clustering approaches. The performance of the proposed particle swarm clustering ensemble is evaluated using different data sets and is compared to that of other clustering techniques.
Cluster analysis is an unsupervised learning technique that is widely used in the process of topic discovery from text. The research presented here proposes a novel unsupervised learning approach based on aggregatio...
This paper presents an algorithm for extraction of phrases from text [...]. The algorithm builds phrases by iteratively merging bigrams according to an association measure. Two association measures are presented: mutual informati...
In previous work, we showed that the use of Multiple Input Representation (MIR) for the classification of time series data provides complementary information that leads to better accuracy [4]. In this paper, we introd...
In this paper, we present a new architecture for combining classifiers. This approach integrates learning into the voting scheme used to aggregate the individual classifiers' decisions. This overcomes the drawbacks of havin...
Urban land-cover classification is one of the most challenging problems in pattern analysis and machine intelligence systems in remote sensing. Dense urban environment sensed by very high-resolution (VHR) optical sens...
In this paper, we propose a cluster-based cumulative representation for cluster ensembles. Cluster labels are mapped to incrementally accumulated clusters, and a matching criterion based on maximum similarity is used....
An algorithm to identify and remove term redundancy is proposed for text classifiers using ranking-based feature selection. The proposed method employs a normalized mutual information, which is called inclusion measur...