Online sequential extreme learningmachine (OS-ELM) proposed by Liang et al. employ sequential learning strategy to learn the target concept from the data. Compared with the original ELM, OS-ELM can learn data one-by-...
详细信息
ISBN:
(纸本)9781479938414
Online sequential extreme learningmachine (OS-ELM) proposed by Liang et al. employ sequential learning strategy to learn the target concept from the data. Compared with the original ELM, OS-ELM can learn data one-by-one or chunk-by-chunk with fixed or varying chunk size with almost same performance as ELM. While compared with other state-of-the-art sequential algorithms such as SGBP, RAN and GAP-RBF, OS-ELM has faster learning speed and better generalization ability. However, similar to ELM, OS-ELM also has instability in different trials of simulations. In addition, for large data sets, OS-ELM will not halt when there are training samples not be learned, this phenomenon results in long learning time. In order to deal with the problems, this paper proposes an algorithm named E-OS-ELM for integrating OS-ELM to classify large data sets. The experimental results show that the proposed method is effective and efficient;it can effectively overcome the drawbacks mentioned above.
It is quite inadequate in providing formula retrieval function by traditional retrieval techniques used in full-text information retrieval system. The main reason is that there are many difficulties to extract the key...
详细信息
In this paper, we propose a new method based on Chinese keyword search to select the WAV or MP3 files in audio post-production. First, we listen to each file and label it with Chinese characters, and then classify and...
详细信息
As the kernel component of scientific documents, mathematical expressions are becoming a new object of searching engines. Different from normal text, mathematical expressions are composed of various kinds of symbols a...
详细信息
ISBN:
(纸本)9781479905607
As the kernel component of scientific documents, mathematical expressions are becoming a new object of searching engines. Different from normal text, mathematical expressions are composed of various kinds of symbols arranged in nonlinear mode, which results in the limitations of traditional full-text information retrieval used for expression searching. In this paper, we discuss the existing search engine of mathematical expressions and introduce the two-dimensional characteristics of mathematical expressions firstly. Then, a data structure of expressing mathematical formulas is designed which contains not only the symbol code but also the mathematical information among symbols. Finally, the indexing algorithm of mathematical expressions is put forward on the basis of the expression data structure. The experimental result shows the effectiveness of the indexing method proposed in this paper.
For the problem of imbalanced data classification which was not discussed in the standard Extreme Support Vector machines (ESVM), an imbalanced extreme support vector machines (IESVM) was proposed. Firstly, a prelimin...
详细信息
For the traditional content-based image retrieval system, the number of irrelevant images for a given query image is significantly more than that of relevant images in an image repository. Therefore, the numbers of ne...
详细信息
Since support vector data description (SVDD) is regarded as a strong classifier, the traditional ensemble methods are not fit for directly combining the results of several SVDDs. Moreover, as is well-known, when many ...
详细信息
In this paper, a robust feature extraction method based on regularized correntropy criterion (RCC) is proposed for novelty detection. In RCC, the criterion aims to maximize the difference between the correntropy of th...
详细信息
ISBN:
(纸本)9781467317139
In this paper, a robust feature extraction method based on regularized correntropy criterion (RCC) is proposed for novelty detection. In RCC, the criterion aims to maximize the difference between the correntropy of the normal data with their mean and the correntropy of the novel data with the mean of normal data. Moreover, the optimal projection vectors in the proposed objective function can be obtained by the half-quadratic (HQ) optimization technique with an iterative manner. Experimental results on one synthetic data set and nine benchmark data sets for novelty detection demonstrate that the proposed method is superior to its related approaches.
PLSA(Probabilistic Latent Semantic Analysis) is a popular topic modeling technique for exploring document collections. Due to the increasing prevalence of large datasets, there is a need to improve the scalability of ...
详细信息
For the traditional content-based image retrieval system, the number of irrelevant images for a given query image is significantly more than that of relevant images in an image repository. Therefore, the numbers of ne...
详细信息
For the traditional content-based image retrieval system, the number of irrelevant images for a given query image is significantly more than that of relevant images in an image repository. Therefore, the numbers of negative samples and positive samples are highly unbalanced, which makes the traditional binary classifiers ineffective. In this paper, our proposed modified AdaBoost-based one-class support vector machine (OCSVM) ensemble is utilized to deal with the aforesaid problem. In our proposed method, the weight update formula of training data for AdaBoost is modified to make AdaBoost fit for combining the results of OCSVMs even though OCSVM is regarded as a strong classifier. Compared with the other three related methods, our proposed approach exhibits better performance on the three benchmark image databases.
暂无评论