Multi-view learning methods have achieved remarkable results in 3D shape recognition. However, most of them focus on the visual feature extraction and feature aggregation, while viewpoints (spatial positions of virtua...
详细信息
Multi-view learning methods have achieved remarkable results in 3D shape recognition. However, most of them focus on the visual feature extraction and feature aggregation, while viewpoints (spatial positions of virtual cameras) for generating multiple views are often ignored. In this paper, we deeply explore the correlation between viewpoints and shape descriptor, and propose a novel viewpoint-guided prototype learning network (VGP-Net). We introduce a prototype representation for each class, including viewpoint prototype and feature prototype. The viewpoint prototype is the average weight of each viewpoint learned from a small support set via Score Unit, and stored in a weight dictionary. Our VGP model self-adaptively learns the view-wise weights by dynamically assembling with the viewpoint prototypes in weight dictionary and performing element-wise operation via view pooling layer. Under the guidance of viewpoint prototype, important visual features are enhanced, while those negligible features are eliminated. These refined features are effectively fused to generate compact shape descriptor. All the shape descriptors are clustered in feature embedding space, and the cluster center represents the feature prototype of each class. The classification thus can be performed by searching the nearest distance to feature prototypes. To boost the learning process, we further present a multi-stream regularization mechanism in both feature space and viewpoint space. Extensive experiments demonstrate that our VGP-Net is efficient, and the learned deep features have stronger discrimination ability. Therefore, it can achieve better performance compared to state-of-the-art methods.
Two-class supervised learning in the context of a classifier ensemble may be formulated as learning an incompletely specified Boolean function, and the associated Walsh coefficients can be estimated without the knowle...
详细信息
Two-class supervised learning in the context of a classifier ensemble may be formulated as learning an incompletely specified Boolean function, and the associated Walsh coefficients can be estimated without the knowledge of the unspecified patterns. Using an extended version of the Tumer-Ghosh model, the relationship between added classification error and second-order Walsh coefficients is established. In this brief, the ensemble is composed of multilayer perceptron base classifiers, with the number of hidden nodes and epochs systematically varied. Experiments demonstrate that the mean second-order coefficients peak at the same number of training epochs as ensemble test error reaches a minimum.
As the most widely known data mining algorithm, classification algorithms have attracted wide attention. K-Nearest Neighbor(KNN) algorithm and decision tree algorithm are the two widely known algorithms in classificat...
详细信息
As the most widely known data mining algorithm, classification algorithms have attracted wide attention. K-Nearest Neighbor(KNN) algorithm and decision tree algorithm are the two widely known algorithms in classification algorithms. Sometimes, people not sure how to choose the suitable to solve the classification problems. In this paper, we establish KNN algorithm model and decision tree ID3 algorithm model to analyze the accuracy of the two algorithms in the same data set with different number of features. Through the learning curve and cross validation, we find ID3 algorithm is better than KNN algorithm, and when the number of feature increased the accuracy of KNN is increasing while ID3 is decreased.
Based on the introduction of the traditional feature weighting algorithm TF-IDF,based on the phenomenon that the eigenvalue extraction is not effective when the text to be classified is not uniform,an improved TF-IDF ...
详细信息
ISBN:
(纸本)9781510871076
Based on the introduction of the traditional feature weighting algorithm TF-IDF,based on the phenomenon that the eigenvalue extraction is not effective when the text to be classified is not uniform,an improved TF-IDF algorithm is proposed in this paper,which considers the uneven text distribution *** experimental results show that the results obtained by the classification algorithm using the improved algorithm are better than the original algorithm in terms of accuracy and recall and make up for the defects of the original TF-IDF algorithm.
It has been observed that a good number of financial organizations often face a number of threats due to credit card fraud that affects consistently to the card holder as well as the organizations. This is one of the ...
详细信息
It has been observed that a good number of financial organizations often face a number of threats due to credit card fraud that affects consistently to the card holder as well as the organizations. This is one of the fastest-growing frauds of its kind and the most emerging problems for the institutions to prevent. A number of researchers and analysts have shown interest to work on this area in order to identify such issues in an effective manner by applying various supervised as well as unsupervised learning approaches. In this assessment, three classification techniques such as support vector machine (SVM), k-nearest neighbor (k-NN), and extreme learning machine (ELM) that come under supervised learning category are applied to the BankSim data to categorize the normal and fraudulent class transactions in credit card. These algorithms are incorporated with the graph features extracted from the dataset by using a database tool Neo4j. The nodes of the graph represent the transactional data samples and the edges create relationships among the nodes to find the patterns of data using connected data analysis. k-fold cross validation approach in Gaussian mixture model (GMM) has been applied for classification of the credit card transaction data in a single distribution. Further, a combined graph-based Gaussian mixture model (CGB-GMM) has been proposed to effectively detect the fraudulent instances in credit card transactions with the application of graph algorithms such as degree centrality, LPA, page rank, and so forth. Each of the learning algorithms are implemented with and without the application of graph algorithms and their performances are assessed empirically for analysis.
Insomnia is a common public health problem and an open biomedical research topic. Insomnia results in various health problems, including memory decline, decreases concentration and weakens problem-solving ability. The...
详细信息
Insomnia is a common public health problem and an open biomedical research topic. Insomnia results in various health problems, including memory decline, decreases concentration and weakens problem-solving ability. The insufficient sleep also leads to skin ageing, heart disease, high blood pressure, arrhythmia and stroke. While it remains as a global health concern, sleep quality improvement using modern technologies, such as machine learning, classification technologies, virtual reality (VR), becomes an open and hot research problem. These modern technologies offer new curing solutions under certain conditions. In this paper, we present a sleeping-aid system with a single-channel electroencephalogram (EEG) sleep stage classification algorithm to improve the sleep quality. The sleeping-aid system promotes machine learning integrated VR and multimedia technology for sleep improvement. Ninety participants were invited to test on three different systems with 3D VR, 2D video, and music only. An adequate stimulus of audio-vision can be a complement of the drug treatment. The experimental results showed that the proposed method demonstrated superior performance over existing methods.
Multi-view data represented in multiple views contains more complementary information than a single view, whereby multi-view learning explores and utilizes the multi-view data. In general, most existing multi-view lea...
详细信息
Multi-view data represented in multiple views contains more complementary information than a single view, whereby multi-view learning explores and utilizes the multi-view data. In general, most existing multi-view learning methods consider the correlation between multiple views. However, the relationship between classes and views which is also important in multi-view learning has never been involved in the existing works. In this paper, we propose a fast and effective multi-view nearest-subspace classifier (MV-NSC) by taking advantage of both the two relationships simultaneously. MV-NSC consists of four main parts: 1) projection residual, 2) view-dependent class separability, 3) view similarity, and 4) final decision. The last part combines the first three parts in one final decision matrix, while the first three parts utilize the information of the multi-view data in various aspects. Our proposed method is evaluated on four benchmark datasets and compared with seven other classifiers including both multi-and single-view algorithms. According to the experimental results, it shows that our proposed method is effective, efficient, and robust in multi-view classification.
The amount of information increases explosively in Internet of Things, because more and more data are sensed by large amount of sensors. The explosive growth of information makes it difficult to access information eff...
详细信息
The amount of information increases explosively in Internet of Things, because more and more data are sensed by large amount of sensors. The explosive growth of information makes it difficult to access information efficiently, so it is an effective method to decrease the amount of information to be transferred on network by text classification. This paper proposes a new text classification algorithm based on vector space model. This algorithm improves the feature selection and weighting methods by introducing synonym replacement to traditional text classification algorithms. The experimental results show that the proposed classification algorithm has considerably improved the precision and recall of classification.
Control charts have been widely used to improve manufacturing processes by reducing variations and defects. In particular, multivariate control charts have been effectively applied with monitoring processes that conta...
详细信息
Control charts have been widely used to improve manufacturing processes by reducing variations and defects. In particular, multivariate control charts have been effectively applied with monitoring processes that contain many correlated variables. Most existing multivariate control charts are vulnerable to mis-classification errors that originate because of the hypothesis tests. In particular, these often cause the generation of a large number of false alarms. In this paper, we propose a procedure to reduce false alarms by combining a multivariate control chart and data mining algorithms. Simulation and real case studies demonstrate that the proposed method effectively reduces the false alarm rate. (C) 2015 Elsevier Ltd. All rights reserved.
Background: Combining clinical and molecular data types may potentially improve prediction accuracy of a classifier. However, currently there is a shortage of effective and efficient statistical and bioinformatic tool...
详细信息
Background: Combining clinical and molecular data types may potentially improve prediction accuracy of a classifier. However, currently there is a shortage of effective and efficient statistical and bioinformatic tools for true integrative data analysis. Existing integrative classifiers have two main disadvantages: First, coarse combination may lead to subtle contributions of one data type to be overshadowed by more obvious contributions of the other. Second, the need to measure both data types for all patients may be both unpractical and (cost) inefficient. Results: We introduce a novel classification method, a stepwise classifier, which takes advantage of the distinct classification power of clinical data and high-dimensional molecular data. We apply classification algorithms to two data types independently, starting with the traditional clinical risk factors. We only turn to relatively expensive molecular data when the uncertainty of prediction result from clinical data exceeds a predefined limit. Experimental results show that our approach is adaptive: the proportion of samples that needs to be re-classified using molecular data depends on how much we expect the predictive accuracy to increase when re-classifying those samples. Conclusions: Our method renders a more cost-efficient classifier that is at least as good, and sometimes better, than one based on clinical or molecular data alone. Hence our approach is not just a classifier that minimizes a particular loss function. Instead, it aims to be cost-efficient by avoiding molecular tests for a potentially large subgroup of individuals;moreover, for these individuals a test result would be quickly available, which may lead to reduced waiting times (for diagnosis) and hence lower the patients distress. Stepwise classification is implemented in R-package stepwiseCM and available at the Bioconductor website.
暂无评论