fuzzy c-means algorithm (Fcm) frequently applid in machine learning has been proven an effective clustering approach. However, the traditional Fcm cannot distinguish the importance of the different data objects and th...
详细信息
fuzzy c-means algorithm (Fcm) frequently applid in machine learning has been proven an effective clustering approach. However, the traditional Fcm cannot distinguish the importance of the different data objects and the discriminative ability of the different features in the clustering process. In this paper, we propose a new kind of Fcm clustering framework: *** the different data weights and feature weights, an adaptive data weights vector and an adaptive feature weights matrix are introduced into the conventional Fcm and a new objective function is constructed. By the proposed objective function, the corresponding scientific updating iterative rules of the membership matrix, the weights of the different feature, the weights of the different data object and the cluster centers can be derived *** results have demonstrated that the algorithm proposed in this paper can deliver consistently promising results and improve the clustering performance greatly.
Mining smart data from the collected big data in Internet of Things which attempts to better human life by integrating physical devices into the information space. As one of the most important clustering techniques fo...
详细信息
Mining smart data from the collected big data in Internet of Things which attempts to better human life by integrating physical devices into the information space. As one of the most important clustering techniques for drilling smart data, the fuzzy c-means algorithm (FcM) assigns each object to multiple groups by calculating a membership matrix. However, each big data object has a large number of attributes, posing an remarkable challenge on FcM for loT big data real-time clustering. In this paper, we propose an efficient fuzzyc-means approach based on the tensor canonical polyadic decomposition for clustering big data in Internet of Things. In the presented scheme, the traditional fuzzy c-means algorithm is converted to the high-order tensor fuzzy c-means algorithm (HOFcM) via a bijection function. Furthermore, the tensor canonical polyadic decomposition is utilized to reduce the attributes of every objects for enhancing the clustering efficiency. Finally, the extensive experiments are conducted to compare the developed scheme with the traditional fuzzy c-means algorithm on two large loT datasets including sWSN and eGSAD regarding clustering accuracy and clustering efficiency. The results argue that the developed scheme achieves a significantly higher clustering efficiency with a slight clustering accuracy drop compared with the traditional algorithm, indicating the potential of the developed scheme for drilling smart data from loT big data. (c) 2018 Elsevier B.V. All rights reserved.
This paper presents a fuzzy support vector classifier by integrating modified fuzzyc-meansclustering based on Mahalanobis distance into fuzzy support vector data description. The proposed algorithmcan be used to de...
详细信息
This paper presents a fuzzy support vector classifier by integrating modified fuzzyc-meansclustering based on Mahalanobis distance into fuzzy support vector data description. The proposed algorithmcan be used to deal with the outlier sensitivity problem in traditional multi-class classification problems. The modified fuzzyc-meansclustering algorithm based on Mahalanobis distance takes into the samples' correlation account, and is improved to generate different weight values for main training data points and outliers according to their relative importance in the training data. Experimental results show that the proposed method can reduce the effect of outliers and give high classification accuracy.
In MRI, the image with poor quality, especially the image with noise interference or low contrast, may provide insufficient data for the visual interpretation of the affected part. Image segmentation provides an effec...
详细信息
In MRI, the image with poor quality, especially the image with noise interference or low contrast, may provide insufficient data for the visual interpretation of the affected part. Image segmentation provides an effective method to facilitate early detection and further diagnosis. By introducing a Particle Swarm Optimization (PSO) initialization step and a novel dissimilarity measure metric, we present a local information kernelized fuzzyc-means (LIKFcM) algorithm for image segmentation. The dissimilarity measure metric, considering an adaptive tradeoff weighted factor, incorporates the Mahalanobis distance and outliers-rejection-based spatial term which eliminates unreliable neighboring information. By using this dissimilarity measure metric, the new algorithmcould take reliable contextual information into account and achieve better segmentation results on images with complexed boundaries. Furthermore, the adaptive tradeoff factor depends on a fast noise estimation algorithm. This factor avoids subjective adjustment and makes the LIKFcM algorithm more universal. To evaluate the performance of the proposed algorithm both quantitatively and qualitatively, experiments are conducted both on synthetic images and real-world images with different kinds of noise. Segmentation Accuracy (SA) and comparison scores are used to evaluate the performance of both proposed algorithm and other methods. Experimental results illustrate that the proposed algorithm has better performance on denoising and reserving useful edges. The LIKFcM algorithm not only shows more robustness to noise but also preserves the texture details of the images.
Medical image processing is an interdisciplinary subject of integrated medical imaging, mathematics, computer science and other disciplines. With high spatial resolution, high signal-to-noise ratio and high resolution...
详细信息
Medical image processing is an interdisciplinary subject of integrated medical imaging, mathematics, computer science and other disciplines. With high spatial resolution, high signal-to-noise ratio and high resolution of soft tissue, the technology can accurately locate the target areas of interest in medical images, thus providing useful information for clinicians to formulate disease treatment plans. These techniques include digital subtraction angiography, magnetic resonance imaging, computed tomography, ultrasound imaging and positron emission tomography. The purpose of this paper is to study the application of fuzzyc-meansclustering in image analysis of critical medicine. This paper discusses the classification effect, clustering process, iteration times and running time of different algorithms, and the segmentation effect of different algorithms. By designing parameters and carrying out simulation experiments, the traditional clustering algorithm and improved local adaptive method are compared, and the problem of long coding time of traditional image compression algorithm is solved. The simulation results under the same working environment show that the coding speed of the algorithm is about five times faster than that of the traditional image compression algorithm without affecting the signal-to-noise ratio and compression rate, which proves the superiority of the algorithm.
Purpose - Data envelopment analysis (DEA) is a non-parametric model that is developed for evaluating the relative efficiency of a set of homogeneous decision-making units that each unit transforms multiple inputs into...
详细信息
Purpose - Data envelopment analysis (DEA) is a non-parametric model that is developed for evaluating the relative efficiency of a set of homogeneous decision-making units that each unit transforms multiple inputs into multiple outputs. However, usually the decision-making units are not completely similar. The purpose of this paper is to propose an algorithm for DEA applications when considered DMUs are non-homogeneous. Design/methodology/approach - To reach this aim, an algorithm is designed to mitigate the impact of heterogeneity on efficiency evaluation. Using fuzzy c-means algorithm, a fuzzyclustering is obtained for DMUs based on their inputs and outputs. Then, the fuzzyc-means based DEA approach is used for finding the efficiency of DMUs in different clusters. Finally, the different efficiencies of each DMU are aggregated based on the membership values of DMUs in clusters. Findings - Heterogeneity causes some positive impact on some DMUs while it has negative impact on other ones. The proposed method mitigates this undesirable impact and a different distribution of efficiency score is obtained that neglects this unintended impacts. Research limitations/implications - The proposed method can be applied in DEA applications with a large number of DMUs in different situations, where some of them enjoyed the good environmental conditions, while others suffered from bad conditions. Therefore, a better assessment of real performance can be obtained. Originality/value - The paper proposed a hybrid algorithmcombination of fuzzyc-meansclustering method with classic DEA models for the first time.
This letter presents a new inter-cluster proximity index for fuzzy partitions obtained from the fuzzy c-means algorithm. It is defined as the average proximity of all possible pairs of clusters. The proximity of each ...
详细信息
This letter presents a new inter-cluster proximity index for fuzzy partitions obtained from the fuzzy c-means algorithm. It is defined as the average proximity of all possible pairs of clusters. The proximity of each pair of clusters is determined by the overlap and the separation of the two clusters. The former is quantified by using concepts of fuzzy Rough sets theory and the latter by computing the distance between cluster centroids. Experimental results indicate the efficiency of the proposed index.
Data mining is the process of discovering meaningful new correlation, patterns and trends by sifting through large amounts of data, using pattern recognition technologies as well as statistical and mathematical techni...
详细信息
Data mining is the process of discovering meaningful new correlation, patterns and trends by sifting through large amounts of data, using pattern recognition technologies as well as statistical and mathematical techniques. cluster analysis is often used as one of the major data analysis technique widely applied for many practical applications in emerging areas of data mining. Two of the most delegated, partition based clustering algorithms namely k-means and fuzzyc-means are analyzed in this research work. These algorithms are implemented by means of practical approach to analyze its performance, based on their computational time. The telecommunication data is the source data for this analysis. The connection oriented broad band data is used to find the performance of the chosen algorithms. The distance (Euclidian distance) between the server locations and their connections are rearranged after processing the data. The computational complexity (execution time) of each algorithm is analyzed and the results are compared with one another. By comparing the result of this practical approach, it was found that the results obtained are more accurate, easy to understand and above all the time taken to process the data was substantially high in fuzzy c-means algorithm than the k-means. (c) 2014 Elsevier B.V. All rights reserved.
This paper presents a novel fuzzyc-means (FcM) clustering simultaneously incorporating local and global information (FLGIcM) method to unsupervised change detection (cD) from remotely sensed images. A new factor incl...
详细信息
This paper presents a novel fuzzyc-means (FcM) clustering simultaneously incorporating local and global information (FLGIcM) method to unsupervised change detection (cD) from remotely sensed images. A new factor including three local, global and edge parameters is added into the conventional FcM to enhance the insensitivity to noise and preserve detailed features. The spatial attraction between the central pixel and its neighborhood pixels is incorporated as a local parameter to utilize spatial information. A global parameter designed based on the estimated mean values of changed and unchanged pixels is introduced into the new factor to enhance its robustness and ability of separating changed from unchanged pixels. In addition, an edge parameter is also added to remain accurate edges and change details. Two experiments were carried out on Landsat images to test the performance of FLGIcM. Experimental results indicate that FLGIcM always achieves high accuracy and overperforms some state-of-the-art cD methods. Therefore, the proposed FLGIc provides an effective unsupervised cD method.
In this research, we investigated the performance of the combination of fuzzyc-means and latent Dirichlet allocation algorithms for Arabic multi-document summarization. The summary should include the most essential s...
详细信息
In this research, we investigated the performance of the combination of fuzzyc-means and latent Dirichlet allocation algorithms for Arabic multi-document summarization. The summary should include the most essential sentences from multi-documents with the same topic. The TAc-2011 corpus is used for experiments, first, the documents in the corpus are clustered using fuzzy c-means algorithm. The aim of the clustering process here is to classify the documents according to their topics, e.g., economic, politic, sport, etc. The results are compared against some recent Arabic summarization approaches that used ant colony and discriminant analysis algorithms. The proposed approach has obtained competitive results compared to those recent approaches.
暂无评论