检索结果-内蒙古大学图书馆

Promote clustering Accuracies With fuzzy Supervised clustering algorithm

International Journal of Intelligent Technologies and Applied Statistics 2021年第4期14卷 205-219页

作者： Huang-Shu Fen Jeng-Ming Yih

Purpose: fuzzy algorithms of Gath-Geva (GG) and Gustafson-Kessel (GK) based on Mahalanobis distance can improve those limitations of spherical structural clusters, but GG algorithm can only be used for the data with multivariate normal distribution. GK algorithm is limited by that it must know the distribution of data. An improved supervised clustering algorithm based on fuzzy c-means (FcM) has been proposed. Methodology: We take a new threshold value and a new convergent algorithm to improve those limitations of GG and GK algorithms, delete the constraint of the determinants of covariance matrices in the GK algorithm, and replace the covariance matrix with the correlation matrix which exists in the objective function. Findings: The experimental results of real data sets show that our proposed new algorithm can promote clustering accuracies and get better performance. Value: The popular FcM algorithm based on Euclidean distance function converges to a local minimum of the objective function, which can only be used to detect spherical structural clusters. Adding fuzzy covariance matrices in their distance measure was not directly derived from the objective function. But it is not stable enough when some of its covariance matrices are not equal. Hence, different initializations may lead to different results.

关键词： fuzzy c-means algorithm Mahalanobis distance Spherical structural clusters Objective function

来源：评论

学校读者我要写书评

暂无评论

Application of clustering Analysis in Brain Gene Data Based on Deep Learning

引用

IEEE AccESS 2019年 7卷 2947-2956页

作者： Suo, Yina Liu, Tingwei Jia, Xueyong Yu, Fuxing North China Univ Sci & Technol Informat Engn Inst Tangshan 063000 Peoples R China Shandong Univ Sch Math Jinan 250100 Shandong Peoples R China North China Univ Sci & Technol Coll Elect Engn Tangshan 063000 Peoples R China

In the current research, cluster analysis has become a very good way to obtain biological information by analyzing the brain gene expression data. In recent years, many experts have used improved traditional clustering algorithm and a new clustering algorithm to mine brain gene expression data. First, the random Forest method is used to preprocess high-dimensional and high-complexity brain gene expression data. Then, a clustering model based on deep learning is proposed, and a clustering algorithm is implemented by using deep belief network (DBN) and fuzzy c-means algorithm (FcM). This model makes full use of the generality of unsupervised learning of deep learning and clustering technology, combines the advantages of deep learning with clustering, and makes clustering effect better and more convenient for clustering high-dimensional data.

关键词： Deep belief network fuzzy c-means algorithm unsupervised learning brain gene data clustering

来源：评论

学校读者我要写书评

暂无评论

A stratified sampling based clustering algorithm for large-scale data

引用

KNOWLEDGE-BASED SYSTEMS 2019年 163卷 416-428页

作者： Zhao, Xingwang Liang, Jiye Dang, chuangyin Shanxi Univ Sch Comp & Informat Technol Key Lab Computat Intelligence & Chinese Informat Minist Educ Taiyuan 030006 Shanxi Peoples R China City Univ Hong Kong Dept Syst Engn & Engn Management Hong Kong Peoples R China

Large-scale data analysis is a challenging and relevant task for present-day research and industry. As a promising data analysis tool, clustering is becoming more important in the era of big data. In large-scale data clustering, sampling is an efficient and most widely used approximation technique. Recently, several sampling-based clustering algorithms have attracted considerable attention in large-scale data analysis owing to their efficiency. However, some of these existing algorithms have low clustering accuracy, whereas others have high computational complexity. To overcome these deficiencies, a stratified sampling based clustering algorithm for large-scale data is proposed in this paper. Its basic steps include: (1) obtaining a number of representative samples from different strata with a stratified sampling scheme, which are formed by locality sensitive hashing technique, (2) partitioning the chosen samples into different clusters using the fuzzy c-means clustering algorithm, (3) assigning the out-of-sample objects into their closest clusters via data labeling technique. The performance of the proposed algorithm is compared with the state-of-the-art sampling-based fuzzy c-means clustering algorithms on several large-scale data sets including synthetic and real ones. The experimental results show that the proposed algorithm outperforms the related algorithms in terms of clustering quality and computational efficiency for large-scale data sets. (c) 2018 Published by Elsevier B.V.

关键词： Large-scale data fuzzy c-means algorithm Stratified sampling Data labeling

来源：评论

学校读者我要写书评

暂无评论

Research on Leak Location Method of Water Supply Network based on Deep Neural Network Model

Research on Leak Location Method of Water Supply Network bas...

引用

第四届材料科学应用与能源材料国际研讨会

作者： Xiaoxuan Wu chen Zhang School of Artificial Intelligence and Big Data Hefei University Anhui Engineering Lab of Big Data Technology Application for Urban Infrastructure Hefei University

The water supply network is one of the important infrastructure in urban construction. It has strong theoretical and practical significance to realize the real-time monitoring and leak location of the water supply network. In this paper, based on the similarity of water supply network node pressure, fuzzy c-means clustering algorithm is used to realize the selection of finite monitoring points. On this basis, a depth neural network model is constructed according to the pressure changes of the monitoring points before and after the leakage of the water supply network, so as to locate the leakage points. In the experimental part, hydraulics simulation was conducted by using EPANETH pipe network adjustment software according to the layout structure of water supply network, and the pressure of all nodes was obtained. A deep neural network model was established by Keras in Tensorflow framework. After model training and testing, the training error was controlled within the effective range of 5%.Finally, the model is applied to the actual leakage problem of underground water supply network in Langxi county of Xuancheng city, and the accurate location of the leakage point is realized. The experiment proves the feasibility and accuracy of the method proposed in this paper.

关键词： clustering Ensemble Deep Neural Network fuzzy c-means algorithm Leak Location Water Supply Network

来源：评论

学校读者我要写书评

暂无评论

An efficient fuzzy c-means approach based on canonical polyadic decomposition for clustering big data in IoT

引用

FUTURE GENERATION cOMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF EScIENcE 2018年 88卷 675-682页

作者： Bu, Fanyu Inner Mongolia Univ Finance & Econ Coll Comp & Informat Management Hohhot Peoples R China

Mining smart data from the collected big data in Internet of Things which attempts to better human life by integrating physical devices into the information space. As one of the most important clustering techniques for drilling smart data, the fuzzy c-means algorithm (FcM) assigns each object to multiple groups by calculating a membership matrix. However, each big data object has a large number of attributes, posing an remarkable challenge on FcM for loT big data real-time clustering. In this paper, we propose an efficient fuzzy c-means approach based on the tensor canonical polyadic decomposition for clustering big data in Internet of Things. In the presented scheme, the traditional fuzzy c-means algorithm is converted to the high-order tensor fuzzy c-means algorithm (HOFcM) via a bijection function. Furthermore, the tensor canonical polyadic decomposition is utilized to reduce the attributes of every objects for enhancing the clustering efficiency. Finally, the extensive experiments are conducted to compare the developed scheme with the traditional fuzzy c-means algorithm on two large loT datasets including sWSN and eGSAD regarding clustering accuracy and clustering efficiency. The results argue that the developed scheme achieves a significantly higher clustering efficiency with a slight clustering accuracy drop compared with the traditional algorithm, indicating the potential of the developed scheme for drilling smart data from loT big data. (c) 2018 Elsevier B.V. All rights reserved.

关键词： Big data Internet of Things Smart data fuzzy c-means algorithm canonical polyadic decomposition

来源：评论

学校读者我要写书评

暂无评论

Extended power-based aggregation of distance functions and application in image segmentation

引用

INFORMATION ScIENcES 2019年 494卷 155-173页

作者： Delic, Marija Nedovic, Ljubo Pap, Endre Univ Novi Sad Fac Tech Sci Dept Fundamentals Sci Trg Dositeja Obradovica 6 Novi Sad 21000 Serbia Singidunum Univ Dept Postgrad Studies 32 Danijelova St Belgrade 11000 Serbia

In this paper, we psropose a novel method for construction of a distance function and demonstrate its application in image segmentation. In algorithms for image segmentation, distance functions represent a criterion which divides pixels into groups of segments. We introduce two extended aggregation functions, extended powers product and extended weighted arithmetic mean of powers. Their relevant properties are examined, as well as certain resulting properties of distance functions, which are constructed by an application of mentioned aggregation functions. In addition, one pixel descriptor, which is motivated by Local Binary Pattern family of descriptors (LBPs), is introduced and discussed. In the experimental section, we present an application of the introduced extended aggregation functions and descriptor, by a construction of a new distance function, used in fuzzy c-means clustering algorithm (FcM) for image segmentation. (c) 2019 Elsevier Inc. All rights reserved.

关键词： Distance function Metrics Extended aggregation function Local binary pattern Image segmentation fuzzy c-means algorithm

来源：评论

学校读者我要写书评

暂无评论

Rainfall estimation from MSG images using fuzzy association rules

引用

JOURNAL OF INTELLIGENT & fuzzy SYSTEMS 2019年第1期37卷 1357-1369页

作者： Bouaita, Bilal Moussaoui, Abdelouahab Bachari, Nour El Islam Univ Ferhat Abbas Setif 1 Dept Comp Sci Setif Algeria Univ USTHB Dept Ecol & Environm Algiers Algeria

The Meteosat Second Generation (MSG) satellite can be used to estimate rainfall through the multispectral images, which are provided every 15 min across 12 channels. However, most studies have not maximized the terabytes of data provided by the channels in this satellite, which are potentially rich in new resources that need to be exploited. Moreover, these studies classify pixels conventionally, where a pixel is considered either 100% precipitant or 0% (no-precipitant), whereas actually it cannot be classified in a clear and unambiguous way. To address this problem, we propose a method that exploits the images of the channels and constructs an estimation model in the form of fuzzy association rules to estimate the rainfall in Northeastern Algeria. Each rule is in if (condition)-then (conclusion) form, where the condition is a combination of the various fuzzy classes of MSG images, and the conclusion contains a single fuzzy class that represents the intensities of rain: no-rain, low, moderate, and high. The obtained results are compared with the data obtained by the European Organization for the Exploitation of Meteorological Satellites Multisensor Precipitation Estimate program.

关键词： Data mining MSG images apriori algorithm fuzzy association rules fuzzy c-means algorithm

来源：评论

学校读者我要写书评

暂无评论

Analysis of integrated energy-load characteristics based on sparse clustering and compressed sensingInspec keywordsOther keywords

IET ENERGY SYSTEMS INTEGRATION

引用

IET ENERGY SYSTEMS INTEGRATION 2019年第3期1卷 194-201页

作者： Wang, He Hou, Yongshan Yu, Huanan Northeast Elect Power Univ Sch Elect Engn Jilin 132012 Peoples R China

An integrated energy system not only provides a platform for multi-energy coupling utilisation but also satisfies users' diversified energy demands. However, in view of the enormous amount of integrated energy data and the difficulty of analysing the characteristics of that data, an integrated energy-analysis method based on sparse clustering and compressed sensing is proposed in this study. This method uses the fuzzy c-means algorithm to construct an over-complete dictionary and then compresses, collects, and reconstructs the integrated energy data using the compressed sensing theory method. This process analyses integrated energy-load characteristics accurately and also solves the problem of low data-transmission efficiency. Simulation results show that the method is suitable for analysing and processing integrated load data in integrated energy systems.

关键词： compressed sensing pattern clustering buildings (structures) energy consumption power engineering computing load forecasting fuzzy set theory integrated energy-load characteristics sparse clustering multienergy coupling utilisation integrated energy data compressed sensing theory method fuzzy c-means algorithm over-complete dictionary

来源：评论

学校读者我要写书评

暂无评论

Traffic Operation Data Analysis and Information Processing Based on Data Mining

引用

AUTOMATIc cONTROL AND cOMPUTER ScIENcES 2019年第3期53卷 244-252页

作者： Jiang, Zhihuang Guangdong Pei Zheng Coll Huadu Dist 510830 Guangdong Peoples R China

With the acceleration of urbanization, urban traffic problems are becoming more and more prominent. In the face of massive traffic data, it is difficult to predict traffic condition with effective data analysis methods. In order to deal with traffic data better, this study applied data mining in traffic data analysis and processing, constructed a Hadoop based data analysis system to collect and preprocess data, and analyzed traffic data using parallel distributed calculation based on MapReduce. The improved fuzzy c-means (FcM) algorithm and the random forest algorithm were used. The simulation results showed that the error rate of the improved FcM algorithm is 10% and the accuracy rate of the random forest algorithm is 92.3%, indicating the system had high reliability. Then an experiment was carried out on the main traffic roads in Huadu district of Guangzhou, china. It was found that the method was efficient and accurate and had a good application prospect.

关键词： data mining traffic operation Hadoop information processing fuzzy c-means algorithm random forest algorithm

来源：评论

学校读者我要写书评

暂无评论

An efficient fault diagnosis strategy based on SVDD and fuzzy clustering for ground-based electronic equipment

An efficient fault diagnosis strategy based on SVDD and fuzz...

引用

IEEE International conference on Signal Processing, communications and computing (IcSPcc)

作者： Wang, cheng Yang, Huahui Meng, chen Army Engn Univ Measurement Engn Dept Shijiazhuang Hebei Peoples R China

ISBN: (纸本)9781728117089

In this paper, an efficient fault detection approach which employs the Support Vector Data Description (SVDD) and fuzzy c-means algorithm (FcM) is proposed for ground-based electronic equipment. Firstly, the FcM method is applied to fault pattern mining in which the prior knowledge of equipment faults is difficult to be known. Then SVDD model is trained with different faults data independently for multi-classification. This fault diagnosis strategy can be used in health condition monitoring for ground-based electronic equipment. The experimental results verify its effectiveness in fault diagnosis with high accuracy and real-time performance.

关键词： Fault diagnosis support vector domain description fuzzy c-means algorithm condition monitoring electronic equipment

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：