版权所有:内蒙古大学图书馆 技术提供:维普资讯• 智图
内蒙古自治区呼和浩特市赛罕区大学西街235号 邮编: 010021
作者机构:College of Information Engineering Yangzhou University Yangzhou China Institute of Scientific and Technical Information of China Beijing China Joint NTU-UBC Research Centre of Excellence in Active Living for the Elderly Nanyang Technological University Singapore Joint NTU-WeBank Research Centre on Fintech Nanyang Technological University Singapore College of Computer Science and Technology Jilin University Changchun China School of Computer Science and Technology Harbin Institute of Technology Shenzhen China School of Computer Science and Engineering Nanyang Technological University Singapore
出 版 物:《TechRxiv》 (TechRxiv)
年 卷 期:2021年
核心收录:
主 题:Clustering algorithms
摘 要:The widely applied density peak clustering (DPC) algorithm makes an intuitive cluster formation assumption that cluster centers are often surrounded by data points with lower local density and far away from other data points with higher local density. However, this assumption suffers from one limitation that it is often problematic when identifying clusters with lower density because they might be easily merged into other clusters with higher density. As a result, DPC may not be able to identify clusters with variational density. To address this issue, we propose a variational density peak clustering (VDPC) algorithm, which is designed to systematically and autonomously perform the clustering task on datasets with various types of density distributions. Specifically, we first propose a novel method to identify the representatives among all data points and construct initial clusters based on the identified representatives for further analysis of the clusters’ property. Furthermore, we divide all data points into different levels according to their local density and propose a unified clustering framework by combining the advantages of both DPC and DBSCAN. Thus, all the identified initial clusters spreading across different density levels are systematically processed to form the final clusters. To evaluate the effectiveness of the proposed VDPC algorithm, we conduct extensive experiments using 20 datasets including eight synthetic, six real-world and six image datasets. The experimental results show that VDPC outperforms two classical algorithms (i.e., DPC and DBSCAN) and four state-of-the-art extended DPC algorithms. © 2021, CC BY-NC-SA.