Search results - Inner Mongolia University Library

Fundamental clustering algorithms suite

SOFTWAREX, 2021, Vol. 13

Authors: Thrun, Michael C.; Stier, Quirin. Affiliations: Philipps Univ Marburg Databion Res Grp Hans Meerwein Str 6 D-35043 Marburg Germany; Philipps Univ Marburg Dept Hematol Oncol & Immunol Hans Meerwein Str 6 D-35043 Marburg Germany

The article presents immediate access to over fifty fundamental clustering algorithms. Additionally, access to clustering benchmark datasets previously published as the "Fundamental Clustering Problems Suite" (FCPS) is provided. The software library is named "FCPS"; it is available in R on CRAN and accessible from Python. The input and output of the clustering algorithms are standardized so that users can run a cluster analysis quickly. By combining mirrored-density plots (MD plots) with statistical testing, FCPS provides a tool to investigate cluster tendency quickly before the cluster analysis itself. Common clustering challenges can be generated with an arbitrary sample size. Additionally, FCPS summarizes 26 indicators intended to estimate the number of clusters and provides an appropriate implementation of clustering accuracy for more than two clusters. (C) 2020 The Author(s). Published by Elsevier B.V.

Keywords: Cluster analysis; Clustering algorithms; Clusterability; Cluster-tendency; Number of clusters
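The abstract above mentions clustering accuracy for more than two clusters. Because cluster ids are arbitrary, such a measure has to search for the best one-to-one mapping between cluster ids and class ids. The following is a minimal Python sketch of that idea, not the FCPS implementation; the function name `clustering_accuracy` and the example labels are assumptions made for illustration. It tries every mapping exhaustively, so it is only practical for a small number of clusters.

```python
from itertools import permutations


def clustering_accuracy(true_labels, cluster_labels):
    """Best accuracy over all one-to-one mappings of cluster ids to class ids."""
    classes = sorted(set(true_labels))
    clusters = sorted(set(cluster_labels))
    # Pad the class list with None so surplus clusters can map to "no class".
    size = max(len(classes), len(clusters))
    padded_classes = classes + [None] * (size - len(classes))
    best_hits = 0
    for perm in permutations(padded_classes, len(clusters)):
        mapping = dict(zip(clusters, perm))
        hits = sum(1 for t, c in zip(true_labels, cluster_labels) if mapping[c] == t)
        best_hits = max(best_hits, hits)
    return best_hits / len(true_labels)


# Hypothetical labels: three classes, cluster ids permuted, one point misplaced.
truth = [0, 0, 0, 1, 1, 1, 2, 2, 2]
found = [2, 2, 2, 0, 0, 1, 1, 1, 1]
print(clustering_accuracy(truth, found))
```

For larger numbers of clusters, an assignment solver such as the Hungarian algorithm would replace the exhaustive loop.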

An Approach to Study the Poverty Reduction Effect of Digital Inclusive Finance from a Multidimensional Perspective Based on Clustering Algorithms

SCIENTIFIC PROGRAMMING, 2021, Vol. 2021, Issue 1

Authors: Zhou, Lu; Wang, Huiling. Affiliations: Tianjin Univ Finance & Econ Coll Finance Tianjin 300222 Peoples R China; Tianjin Renai Coll Tianjin 301636 Peoples R China; Chongqing City Management Coll Chongqing 401331 Peoples R China

The evaluation of clustering algorithms is intrinsically difficult because of the lack of objective measures. On the basis of the DIFI and China's provincial panel data, this study tests the poverty reduction effect of digital inclusive finance in three dimensions (income, education, and healthcare) and further examines the transmission mechanism of digital inclusive finance in poverty alleviation. The results indicate that digital inclusive finance exerts a poverty reduction effect in three dimensions: medical poverty, income poverty, and education poverty. Of these, the coverage breadth significantly affects the alleviation of medical poverty, the use depth significantly affects the alleviation of income poverty and education poverty, and the digitization level affects the alleviation of poverty in all three dimensions. The level of regional economic development plays an intermediary role in the poverty alleviation effect of digital inclusive finance. Compared with the western region, which is relatively less developed, the poverty reduction effect of digital inclusive finance in the eastern region is more significant.

Keywords: Clustering algorithms

Holistic Assessment of Structure Discovery Capabilities of Clustering Algorithms

European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD)

Authors: Hoeppner, Frank; Jahnke, Maximilian. Affiliations: Ostfalia Univ Appl Sci Dept Comp Sci D-38302 Wolfenbuttel Germany

ISBN: (print) 9783030461508; 9783030461492

Existing cluster validity indices often possess a similar bias as the clustering algorithm they were introduced for, e.g. to determine the optimal number of clusters. We suggest an efficient and holistic assessment of the structure discovery capabilities of clustering algorithms based on three criteria. We determine the robustness or stability of cluster assignments and interpret it as the confidence of the clustering algorithm in its result. This information is then used to label the data and evaluate the consistency of the stability-assessment with the notion of a cluster as an area of dense and separated data. The resulting criteria of stability, structure and consistency provide interpretable means to judge the capabilities of clustering algorithms without the typical biases of prominent indices, including the judgment of a clustering tendency.

Keywords: Clustering algorithms

Improving the Clustering Algorithms Automatic Generation Process with Cluster Quality Indexes

20th International Conference on Computational Science and Its Applications (ICCSA)

Authors: Montenegro, Michel; Meiguins, Aruanda; Meiguins, Bianchi; Morais, Jefferson. Affiliations: Fed Univ Para Comp Sci Postgrad Program BR-66075110 Belem Para Brazil

ISBN: (print) 9783030587994; 9783030587987

Autoclustering is a computational tool for the automatic generation of clustering algorithms, which combines and evaluates the main parts of density-based algorithms to generate more appropriate solutions for a given dataset for clustering tasks. Autoclustering uses the Estimation of Distribution Algorithms (EDA) evolutionary technique to create the algorithms (individuals), and an adapted CLEST method (which originally determines the best number of groups for a dataset) to compute individual fitness, using a decision-tree classifier. To improve the quality of the results generated by Autoclustering and to avoid possible bias from the adoption of a classifier, this work proposes to increase the efficiency of the evaluation process by adding a quality metric based on a fusion of three quality indexes of solution clusters. The three quality indexes are Silhouette, Dunn, and Davies-Bouldin, which assess intra-cluster and inter-cluster structure with distance-based computations that are independent of how the groups were generated. The final score for a specific solution (algorithm + parameters) is the average of the normalized quality metric and the normalized fitness. The proposal produced solutions with higher cluster quality metrics, higher average fitness, and higher diversity of generated individuals (clustering algorithms) than traditional Autoclustering.

Keywords: Autoclustering; Cluster quality index; Clustering algorithms
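The abstract above describes the final score as the average of a normalized quality metric (fused from the Silhouette, Dunn, and Davies-Bouldin indexes) and a normalized fitness. The Python sketch below illustrates one way such a score could be computed. The abstract does not give the fusion or normalization formulas, so the min-max scaling, the inversion of Davies-Bouldin (where lower is better), the plain mean used as the fusion, and all names and numbers are assumptions for illustration only.

```python
def min_max(values):
    """Scale a list of numbers to [0, 1]; a constant list maps to all zeros."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def final_scores(silhouette, dunn, davies_bouldin, fitness):
    """One score per candidate solution; higher is better."""
    sil = min_max(silhouette)
    dun = min_max(dunn)
    # Davies-Bouldin is better when smaller, so it is inverted after scaling.
    dbi = [1.0 - v for v in min_max(davies_bouldin)]
    # Assumed fusion: plain mean of the three scaled indexes.
    quality = [(s + d + b) / 3.0 for s, d, b in zip(sil, dun, dbi)]
    fit = min_max(fitness)
    return [(q + f) / 2.0 for q, f in zip(quality, fit)]


# Hypothetical index values for three candidate solutions.
scores = final_scores(
    silhouette=[0.2, 0.5, 0.8],
    dunn=[0.1, 0.3, 0.2],
    davies_bouldin=[1.5, 1.0, 0.5],
    fitness=[0.6, 0.9, 0.7],
)
best = max(range(len(scores)), key=scores.__getitem__)
print(best, scores)
```

In this made-up example the second candidate ranks first because its fitness advantage outweighs the third candidate's slightly better fused quality.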

Performance evaluation of clustering algorithms for varying cardinality and dimensionality of data sets

1st International Conference on Recent Advances in Materials and Manufacturing (ICRAMM)

Authors: Renjith, Shini; Sreekumar, A.; Jathavedan, M. Affiliations: Cochin Univ Sci & Technol Dept Comp Applicat Kochi 682022 Kerala India; Mar Baselios Coll Engn & Technol Dept Comp Sci & Engn Thiruvananthapuram 695015 Kerala India

Clustering is the most widely used unsupervised machine learning technique, with extensive applications in statistical analysis. Multiple clustering algorithms are available in theory, and many more implementations are available in practice. A considerable body of literature focuses on the quality of clustering algorithms using various internal and external evaluation techniques. The motivation behind this work is the scarcity of literature dealing with the performance of clustering algorithms in terms of turnaround time. This paper summarizes the experimental analysis conducted on the performance of multiple clustering algorithms based on cardinality and dimensionality. The analysis is performed in R, a free and open source programming language mainly used for statistical computing. This work evaluates nine key algorithms from the partitioning, hierarchical, density-based and model-based clustering approaches using different social media data sets. We captured performance trends of these algorithms in terms of turnaround time by varying the cardinality and dimensionality parameters of the data sets. Based on our experiments, the CLARA, CLARANS, and k-means algorithms demonstrate the best performance with varying cardinality. It is also observed that changes in dimensionality do not impact hierarchical clustering approaches, whereas there is a positive influence on the execution time for partitioning, density-based and model-based clustering approaches. © 2019 Elsevier Ltd. All rights reserved.

Keywords: Clustering algorithms; Clustering performance; Clustering quality; Social media; Turnaround time

Towards the Use of Clustering Algorithms in Recommender Systems

Conference of the Association-for-Information-Systems (AMCIS)

Authors: Miranda, Leandro; Viterbo, Jose; Bernardini, Flavia. Affiliations: Univ Fed Fluminense Niteroi RJ Brazil

ISBN: (print) 9781733632546

Recommender Systems have been intensively used in Information Systems in recent decades, facilitating the choice of items individually for each user based on their history. Clustering techniques have been frequently used in commercial and scientific domains in data mining tasks and visualization tools. However, there is a lack of secondary studies in the literature that analyze the use of clustering algorithms in Recommender Systems and their behavior in different aspects. In this work, we present a Systematic Literature Review (SLR), which discusses the different types of information systems that use clustering algorithms in Recommender Systems, which typically involve three main recommendation approaches found in the literature: collaborative filtering, content-based filtering, and hybrid recommendation. Finally, we performed a quantitative analysis using K-means clustering to find patterns among clustering algorithms, recommendation approaches, and some datasets used in the publications.

Keywords: Machine learning; Clustering algorithms; Recommender systems

Clustering Algorithms in an Educational Context: An Automatic Comparative Approach

IEEE ACCESS, 2020, Vol. 8, pp. 146994-147014

Authors: Hooshyar, Danial; Yang, Yeongwook; Pedaste, Margus; Huang, Yueh-Min. Affiliations: Univ Tartu Inst Educ EE-50090 Tartu Estonia; Natl Cheng Kung Univ Dept Engn Sci Tainan 701 Taiwan

Despite an increasing consensus regarding the significance of properly identifying the most suitable clustering method for a given problem, a surprising amount of educational research, including both educational data mining (EDM) and learning analytics (LA), neglects this critical task. This shortcoming could in many cases have a negative impact on the prediction power of both the EDM and LA based approaches. To address such issues, this work proposes an evaluation approach that automatically compares several clustering methods using multiple internal and external performance measures on 9 real-world educational datasets of different sizes, created from the University of Tartu's Moodle system, to produce two-way clustering. Moreover, to investigate the possible effect of normalization on the performance of the clustering algorithms, this work performs the same experiment on a normalized version of the datasets. Since such an exhaustive evaluation includes multiple criteria, the proposed approach employs a multiple criteria decision-making method (i.e., TOPSIS) to rank the most suitable methods for each dataset. Our results reveal that the proposed approach can automatically compare the performance of the clustering methods and accordingly recommend the most suitable method for each dataset. Furthermore, our results show that in both normalized and nonnormalized datasets of different sizes with 10 features, DBSCAN and k-medoids are the best clustering methods, whereas agglomerative and spectral methods appear to be among the most stable and highly performing clustering methods for such datasets with 15 features. Regarding datasets with more than 15 features, OPTICS is among the top-ranked algorithms among the nonnormalized datasets, and k-medoids is the best among the normalized datasets. Interestingly, our findings reveal that normalization may have a negative effect on the performance of certain methods, e.g., spectral clustering and OPTICS; however, it appears to m

Keywords: Clustering methods; Clustering algorithms; Task analysis; Decision making; Education; Size measurement; Educational context; clustering methods; multiple criteria decision-making; educational data mining; learning analytics
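The abstract above ranks clustering methods with TOPSIS, a multiple criteria decision-making method that scores each alternative by its closeness to an ideal solution. The Python sketch below shows the standard TOPSIS steps (vector normalization, weighting, distances to the ideal and anti-ideal points, closeness score). It is not the authors' code; the criteria, weights, and numbers are made up for illustration.

```python
import math


def topsis(matrix, weights, benefit):
    """Return closeness scores in [0, 1]; higher means closer to the ideal.

    matrix  : rows are alternatives, columns are criteria
    weights : one weight per criterion
    benefit : True if larger is better for that criterion, else False
    """
    n_crit = len(weights)
    # Vector-normalize each column (guarding all-zero columns), then weight it.
    norms = [math.sqrt(sum(row[j] ** 2 for row in matrix)) or 1.0 for j in range(n_crit)]
    scaled = [[weights[j] * row[j] / norms[j] for j in range(n_crit)] for row in matrix]
    cols = list(zip(*scaled))
    ideal = [max(c) if benefit[j] else min(c) for j, c in enumerate(cols)]
    anti = [min(c) if benefit[j] else max(c) for j, c in enumerate(cols)]
    scores = []
    for row in scaled:
        d_best = math.dist(row, ideal)   # distance to the ideal point
        d_worst = math.dist(row, anti)   # distance to the anti-ideal point
        total = d_best + d_worst
        scores.append(d_worst / total if total else 0.0)
    return scores


# Hypothetical measurements: column 0 is a silhouette-like score (higher is
# better), column 1 is a Davies-Bouldin-like score (lower is better).
methods = ["method_a", "method_b", "method_c"]
measurements = [[0.6, 0.8], [0.4, 1.2], [0.7, 0.5]]
scores = topsis(measurements, weights=[0.5, 0.5], benefit=[True, False])
for name, score in sorted(zip(methods, scores), key=lambda pair: -pair[1]):
    print(name, round(score, 3))
```

Here `method_c` is best on both criteria and therefore coincides with the ideal point, while `method_b` is worst on both and coincides with the anti-ideal point.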

Clustering Algorithms and Validation Indices for mmWave Radio Multipath Propagation

18th Annual Wireless Telecommunications Symposium (WTS)

Authors: Moayyed, Miead Tehrani; Antonescu, Bogdan; Basagni, Stefano. Affiliations: Northeastern Univ Inst Wireless IoT Boston MA 02115 USA

ISBN: (print) 9781538683804

Transmissions in the mmWave spectrum benefit from a-priori knowledge of radio channel propagation models. This paper is concerned with one important task that helps provide a more accurate channel model, namely, the clustering of all multipath components arriving at the receiver. Our work focuses on directive transmissions in urban outdoor scenarios and shows the importance of the correct estimation of the number of clusters for mmWave radio channels simulated with a software ray-tracer tool. We investigate the effectiveness of k-means and k-power-means clustering algorithms in predicting the number of clusters through the use of cluster validity indices (CMIs) and score fusion techniques. Our investigation shows that clustering is a difficult task because the optimal number of clusters is not always given by one CMI or by a combination of several CMIs. However, using score fusion methods, we find the optimal partitioning for the k-means algorithm based on the power and time of arrival of the multipath rays or based on their angle of arrival. When the k-power-means algorithm is used, the power of each arriving ray is the most important clustering factor, making the dominant received paths pull the other ones around them to form a cluster. Thus, the number of clusters is smaller and the decision based on CMIs or score fusion factors is easier to make.

Keywords: mmWave; clustering algorithms; cluster validity indices; channel propagation models
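The abstract above notes that in k-power-means the dominant received paths pull the other paths toward them. One way to picture this is a centroid computed as a power-weighted mean, so that strong rays move the cluster center more than weak ones. The Python sketch below shows only that centroid step under this assumption; it is not the paper's implementation, and the ray features (delay in ns, angle in degrees) and power values are hypothetical.

```python
def power_weighted_centroid(points, powers):
    """Mean of the points, each weighted by its (linear-scale) received power."""
    total = sum(powers)
    dims = len(points[0])
    return [sum(p * x[d] for x, p in zip(points, powers)) / total for d in range(dims)]


def plain_centroid(points):
    """Unweighted mean, as used by ordinary k-means."""
    return power_weighted_centroid(points, [1.0] * len(points))


# Hypothetical rays: (delay in ns, angle of arrival in degrees).
rays = [[10.0, 0.0], [20.0, 0.0], [60.0, 0.0]]
powers = [8.0, 1.0, 1.0]  # the first ray dominates

print(power_weighted_centroid(rays, powers))  # [16.0, 0.0]
print(plain_centroid(rays))                   # [30.0, 0.0]
```

The weighted centroid sits close to the dominant ray, whereas the unweighted one is dragged toward the weak late arrival.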

Semantic Web and Web Page Clustering Algorithms: A Landscape View

EAI Endorsed Transactions on Energy Web, 2021, Vol. 8, Issue 33, pp. 1-14

Authors: Obaid, Ahmed J.; Chatterjee, Tanusree; Bhattacharya, Abhishek. Affiliations: Faculty of Computer Science and Mathematics, University of Kufa, Iraq; Techno International NewTown, India; Institute of Engineering and Management, India

The semantic web has evolved primarily toward exchanging data between applications in all domains of activity. Based on this vision, different applications have recently been designed, e.g. in the fields of community web portals, social networking, e-learning, and multimedia retrieval. Due to the growing number of web services, clustering of web resources has become a valuable tool for semantic web mining. Clustering of internet objects such as web pages calls for new methods of grouping correlated content, both for better understanding and for handling the massive result sets returned by web page searches. Hence, web page clustering algorithms should be able to handle massive irregular content and discover knowledge regardless of the web page complexity. These algorithms vary depending on the characteristics and data types. So, choosing the most appropriate algorithm is not an easy process as it should be accurate in terms of time and space complexity. Therefore, this paper rigorously surveys the most important algorithms of different types used for web page clustering. In addition, a comparative analysis of all such algorithms is provided in terms of several parameters. Finally, a brief discussion is provided on why web page clustering is important in the emerging era of Semantic Web of Things (SWoT) applications. Copyright © 2020 Ahmed J. Obaid et al., licensed to EAI. This is an open access article distributed under the terms of the Creative Commons Attribution license, which permits unlimited use, distribution and reproduction in any medium so long as the original work is properly cited.

Keywords: Clustering algorithms

A Comprehensive Analysis of Kernelized Hybrid Clustering Algorithms with Firefly and Fuzzy Firefly Algorithms

5th International Conference on Computational Intelligence in Data Mining, ICCIDM 2018

Authors: Tripathy, B.K.; Agrawal, Anmol. Affiliations: School of Information Technology and Engineering, VIT University, Vellore, Tamil Nadu 632014, India

ISBN: (print) 9789811386756

In order to handle the problem of linear separability in the early data clustering algorithms, Euclidean distance is being replaced with kernel functions as measures of similarity. Another problem with the clustering algorithms is the random selection of initial centroids, which not only affects the final result but also decreases the convergence rate. Optimal selection of initial centroids through optimization algorithms like the Firefly or Fuzzy Firefly algorithms provides a partial solution to this problem. In this paper, we focus on two kernels, Gaussian and hyper-tangent, and use both the Firefly and Fuzzy Firefly algorithms separately along with algorithms like FCM, IFCM and RFCM, and analyse their efficiency using two measures, DB and D. Our analysis concludes that RFCM with the hyper-tangent kernel and fuzzy firefly produces the best results with the fastest convergence rate. We use two images, an MRI scan of a human brain and an image of blood cancer cells, for our analysis. © 2020, Springer Nature Singapore Pte Ltd.

Keywords: Clustering algorithms
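The abstract above describes replacing Euclidean distance with kernel functions. For a kernel K, the squared distance between a point x and a centroid c in the induced feature space is K(x,x) - 2K(x,c) + K(c,c); for a Gaussian kernel, K(v,v) = 1, so this reduces to 2(1 - K(x,c)). The Python sketch below illustrates that identity. The exact kernel parameterization used in the paper is not given in the abstract, so the form exp(-||x-y||^2 / (2*sigma^2)) and the function names are assumptions.

```python
import math


def gaussian_kernel(x, y, sigma=1.0):
    """Gaussian (RBF) kernel; the 2*sigma^2 scaling is an assumed convention."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))


def kernel_distance_sq(x, c, sigma=1.0):
    """Squared feature-space distance between x and c under the Gaussian kernel."""
    # K(x,x) - 2K(x,c) + K(c,c) with K(v,v) = 1 simplifies to 2 * (1 - K(x,c)).
    return 2.0 * (1.0 - gaussian_kernel(x, c, sigma))


center = [0.0, 0.0]
for point in ([0.0, 0.0], [1.0, 0.0], [3.0, 0.0]):
    print(point, kernel_distance_sq(point, center))
```

The distance is zero for identical points, grows with Euclidean separation, and saturates below 2, which limits the influence of far-away points on a centroid.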
