文献详情 >A parallel and distributed C4.... 收藏

A parallel and distributed C4.5 algorithm in cloud computing environments

作者：Lin, Kawuu W. Zheng, Ya-Jun Chen, Ju-Chin Wang, Wei-Chiang Chen, Chao-Chun

作者机构：Natl Kaohsiung Univ Sci & Technol Dept Comp Sci & Informat Engn Jiangong Rd Kaohsiung 807618 Taiwan Natl Cheng Kung Univ Inst Mfg Informat & Syst Univ Rd Tainan 70101 Taiwan

出版物：《COMPUTING》 (Comput.)

年卷期：2025年第107卷第2期

页面：1-38页

核心收录：

学科分类：08[工学] 0812[工学-计算机科学与技术（可授工学、理学学位）]

基　　金：National Science and Technology Council of Taiwan [112-2221-E-992-072]

主　　题：Data mining Classification algorithm C4.5 Distributed mining

摘要：Data mining seeks to derive significant insights from big data, offering valuable information for decision-makers and generating economic value. Among its tasks, classification is one of the most essential, with decision tree algorithms being a widely adopted solution due to their efficiency in addressing association rules and classification challenges. Decision trees are particularly advantageous for their ability to provide interpretable results at minimal computational cost. However, the exponential growth of data in the Internet era has highlighted the limitations of traditional algorithms, which struggle to efficiently process large-scale *** address this issue, this study introduces PD-C4.5, a parallel and distributed implementation of the C4.5 algorithm designed for cloud computing environments. By incorporating a microservices architecture, the proposed approach modularizes computation, enabling flexible, scalable, and distributed execution. This design not only enhances system maintainability but also optimizes resource utilization while maintaining high computational *** evaluations demonstrate that PD-C4.5 significantly outperforms Original C4.5 and MR-C4.5 in handling medium and large datasets, achieving notable reductions in computation time and resource consumption. Additionally, the integration of microservices ensures seamless scalability to accommodate increasing data volumes. This study provides a novel and practical solution for large-scale data classification, bridging the gap between computational efficiency and scalability in the era of big data.

本地馆藏 | 借阅须知 | 我要预约

已订购，未入库

sda

目录详情 | 试阅读 |

读者评论与其他读者分享你的观点

学校读者

用户名:未登录

我的评分

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

看过本文的还看了

相关文献

该作者的其他文献

CADAL相关文献

A parallel and distributed C4.5 algorithm in cloud computing environments

读者评论与其他读者分享你的观点

请选择收藏分类：

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

看过本文的还看了

相关文献

该作者的其他文献

CADAL相关文献

A parallel and distributed C4.5 algorithm in cloud computing environments

读者评论 与其他读者分享你的观点

请选择收藏分类： 新增自定义分类 确定 取消

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

读者评论与其他读者分享你的观点

请选择收藏分类：