咨询与建议

看过本文的还看了

相关文献

该作者的其他文献

文献详情 >mrMoulder: A recommendation-ba... 收藏

mrMoulder: A recommendation-based adaptive parameter tuning approach for big data processing platform

mrMoulder : 为大数据处理调节途径的一个基于建议的适应参数平台

作     者:Cai, Lin Qi, Yong Wei, Wei Wu, Jinsong Li, Jingwei 

作者机构:Xi An Jiao Tong Univ Sch Elect & Informat Engn Xian 710049 Shaanxi Peoples R China Xian Univ Technol Sch Comp Sci & Engn Xian 710048 Shaanxi Peoples R China Univ Chile Dept Elect Engn Av Tupper 2007 Santiago 8370451 Chile 

出 版 物:《FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE》 (下代计算机系统)

年 卷 期:2019年第93卷

页      面:570-582页

核心收录:

学科分类:08[工学] 0812[工学-计算机科学与技术(可授工学、理学学位)] 

基  金:National Natural Science Foundation of China 

主  题:Big data processing Performance optimization Parameter tuning Online configuration recommendation Collaborative filtering 

摘      要:Nowadays the world has entered the big data era. Big data processing platforms, such as Hadoop and Spark, are increasingly adopted by many applications, in which there are numerous parameters that can be tuned to improve processing performance for big data platform operators. However, due to the large number of these parameters and the complex relationship among them, it is very time-consuming to manually tune parameters. Therefore, it is a challenge to automatically configure parameters as quickly as possible to optimize the performance of the current job. Existing auto-tuning methods often take a certain time before job runs to get the optimal configuration, which would increase the job s total processing time and reduce the overall efficiency of cluster. In this paper, we propose an adaptive tuning framework, mrMoulder, to recommend a near-optimal configuration for the new job in a short time. mrMoulder sets a self-extending configuration repository and a collaborative filtering based recommendation engine, to speed up the process of optimizing parameter configuration. We have deployed mrMoulder in a Hadoop cluster, and the experiment results have demonstrated that, for a new big data application, the recommend time of mrMoulder is only 20% to 30% of that for the existing auto-tuning methods, while the recommendation quality remains almost unchanged. (C) 2018 Elsevier B.V. All rights reserved.

读者评论 与其他读者分享你的观点

用户名:未登录
我的评分