咨询与建议

看过本文的还看了

相关文献

该作者的其他文献

文献详情 >Early straggler tasks detectio... 收藏

Early straggler tasks detection by recurrent neural network in a heterogeneous environment

作     者:Bawankule, Kamalakant Laxman Dewang, Rupesh Kumar Singh, Anil Kumar 

作者机构:Motilal Nehru Natl Inst Technol Allahabad Dept Comp Sci & Engn Pryagraj Uttar Pradesh India 

出 版 物:《APPLIED INTELLIGENCE》 (Appl Intell)

年 卷 期:2023年第53卷第7期

页      面:7369-7389页

核心收录:

学科分类:08[工学] 0812[工学-计算机科学与技术(可授工学、理学学位)] 

基  金:Quality Improvement Program of All India Council for Technical Education(AICTE)  India 

主  题:ANN (Artificial neural network) ESDRNN (Early straggler detection by a recurrent neural network) Hadoop Heterogeneous environment MapReduce RNN (Recurrent neural network) TTE (Time to end) 

摘      要:Heterogeneity is common in parallel and distributed environments used for extensive computations such as MapReduce. Stragglers are the tasks that are running on inferior performing nodes in the cluster. Early detection of stragglers is always challenging in such environments. In the previously proposed approaches, late detection of straggler tasks and estimation of time to end (TTE) for all the tasks running in a heterogeneous environment delays the entire job execution. Early straggler detection help to speculate a task at the early stages of task execution which indirectly improves the complete job execution. This article proposed early straggler detection by a recurrent neural network (ESDRNN) that collects the task and node information every three seconds from ApplicationMaster to train the RNN. It classifies the straggler tasks pretty early by RNN, between thirty to forty seconds of task execution, and transfers a list of classified tasks to an agent running on ResourceManager. RNN is a type of artificial neural network that is prevalent for processing sequential time-series data. Then, the agent predicts the TTE of these classified tasks by the Autoregressive integrated moving average (ARIMA) model. Finally, it sorts and refreshes the list with higher TTE after every ten seconds and speculates the tasks for the early completion of the MapReduce job. This proposed technique s performance is evaluated on the HiBench benchmark suite of Hadoop s most popular benchmark. Finally, compared with the default speculation technique and different techniques, the proposed speculation technique detects the stragglers early within 35 to 40 seconds of task execution. As a result, it decreases the job execution time by an average of 21% to 38% significantly for different workloads in a heterogeneous Hadoop cluster.

读者评论 与其他读者分享你的观点

用户名:未登录
我的评分