期刊论文详细信息
NEUROCOMPUTING 卷:418
A comparison of forecasting models for the resource usage of MapReduce applications
Article
Li, Yang Yuan1  Tien Van Do2,3  Nguyen, Hai T.4 
[1] Xian SiYuan Univ, Xian, Peoples R China
[2] Baoji Univ Arts & Sci, Baoji, Peoples R China
[3] Budapest Univ Technol & Econ, Dept Networked Syst & Serv, Magyar Tudosok Korutja 2, H-1117 Budapest, Hungary
[4] Budapest Univ Technol & Econ, Balatonfured Student Res Grp, Budapest, Hungary
关键词: MapReduce application;    Resource usage parameters;    LSTM model;    Multiple linear regression model;   
DOI  :  10.1016/j.neucom.2020.07.059
来源: Elsevier
PDF
【 摘 要 】

In this paper, we construct forecasting models (multivariate long short-term memory recurrent neural networks and multiple linear regression) for the resource usage prediction of four MapReduce applica-tions and applications executed within the Spark framework. We have evaluated the impact of a sample size to prediction accuracy. Also, we propose a phase modelling approach for read/write-intensive applications. Our results show that models based on long short-term memory recurrent neural networks exhibit a higher accuracy than multiple linear regression models and the intensive characteristics of a resource are closely related to the prediction accuracy of forecasting models. We investigated the hyper parameter tuning of such models and showed that a randomly initialised, shallow, well-tuned network may outperform deeper models that use stacked autoencoder initialisation. Furthermore, multivariate long short-term memory recurrent neural network models are more sensitive to sample size than multiple linear regression models. We show that an LSTM model trained in a specific machine may be used to predict the resource usage in another machine. (C) 2020 Elsevier B.V. All rights reserved.

【 授权许可】

Free   

【 预 览 】
附件列表
Files Size Format View
10_1016_j_neucom_2020_07_059.pdf 4779KB PDF download
  文献评价指标  
  下载次数:0次 浏览次数:0次