A novel approach to workload prediction using attention-based LSTM encoder-decoder network in cloud environment

Server workload in the form of cloud-end clusters is a key factor in server maintenance and task scheduling. How to balance and optimize hardware resources and computation resources should thus receive more attention. However, we have observed that the disordered execution of running application and...

Full description

Saved in:
Bibliographic Details
Published inEURASIP journal on wireless communications and networking Vol. 2019; no. 1; pp. 1 - 18
Main Authors Zhu, Yonghua, Zhang, Weilin, Chen, Yihai, Gao, Honghao
Format Journal Article
LanguageEnglish
Published Cham Springer International Publishing 17.12.2019
Springer Nature B.V
SpringerOpen
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Server workload in the form of cloud-end clusters is a key factor in server maintenance and task scheduling. How to balance and optimize hardware resources and computation resources should thus receive more attention. However, we have observed that the disordered execution of running application and batching seriously cuts down the efficiency of the server. To improve the workload prediction accuracy, this paper proposes an approach using the long short-term memory (LSTM) encoder-decoder network with attention mechanism. First, the approach extracts the sequential and contextual features of the historical workload data through the encoder network. Second, the model integrates the attention mechanism into the decoder network, through which the prediction for batch workloads can be carried out. Third, experiments carried out on Alibaba and Dinda workload traces dataset demonstrate that our method achieves state-of-the-art performance in mixed workload prediction in cloud computing environment. Furthermore, we also propose a scroll prediction method, which splits a long prediction sequence into several small sequences to monitor and control prediction accuracy. This work helps to dynamically guide the configuration for workload balancing.
ISSN:1687-1499
1687-1472
1687-1499
DOI:10.1186/s13638-019-1605-z