Machine Learning Predictions of Runtime and IO Traffic on High-End Clusters
We use supervised machine learning algorithms (i.e., Decision Trees, Random Forest, and K-nearest Neighbors) to predict performance characteristics such as runtime and IO traffic of batch jobs on high-end clusters, using only user job scripts as input. We show that decision trees outperform other al...
Saved in:
Published in | 2016 IEEE International Conference on Cluster Computing (CLUSTER) pp. 255 - 258 |
---|---|
Main Authors | , , , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
01.09.2016
|
Subjects | |
Online Access | Get full text |
ISSN | 2168-9253 |
DOI | 10.1109/CLUSTER.2016.58 |
Cover
Summary: | We use supervised machine learning algorithms (i.e., Decision Trees, Random Forest, and K-nearest Neighbors) to predict performance characteristics such as runtime and IO traffic of batch jobs on high-end clusters, using only user job scripts as input. We show that decision trees outperform other algorithms and accurately predict the runtime of 73% of jobs within a error tolerance of 10 minutes, which is a 51% improvement over the user requested runtime. |
---|---|
ISSN: | 2168-9253 |
DOI: | 10.1109/CLUSTER.2016.58 |