Comparison and analysis of software and hardware energy measurement methods for a CPU+GPU system and selected parallel applications

In this paper authors extend upon their previous research on powercapped optimization of performance-energy metrics of deep neural networks training workloads. A professional power meter Yokogawa WT-310E is used, as well as Intel RAPL and Nvidia NVML interfaces, to examine power consumption of a muc...

Full description

Saved in:
Bibliographic Details
Published inComputer Science and Information Systems Vol. 22; no. 2; pp. 563 - 590
Main Authors Koszczał, Grzegorz, Matuszek, Mariusz, Czarnul, Paweł
Format Journal Article
LanguageEnglish
Published 01.04.2025
Online AccessGet full text
ISSN1820-0214
2406-1018
DOI10.2298/CSIS240722023K

Cover

Loading…
More Information
Summary:In this paper authors extend upon their previous research on powercapped optimization of performance-energy metrics of deep neural networks training workloads. A professional power meter Yokogawa WT-310E is used, as well as Intel RAPL and Nvidia NVML interfaces, to examine power consumption of a much more comprehensive set of multi-GPU and multi-CPU workloads, including: selected kernels from NAS Parallel Benchmarks for CPUs and GPUs as well as Horovod-Python Xception deep neural network training using several GPUs. A comparison and discussion of results obtained by both power measurement methods has been performed using 2 systems, one with 2 Intel Xeon CPUs and 8 Nvidia Quadro RTX 6000 GPUs and the second 2 Intel Xeon CPUs and 4 Nvidia Quadro RTX 5000 GPUs. We compared power consumption between hardware and software interfaces for CPU, GPU and mixed CPU+GPU workload configurations, using 1-40 threads for the CPUs and 1-8 GPUs.
ISSN:1820-0214
2406-1018
DOI:10.2298/CSIS240722023K