

Maximizing Power-Constrained Supercomputing Throughput
Wednesday, June 11, 2025 9:00 AM to 9:25 AM · 25 min. (Europe/Berlin)
Hall F - 2nd floor
Research Paper
Sustainability and Energy Efficiency
Information
Maximizing supercomputing throughput within power and cooling limits is a key challenge for exascale systems, which are increasingly constrained by power rather than performance. Effective power and energy management is essential. Whereas power capping has been well-known to increase energy efficiency and reduce energy costs, power variability has emerged as an orthogonal driving force on cost through service pricing models and increase component wear out. We present a performance-power-efficiency model that ties application performance, empirical power usage, power variability, and energy efficiency in a single methodology to enable optimize HPC system operation. We demonstrate the ability of our methodology to understand performance and energy efficiency through power capping on NVIDIA GPUs using seven workloads and three microbenchmarks. We show that uniform throughput power capping can potentially improve power-constrained system throughput by 1.8x and 2.5x based on capped node power and peak node power, outperforming uniform power capping by 20%.
Contributors:
Contributors:
Format
On DemandOn Site
Documents & Links
Read the Full Paper Open Access at IEEE Xplore!



