Maximizing Power-Constrained Supercomputing Throughput

Maximizing Power-Constrained Supercomputing Throughput

Wednesday, June 11, 2025 9:00 AM to 9:25 AM · 25 min. (Europe/Berlin)
Hall F - 2nd floor
Research Paper
Sustainability and Energy Efficiency

Information

Maximizing supercomputing throughput within power and cooling limits is a key challenge for exascale systems, which are increasingly constrained by power rather than performance. Effective power and energy management is essential. Whereas power capping has been well-known to increase energy efficiency and reduce energy costs, power variability has emerged as an orthogonal driving force on cost through service pricing models and increase component wear out. We present a performance-power-efficiency model that ties application performance, empirical power usage, power variability, and energy efficiency in a single methodology to enable optimize HPC system operation. We demonstrate the ability of our methodology to understand performance and energy efficiency through power capping on NVIDIA GPUs using seven workloads and three microbenchmarks. We show that uniform throughput power capping can potentially improve power-constrained system throughput by 1.8x and 2.5x based on capped node power and peak node power, outperforming uniform power capping by 20%.
Contributors:
Format
On DemandOn Site

Log in

See all the content and easy-to-use features by logging in or registering!