Presentation

Evaluating the Power-Monitoring Capabilities of Aurora
Description

Exascale systems like Aurora push performance bounds, but they draw tens of megawatts, making precise, low-overhead power monitoring essential for efficiency and cost control. We present an ongoing evaluation of the two primary power-monitoring interfaces on Aurora, quantifying their accuracy and temporal granularity from the single-node level to the full-system level. Our contribution is a reproducible methodology that combines HPC benchmarks, mini-apps, and spectral analysis to determine when each tool is trustworthy and how to configure its sampling. Preliminary results characterize sampling limits and overhead trade-offs. Complete results are in progress; we aim to determine whether our current power-monitoring methods are suitable at exascale. In the poster, we will share the evaluation framework, early comparative results, and actionable best practices for exascale power studies.