Presentation

Physical System Study on Balancing Interactive and Batch Job Performance through Oversubscribing Scheduling
Description

This paper evaluates oversubscribing in High-Performance Computing (HPC) systems as a way to balance interactive and batch job performance. Using real workload traces and experiments on physical hardware, we demonstrate that oversubscribing can reduce queue waiting times while maintaining overall system performance. Our results show that this approach (1) decreases waiting times for interactive jobs, (2) has minimal impact on overall system throughput, and (3) effectively manages individual job turnaround times. Unlike traditional multiple-queue approaches, oversubscribing provides these benefits with simpler configuration requirements. Additionally, through quantitative analysis of memory usage, we provide insights into the applicability of oversubscribing to production capacity planning. Our research contributes empirical evidence of the effectiveness of oversubscribing in real HPC environments, supported by comprehensive experimental data and practical implementation insights.