Session
ExHetAI: Extreme Heterogeneity and AI Convergence in HPC
Session Chairs
DescriptionThe increasing convergence of AI and HPC, combined with the rapid evolution of heterogeneous computing architectures, is transforming modern supercomputing. The emergence of specialized accelerators, including GPUs, TPUs, IPUs, neuromorphic chips, quantum processors, and FPGAs, has introduced new challenges in performance portability, system optimization, and software adaptability. In this exascale and extreme heterogeneity era, effectively exploiting diverse hardware architectures requires AI-driven approaches, novel programming models, and intelligent workload management. This workshop will bring together experts from academia, industry, and national laboratories to explore AI-HPC convergence, heterogeneous system architectures, energy-efficient computing, and AI-assisted performance optimization. By fostering interdisciplinary discussions and collaborations, the workshop aims to advance scalable, efficient, and sustainable computing. We invite contributions on topics including heterogeneous hardware, AI-driven HPC techniques, memory architectures, and programming models, with a focus on shaping the future of AI-driven scientific discovery and high performance computing.
Event Type
Workshop
TimeSunday, 16 November 20259:00am - 12:30pm CST
Location264
Partially Livestreamed
Partially Recorded
TP
W
Presentations
| 9:00am - 9:01am CST | ExHetAI: Extreme Heterogeneity and AI Convergence in HPC | |
| 9:01am - 9:10am CST | Opening Remarks | |
| 9:10am - 10:00am CST | Invited Talk 1: Memorization vs Reasoning in MoEs and Estimating Memory Consumption in Distributed Training Presenter | |
| 10:00am - 10:30am CST | Morning Break - ExHetAI | |
| 10:30am - 11:00am CST | Invited Talk 2: Automating Energy-Aware HPC Code Development for Performance and Portability in Heterogeneous Computing Presenter | |
| 11:00am - 11:20am CST | Enabling Unstructured Sparse Fine-Tuning and Inference for Foundation Models on Wafer-Scale Engine | |
| 11:20am - 11:40am CST | WAGES: Workload-Aware GPU Sharing System for Energy-Efficient Serverless LLM Serving | |
| 11:40am - 12:00pm CST | OmniFed: A Modular Framework for Configurable Federated Learning from Edge to HPC | |
| 12:00pm - 12:20pm CST | Enhancing ChatPORT with CUDA-to-SYCL Kernel Translation Capability |
