Presentation
Communication Libraries in HPC and AI
Description
The world's largest supercomputers for scientific discovery are also premier systems for artificial intelligence model training and inference. While traditional HPC workloads have predominantly relied on the Message Passing Interface (MPI) standard, AI workloads have increasingly turned to collective communication libraries, such as NVIDIA's NCCL and AMD's RCCL, which are optimized for high-bandwidth throughput. This BoF session at SC25 examines collective communication libraries in detail, focusing on how the widely adopted MPI compares with NCCL/RCCL and with other key messaging libraries such as SHMEM.
Event Type
Birds of a Feather
Time
Wednesday, 19 November 2025, 12:15pm - 1:15pm CST
Location
240-241-242
Livestreamed
Recorded