Session

Workshop: PDSW'25: The 10th International Parallel Data Systems Workshop
Description: Efficient storage, movement, and management of data are crucial to application performance and scientific productivity in both traditional simulation-oriented HPC environments and cloud AI/ML/big data analysis environments. This issue is further exacerbated by the growing volume of experimental and observational data, the widening gap in performance between computational hardware and storage hardware, and the emergence of new data-driven algorithms in machine learning. The goal of this workshop is to facilitate in-depth discussions of research and development that address the most critical challenges in large-scale data storage and data processing. PDSW will continue to build on the successful tradition established by its predecessor workshops: the Petascale Data Storage Workshop (PDSW, 2006-2015) and the Data Intensive Scalable Computing Systems workshop (DISCS, 2012-2015). These workshops were successfully combined in 2016, and the resulting joint workshop has attracted up to 45 full paper submissions and 195 attendees per year from 2016 to 2024.
Event Type
Workshop
Time: Monday, 17 November 2025, 9:00am - 5:30pm CST
Location: 230
Tags
Data Analytics
High Performance I/O, Storage, Archive, & File Systems
Storage
Recordings
Livestreamed
Recorded
Registration Categories
TP
W
Presentations
9:00am - 9:05am CST PDSW'25: The 10th International Parallel Data Systems Workshop
9:05am - 10:00am CST Featured Talk: Supporting Science Through Data Management Software
10:00am - 10:30am CST Morning Break - PDSW'25: The 10th International Parallel Data Systems Workshop
10:30am - 11:00am CST LLMTailor: A Layer-wise Tailoring Tool for Efficient Checkpointing of Large Language Models
11:00am - 11:30am CST SlimIO: Lightweight I/O Path Design for Write Isolation in FDP-backed In-Memory Databases
11:30am - 12:00pm CST Parallel Data Object Creation: Scalable Metadata Management in Parallel I/O Library
12:00pm - 12:30pm CST SmartIO: A Lightweight End-to-End Workflow for Runtime I/O Optimization of HPC Systems
12:30pm - 2:00pm CST Lunch - PDSW'25: The 10th International Parallel Data Systems Workshop
2:00pm - 2:30pm CST RL4Sys: A Lightweight System-driven RL Framework for Drop-in Integration in System Optimization
2:30pm - 3:00pm CST Quantifying AWS S3 I/O Performance Boundaries Using the Roofline Model
3:00pm - 3:30pm CST Afternoon Break - PDSW'25: The 10th International Parallel Data Systems Workshop
3:30pm - 4:00pm CST Secure In-Storage Execution of VTK Workloads on Modern Parallel NFS Data Servers
4:00pm - 4:05pm CST Towards AI-Driven Interfaces for Scientific Data Management
4:05pm - 4:10pm CST LLM training in practice: insights from 85,000 checkpoints
4:10pm - 4:15pm CST PAPI Support for Specialized AI Architectures
4:15pm - 4:20pm CST Accessing Serialized Data Formats with GPU-Initiated I/O
4:20pm - 4:25pm CST HPC Consult Ticket Analysis with SambaNova
4:25pm - 4:30pm CST Evaluating Usage and Performance of DAOS for a Classic HPC Application
4:30pm - 4:35pm CST DAOS on 400 Gbps Fabrics
4:35pm - 4:40pm CST Accelerating Exascale Scientific Discovery via In-Situ and In-Transit Data Analytics in HPC
4:40pm - 5:30pm CST Panel: Storage Architectures and I/O Optimizations for AI Applications