ESSA 2024: 5th Workshop on Extreme-Scale Storage and Analysis

To be held on May 27, 2024 in conjunction with IEEE IPDPS 2024, San Francisco, CA, USA 

Agenda

Workshop Day: Monday, May 27, 2024
Workshop Location: Hyatt Regency San Francisco, Embarcadero Center, San Francisco, California USA
Workshop Room: Regency B (Street Level)

13:00 - 13:10 Welcome Message

13:10 - 14:00 Keynote: HPC and Databases Revisited
­— Jay Lofstead (Sandia National Laboratories)

14:00 - 14:30 Paper Talk: The impact of asynchronous I/O in checkpoint-restart workloads
Hariharan Devarajan, A. Moody, D. Dai, C. Stanavige, E. Gonsiorowski, M. McFadden, O. Faaland, G. Kosinovsky, K. Mohror

14:30 - 15:00 Paper Talk: Benchmarking variables for checkpointing in HPC Applications
    — Xiang Fu, Xin Huang, Wubiao Xu, Shiman Meng, Weiping Zhang, Luanzheng Guo, Kento Sato

15:00 - 15:30 Coffee Break

15:30 - 16:00 Paper Talk: Extending the Mochi Methodology to Enable Dynamic HPC Data Services
    — Matthieu Dorier, Philip Carns, Robert Ross, Shane Snyder, Rob Latham, Amal Gueroudji, George Amvrosiadis, Chuck Cranor, Jerome Soumagne

16:00 - 16:30 Paper Talk: Adaptive Per-File Lossless Compression of Floating-Point Data
Andrew Rodriguez, Noushin Azami, Martin Burtscher

16:30 - 17:00 Paper Talk: Optimizing Forward Wavefield Storage Leveraging High-Speed Storage Media
    — João Speglich, Navjot Kukreja, George Bisbas, Átila Saraiva, Jan Hückelheim, Fabio Luporini, John Washbourne

17:00 - 17:30 Paper Talk: The Art of Sparsity: Mastering High-Dimensional Tensor Storage
Bin Dong, Kesheng Wu, Suren Byna

17:30 - 17:45 Discussion and Closing Remarks

Keynote

Jay Lofstead (Sandia National Laboratories)

Jay Lofstead is a Principal Member of Technical Staff at Sandia National Laboratories. His research interests focus around large scale data management and trusting scientific computing. In particular, he works on storage, IO, metadata, workflows, reproducibility, software engineering, machine learning, and operating system-level support for any of these topics. Broadly across these topics, he is also deeply interested in ethics related to these topics and computing in general and how to drive inclusivity across the computation-related science domains. Dr. Lofstead received his Ph.D. in Computer Science from the Georgia Institute of Technology in 2010.

Keynote Abstract

Around twenty-five years ago, the HPC storage and IO community investigated the potential for relational databases for HPC data management and found numerous issues making an RDBMS a poor choice. SciDB made decisions as well that further cemented the difficulty in terms of ingestion velocity and overhead against an RDBMS mode for HPC data management. More recent work in the metadata arena, such as EMPRESS, and in a new IO library, Stitch-IO, show a potential path towards bringing these communities together. However, the challenges present offer potentially new research and a need to overcome well entrenched bias. This talk will explore the roots of this bias, why we should rethink, and propose a path towards data management bliss.

Workshop Overview

Advances in storage are becoming increasingly critical because workloads on high performance computing (HPC) and cloud systems are producing and consuming more data than ever before, and the situation promises to only increase in future years. Additionally, the last decades have seen relatively few changes in the structure of parallel file systems, and limited interaction between the evolution of parallel file systems, e.g., Lustre, GPFS, and I/O support systems that take advantage of hierarchical storage layers, e.g., node local burst buffers. However, recently the community has seen a large uptick in innovations in storage systems and I/O support software for several reasons:

Our goals in the ESSA Workshop are to bring together expert researchers and developers in data-related areas including storage, I/O, processing and analysis on extreme scale infrastructures including HPC systems, clouds, edge systems or hybrid combinations of those, to discuss advances and possible solutions to the new challenges we face.

Topics and Scope

ESSA 2024 Workshop Organization

Workshop Chairs

Program Chairs

Web Chair

Publicity Chair

Important Dates

Please note: All deadlines are Anywhere on Earth