We attended IPDPSC'25 in Milan, Italy to present three papers:
[IPDPS'25] Chris Egersdoerfer*, Arnav Sareen, Jean Luca Bez, Suren Byna, Dongkuan (DK) Xu, Dong Dai, "IOAgent: Democratizing Trustworthy HPC I/O Performance Diagnosis Capability via LLMs." IPDPS, 2025
[IPDPS'25] Saisha Kamat*, Mai Zheng, Bo Fang, Dong Dai, "Be Aware of Metadata Corruption in Parallel File System: It Can Be Silent and Catastrophic" IPDPS, 2025
[IPDPS'25] Md. Hasanur Rashid*, Dong Dai "AdapTBF: Decentralized Bandwidth Control via Adaptive Token Borrowing for HPC Storage" IPDPS, 2025
Our group attended SC'24 in Atlanta for presenting two papers
[PDSW@SC'24] Chris Egersdoerfer*, Md. Hasanur Rashid*, Dong Dai, Bo Fang, Tallent Nathan, "Understanding and Predicting Cross-Application I/O Interference in HPC Storage Systems." PDSW@SC, 2024.
[ACM SRC@SC'24] Abdullah Al Raqibul Islam*, Helen Xu, Dong Dai, Aydin Buluc. “Improving SpGEMM Performance Through Reordering and Cluster-wise Computation”. Raqibul won the 3rd place of ACM Student Research Competition - Graduates!
We attended HotStorage'24 in SanFrancisco for presenting the ION paper
[HotStorage'24] Chris Egersdoerfer, Arnav Sareen, Jean Luca Bez, Suren Byna, Dong Dai. “ION: Navigating HPC I/O Optimization Journey using Large Language Models.” In proceedings of the 16th ACM Workshop on Hot Topics in Storage and File Systems (HotStorage’24), 2024.
We attended IPDPS'24 in SanFrancisco for presenting two papers
[JSSPP@IPDPS'24] Monish Soundar Raj, Thomas MacDougall, Di Zhang, Dong Dai. “An Empirical Study of Machine Learning-based Synthetic Job Trace Generation Methods.” Accepted to appear in the 27th Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP@IPDPS’24).
[IPDPS'24] Di Zhang, Monish Soundar Raj, Bing Xie, Sheng Di, Dong Dai. “Cross-System Analysis of Job Characterization and Scheduling in Large-Scale Computing Clusters.” Accepted to appear in the 38th IEEE International Parallel & Distributed Processing Symposium (IPDPS’24), 2024. (Conference CORE Ranking A).