Available on Google Scholar and my CV.
2025
I4. [Preprint] Experiences with Model Context Protocol Servers for Science and High Performance Computing [preprint] [ParslFest slides] [repository]
J4. [FHPCP] Toward a Persistent Event-Streaming System for High-Performance Computing Applications [paper]
I3. [Preprint] Throughput Estimation of Data Transport Networks from Digital Twin Measurements [preprint]
J3. [ApJS] RADAR—Radio Afterglow Detection and AI‑Driven Response: A Federated Framework for Gravitational Wave Event Follow‑Up [paper]
C22. [ICS'25] D-Rex: Heterogeneity-Aware Reliability Framework and Adaptive Algorithms for Distributed Storage [paper]
C21. [CCGrid'25] DynoStore: A wide-area distribution system for the management of data over heterogeneous storage
C20. [CCGrid'25] WRATH: Workload Resilience Across Task Hierarchies in Task-based Parallel Programming Frameworks [paper]
C19. [IPDPS'25] Optimizing Fine-Grained Parallelism Through Dynamic Load Balancing on Multi-Socket Many-Core Systems [paper]
I2. [Preprint] MOFA: Discovering Materials for Carbon Capture with a GenAI- and Simulation-Based Workflow [preprint]
2024
[FTXS'24] Octopus: Experiences with a Hybrid Event-Driven Architecture for Distributed Scientific Computing. paper | project descriptions [1, 2] | diaspora SDK | diaspora service repo | docs and demos | SDK walkthrough | evaluation methodology | tech. report | slides | presentation
[NRDPISI-1] Diaspora: Resilience‑Enabling Services for Real‑Time Distributed Workflows. paper | project descriptions [1, 2] | diaspora SDK | diaspora service repo | docs and demos
[eScience'24] An Empirical Investigation of Container Building Strategies and Warm Times to Reduce Cold Starts in Scientific Computing Serverless Functions. paper | Globus Compute dataset | Binder dataset
[eScience'24] TaPS: A Performance Evaluation Suite for Task-based Execution Frameworks. paper | repo | docs
[FGCS Vol. 153] The Globus Compute Dataset: An Open Function-as-a-Service Dataset From the Edge to the Cloud. paper | dataset
2022
[OSDI'22] Cancellation in Systems: An Empirical Study of Task Cancellation Patterns and Failures. paper | poster | codebase | video
[ICC'22] Reliable Broadcast in Critical Applications: Asset Transfer and Smart Home. paper
2021
[SOSP'21] Rabia: Simplifying State-Machine Replication Through Randomization. paper | poster | video | codebase | tech. report
[ICDCN'21] Practical Experience Report: Cassandra+: Trading-Off Consistency, Latency, and Fault-tolerance in Cassandra. paper | tech. report | codebases
2020
[Computer Networks Vol. 182] Reliable broadcast with trusted nodes: Energy reduction, resilience, and speed. paper | codebase
[GLOBECOM'20] BBB: A Lightweight Approach to Evaluate Private Blockchains in Clouds. paper | video | codebase
[NCA'20] CassandrEAS: Highly Available and Storage-Efficient Distributed Key-Value Store with Erasure Coding. paper | codebases
[Preprint] Reliable Broadcast in Practical Networks: Algorithm and Evaluation. preprint
[PerVehicle'20] Make Multi-hop Broadcast in VANET Fast by Selecting a Better Route for Source Vehicle. paper | slides | codebase
[DUCSAN'20] Tutorial: Google Cloud for Beginners: Architecture, Storage, and Computation. paper | video | slides | instruction
[DUCSAN'20] Tutorial: Deep Dive into Apache Cassandra: Theory, Design, and Application. paper | slides
[DUCSAN'20] LiteDoc: Make Collaborative Editing Fast, Scalable, and Robust. paper | codebases
2019
[GLOBECOM'19] Reliable Broadcast in Networks with Trusted Nodes. paper | codebase
[PRDC'19] BBB: Make Benchmarking Blockchains Configurable and Extensible. paper | codebase
[NCA'19] Distributed Causal Memory in the Presence of Byzantine Servers. paper | audio | slides
[Sarnoff'19] A First Step Towards Production-Ready Network Function Storage: Benchmarking with NFSB. paper | codebase