Publications
- 2021
[VLDB] Fauce: Fast and Accurate Deep Ensembles with Uncertainty for Cardinality Estimation.
[VLDB] Understanding the Idiosyncrasies of Real Persistent Memory
Shashank Gugnani, Arjun Kashyap, and Xiaoyi Lu
In Proceedings of the VLDB Endowment, the 47th International Conference on Very Large Data Bases
[ATC] ZeRO-Offload: Democratizing Billion-Scale Model Training.
Jie Ren, Samyam Rajbhandari, Reza Yazdani Aminabadi, Olatunji Ruwase, Shuangyan Yang, Minjia Zhang, Dong Li and Yuxiong He
In 27th USENIX Annual Technical Conference (acceptance rate: 18.8%) (Media report 1) (Media report 2)
[ICS] MD-HM: Memoization-based Molecular Dynamics Simulations on Big Memory System.
Zhen Xie, Wenqian Dong, Jie Liu, Ivy Peng, Yanbao Ma and Dong Li
In 35th International Conference on Supercomputing (acceptance rate: 24%)
[ICS] Enabling Energy-Efficient DNN Training on Hybrid GPU-FPGA Accelerators.
Xin He, Jiawen Liu, Zhen Xie, Hao Chen, Guoyang Chen, Weifeng Zhang and Dong Li
In 35th International Conference on Supercomputing (acceptance rate: 24%)
[ICS] Athena: High-Performance Sparse Tensor Contraction Sequence on Heterogeneous Memory.
Jiawen Liu, Dong Li, Roberto Gioiosa and Jiajia Li
In 35th International Conference on Supercomputing (acceptance rate: 24%)
[IPDPS] NVMe-CR: A Scalable Ephemeral Storage Runtime for Checkpoint/Restart with NVMe-over-Fabrics
Shashank Gugnani, Tianxi Li, and Xiaoyi Lu
In Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium
[EuroSys] Tahoe: Tree Structure-Aware High Performance Inference Engine for Decision Tree Ensemble on GPU.
Zhen Xie, Wenqian Dong, Jiawen Liu, Hang Liu and Dong Li
In European Conference on Computer Systems
[FAST] ArchTM: Architecture-Aware, High Performance Transaction for Persistent Memory
Kai Wu, Jie Ren, Ivy Peng and Dong Li
In 19th USENIX Conference on File and Storage Technologies
[ASPLOS] Fast, Flexible and Comprehensive Bug Detection for Persistent Memory Programs
Bang Di, Jiawen Liu, Hao Chen and Dong Li
Architectural Support for Programming Languages and Operating Systems
[PPoPP] Sparta: High-Performance, Element-Wise Sparse Tensor Contraction on Heterogeneous Memory
Jiawen Liu, Jie Ren, Roberto Gioiosa, Dong Li and Jiajia Li
In 26th Principles and Practice of Parallel Programming
Jie Ren, Jiaolin Luo, Kai Wu, Minjia Zhang, Hyeran Jeon and Dong Li
In 27th IEEE International Symposium on HighPerformance Computer Architecture
[TPDS] Efficient Buffer Overflow Detection on GPU.
Bang Di, Jianhua Sun, Hao Chen, and Dong Li
IEEE Transaction on Parallel and Distributed Systems
[TPDS] TRUST: Triangle Counting Reloaded on GPUs
Santosh Pandey, Zhibin Wang, Sheng Zhong, Chen Tian, Bolong Zheng, Xiaoye Li, Lingda Li, Adolfy Hoisie, Caiwen Ding, Dong Li, and Hang Liu
IEEE Transaction on Parallel and Distributed Systems
- 2020
[SC] INEC: Fast and Coherent In-Network Erasure Coding
Haiyang Shi and Xiaoyi Lu
In Proceedings of the 33rd International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2020. (Acceptance Rate: 22.3%)
[SC] RDMP-KV: Designing Remote Direct Memory Persistence-based Key-Value Stores with PMEM
Tianxi Li*, Dipti Shankar*, Shashank Gugnani, and Xiaoyi Lu
In Proceedings of the 33rd International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2020. (Acceptance Rate: 22.3%, *Co-First Authors)
[NeurIPS] HM-ANN: Efficient Billion-Point Nearest Neighbor Search on Heterogeneous Memory.
Jie Ren, Minjia Zhang and Dong Li
In 34th Conference on Neural Information Processing Systems (acceptance rate: 20%).
[Cluster] Exploring Non-Volatility of Non-Volatile Memory for High Performance Computing Under Failures.
Jie Ren, Kai Wu and Dong Li
In IEEE International Conference on Cluster Computing (acceptance rate: %). (Link to the tech report) (Link to the NVC tool)
[PACT] Ribbon: High Performance Cache Line Flushing for Persistent Memory.
Kai Wu, Ivy B. Peng, Jie Ren and Dong Li
In 29th International Conference on Parallel Architectures and Compilation Techniques (acceptance rate: 25%).
[SC] Smart-PGSim: Using Neural Network to Accelerate AC-OPF Power Grid Simulation.
[SC] Enabling Faster NGS Analysis on Optane-based Heterogeneous Memory.
Jiaolin Luo, Luanzheng Guo, Jie Ren, Kai Wu and Dong Li
Poster In 32nd ACM/IEEE International Conference for High Performance Computing, Performance Measurement, Modeling and Tools
[IPDPS] Demystifying the Performance of HPC Scientific Applications on NVM-based Memory Systems.
Ivy Peng, Kai Wu, Jie Ren, Dong Li and Maya Gokhale
In 34th IEEE International Parallel and Distributed Processing Symposium
[IISWC] MATCH: An MPI Fault Tolerance Benchmark Suite
Luanzheng Guo, Giorgis Georgakoudis, Konstantinos Parasyris, Ignacio Laguna and Dong Li
In IEEE International Symposium on Workload Characterization (acceptance rate: %) (Media report)
[USENIX OpML] RIANN: Real-time Incremental Learning with Approximate Nearest Neighbor on Mobile Devices
Jiawen Liu, Zhen Xie, Dimitrios Nikolopoulos and Dong Li
In USENIX Conference on Operational Machine Learning
[MLSys-W] Flame: A Self-Adaptive Auto-Labeling System for Heterogeneous Mobile Processors
Jie Liu, Jiawen Liu, Zhen Xie and Dong Li
In On-Device Intelligence Workshop at Machine Learning and Systems Conference