We are active in the academic community and regularly publish our work for advancing the state-of-the-art and for the benefit of human knowledge. We are highly selective in our choice of venues for publications. Below is a list of our recent publications in top-tier computer science conferences.
[SC'20] Qidong Zhao, Xu Liu, and Milind Chabbi. DRCCTPROF: A Fine-grained Call Path Profiler for ARM-based Clusters}. To appear In proceedings of the international conference for High Performance Computing, Networking, Storage and Analysis, 2020. (Performance Track Best Paper, all tracks Best Paper nominee).
[ICS'20] Jialiang Tan, Shuyin Jiao, Milind Chabbi, Xu Liu. What Every Scientific Programmer Should Know About Compiler Optimizations?. In proceedings of the International Conference on Supercomputing, 2020.
[SC'19] Pengfei Su, Milind Chabbi, Xu Liu. Pinpointing Performance Inefficiencies via Lightweight Variance Profiling}. In proceedings of the international conference for High Performance Computing, Networking, Storage and Analysis, 2019.
[SC'19] Muhammad Aditya Sasongko, Milind Chabbi, Pirah Noor Soomro, and Didem Unat. ComDetective: A Fast and Accurate Communication Detection Tool for Threads. In proceedings of the international conference for High Performance Computing, Networking, Storage and Analysis, 2019. (Performance Track Best Paper, all tracks Best Paper nominee).
[HPCB'19] Matthias Becker, Milind Chabbi, Stefanie Warnat-Herresthal, Umesh Worlikar, Shobhit Agrawal, Jaydeep Bhat, Jonas Schulte-Schrepping, Kevin Baßler, Patrick Günther, Hartmut Schultze, Thomas Ulas, Sharad Singhal, and Joachim L. Schultze. Memory-driven Computing Accelerates Genomic Data Processing. In proceedings of the 6th International Workshop on High Performance Computing on Bioinformatics, 2019.
[FSE'19] Pengfei Su, Qingsen Wang, Milind Chabbi, Xu Liu. Pinpointing Performance Inefficiencies in Java. In proceedings of the ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2019.
[IPDPS'19] Mostofa Patwary, Milind Chabbi, Heewoo Jun, Jiaji Huang, Gregory Diamos, and Kenneth Church. Language Modeling at Scale. In the proceedings of the International Parallel and Distributed Processing Symposium, 2019.
[ICSE'19] Pengfei Su, Shasha Wen, Hailong Yang, Milind Chabbi, and Xu Liu. Load Redundancy: A Software Inefficiency Indicator. In the proceedings of the 41st ACM/IEEE International Conference on Software Engineering, 2019. Montreal, Canada. (Distinguished paper award).
[HPCA'19] Qingsen Wang, Xu Liu, and Milind Chabbi. Featherlight Reuse-distance Measurement. In proceedings of the 25th IEEE International Symposium on High-Performance Computer Architecture, 2019.
[PPoPP'19] Qingsen Wang, Pengfei Su, Milind Chabbi, and Xu Liu. Lightweight Hardware Transactional Memory Profiling}. The proceedings of the ACM SIGPLAN symposium on Principles and Practice of Parallel Programming, 2019. Washington D.C. (Best paper award).
[TOPC] Abdelhalim Amer, Huiwei Lu, Pavan Balaji, Milind Chabbi}, Yanjiw Wei, and Satoshi Matsuoka. Lock Contention Management in Multithreaded MPI. In ACM Transactions on Parallel Computing (TOPC).
[ICS'18] Shasha Wen, Lucy Cherkasova, Felix Xiaozhu Lin, and Xu Liu, "ProfDP: A Lightweight Profiler to Guide Data Placement in Heterogeneous Memory Systems", The 32nd ACM International Conference on Supercomputing, 2018.
[PPoPP'18] Milind Chabbi, Shasha Wen, and Xu Liu. Featherlight On-the-Fly False-sharing Detection. The proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2018. (Best Paper Award).
[PMAM'18] Du Shen, Xu Liu, and Milind Chabbi. An Evaluation of Vectorization and Cache Reuse Tradeoffs on Modern CPUs. The proceedings of the 9th International Workshop on Programming Models and Applications for Multicores and Manycores, 2018.
[ASPLOS'18] Shasha Wen, John Byrne, Xu Liu, and Milind Chabbi. Watching for Software Inefficiencies with Witch. The proceedings of the 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2018.
[CGO'18] Probir Roy, Shuaiwen Leon Song, Sriram Krishnamoorthy and Xu Liu, "Lightweight Detection of Cache Conflicts", The 2018 International Symposium on Code Generation and Optimization, Feb 24 - 28th, 2018, Vienna, Austria.
[CGO'18] Du Shen, Shuaiwen Leon Song, Ang Li and Xu Liu, "CUDAAdvisor: LLVM-based Runtime Profiling for Modern GPUs", The 2018 International Symposium on Code Generation and Optimization, Feb 24 - 28th, 2018, Vienna, Austria.
[CGO'18] Biwei Xie, Jianfeng Zhan, Xu Liu, Wanling Gao, Zhen Jia, Lixin Zhang, "CVR: Efficient SpMV Vectorization on X86 Processors", The 2018 International Symposium on Code Generation and Optimization, Feb 24 - 28th, 2018, Vienna, Austria.
[PMBS'17] Adarsh Yoga and Milind Chabbi. Path-synchronous Performance Monitoring in HPC Interconnection Networks with Source-code Attribution. In the proceedings of the 8th IEEE workshop in Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems held as part of ACM/IEEE Supercomputing 2017 (SC '17).
[IPDPS'17] Hao Xu, Shasha Wen, Alfredo Gimenez, Todd Gamblin and Xu Liu, "Dr-BW: Identifying Bandwidth Contention in NUMA Architectures with Supervised Learning", The 31st IEEE International Parallel and Distributed Processing Symposium, May 29 - Jun 2, 2017, Orlando, Florida, USA.
[ASPLOS'17] Shasha Wen, Milind Chabbi, and Xu Liu. RedSpy: Exploring Value Locality in Software. In the proceedings of the 22nd ACM International Conference on Architectural Support for Programming Languages and Operating Systems. [17% acceptance rate] (ASPLOS Highlight Paper).
[PPoPP'17] Milind Chabbi, Halim Amer, Shasha Wen, and Xu Liu. An Efficient Abortable-Locking Protocol for Multi-level NUMA Systems. In the proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2017. [22% acceptance rate].
[HPDC'16] Probir Roy, Xu Liu and Shuaiwen Leon Song. SMT-Aware Instantaneous Footprint Optimization. In Proceedings of the 25th ACM international Symposium on High-Performance and Distributed Computing, 2016. [15% acceptance rate].
[ASPLOS'16] Felix Xiaozhu Lin and Xu Liu. memif: Towards Programming Heterogeneous Memory Asynchronously. In Proceedings of the 21st International Conference on Architectural Support for Programming Languages and Operating Systems. [22% acceptance rate].
[PPoPP'16] Milind Chabbi and John Mellor-Crummey. Contention-Conscious, Locality-Preserving Locks. In Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2016. [18% acceptance rate].
[PPoPP'16] Tianzheng Wang, Milind Chabbi, and Hideaki Kimura. Be My Guest -- MCS Lock Now Welcomes Guests. In Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2016. [18% acceptance rate].
[CGO'16] Probir Roy and Xu Liu. StructSlim: A Lightweight Profiler to Guide Structure Splitting. In Proceedings of the 2016 International Symposium on Code Generation and Optimization. [23% acceptance rate].
[CGO'16] Tongping Liu and Xu Liu. Cheetah: Detecting False Sharing Efficiently and Effectively. In Proceedings of the 2016 International Symposium on Code Generation and Optimization. [23% acceptance rate].
[SC'15] Xu Liu and Bo Wu. ScaAnalyzer: A Tool to Identify Memory Scalability Bottlenecks in Parallel Programs. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2015. [22% Acceptance rate] (Best Paper Award).
[PACT'15] Shasha Wen, Xu Liu, and Milind Chabbi. Run Time Value Numbering: A Profiling Technique to Pinpoint Redundant Computations. In Proceedings of the 24th International Conference on Parallel Architectures and Compilation Techniques, 2015. San Francisco, CA. [21% acceptance rate].
[TPDS] Ashwin Aji, Lokendra Panwar, Wu-chun Feng, Pavan Balaji, James Dinan, Rajeev Thakur, Feng Ji, Xiaosong Ma, Milind Chabbi, Karthik Murthy, John Mellor-Crummey, and Keith Bisset. MPI-ACC: Accelerator-Aware MPI for Scientific Applications. In Transactions on Parallel and Distributed Systems.
[PPoPP'15] Milind Chabbi, Michael Fagan, and John Mellor-Crummey. High Performance Locks for Multi-level NUMA Systems. In Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015. San Francisco, CA. [17% acceptance rate].
[PPoPP '15] Milind Chabbi, Wim Lavrijsen, Wibe de Jong, Koushik Sen, John Mellor-Crummey, Costin Iancu. Barrier Elision for Production Parallel Programs. In Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015. San Francisco, CA. [17% acceptance rate].
[PACT'14] Xu Liu, Kamal Sharma and John Mellor-Crummey. ArrayTool: A Lightweight Profiler to Guide Array Regrouping. In Proceedings of the 23rd International Conference on Parallel Architectures and Compilation Techniques, 2014.
[PPOPP'14] Xu Liu and John Mellor-Crummey. A Tool to Analyze the Performance of Multithreaded Programs on NUMA Architectures. In Proceedings of the 19th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2014.
[CGO'14] Milind Chabbi, Xu Liu, and John Mellor-Crummey. Call Paths for Pin Tools. In Proceedings the International Symposium on Code Generation and Optimization, 2014. Orlando, FL. [28% acceptance rate]
[SC'13] Xu Liu and John Mellor-Crummey. A Data-centric Profiler for Parallel Programs. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2013.
[SC'13] Milind Chabbi, Karthik Murthy, Michael Fagan, and John Mellor-Crummey. Effective Sampling-Driven Performance Tools for GPU-Accelerated Supercomputers. In Proceedings the International Conference for High Performance Computing, Networking, Storage and Analysis (SC’13). Denver, CO. [20% acceptance rate].
[HPDC'13] Ashwin Aji, Lokendra Panwar, Wu-chun Feng, Pavan Balaji, James Dinan, Rajeev Thakur, Feng Ji, Xiaosong Ma, Milind Chabbi, Karthik Murthy, John Mellor-Crummey, and Keith Bisset. On the Efficacy of GPU-Integrated MPI for Scientific Applications. In Proceedings of the Symposium on High-Performance Parallel and Distributed Computing. New York City, NY. [15% acceptance rate].
[IPDPS'13] Sanjay Chatterjee, Sagnak Tasirlar, Zoran Budimlic, Vincent Cave, Milind Chabbi, Max Grossman, Yonghong Yan, and Vivek Sarkar. Integrating Asynchronous Task Parallelism with MPI. In Proceedings of the International Parallel and Distributed Processing Symposium. Boston, MA. [21% acceptance rate].
[ICS'13] Xu Liu, John Mellor-Crummey and Mike Fagan. A New Approach for Performance Analysis of OpenMP Programs. In Proceedings of 27th International Conference on Supercomputing.
[CGO'12] Milind Chabbi and John Mellor-Crummey. DeadSpy: A Tool to Pinpoint Program Inefficiencies. In Proceedings of the International Symposium on Code Generation and Optimization, 2012. San Jose, California. [28% acceptance rate] (Best Presentation Award)
[EXADAPT'11] Milind Chabbi, John Mellor-Crummey, and Keith Cooper. 2011. Efficiently Exploring Compiler Optimization Sequences with Pairwise Pruning. In Proceedings of the 1st International Workshop on Adaptive Self-Tuning Computing Systems for the Exaflop Era. San Jose, California.
[CGO'11] Xu Liu and John Mellor-Crummey. Pinpointing Data Locality Problems Using Data-Centric Analysis. In Proceedings of 2011 International Symposium on Code Generation and Optimization.