Qingda Lu

(Ph.D. The Ohio State University. 2008)
Email: MY_FIRST_NAME AT gmail DOT com


I work on cloud storage systems in Alibaba Group. Before I worked at Intel from 2009 to 2015, mainly on operating systems-related projects. I am currently living in the Seattle area.

From 2002 to 2008 I was a PhD student in Department of Computer Science and Engineering at The Ohio State University, mainly working on compiler / run-time optimization techniques. My research publications can be found below.

Publications

Qingpeng Niu, James Dinan, Qingda Lu, P. Sadayappan. "PARDA: A Fast Parallel Reuse Distance Analysis Algorithm", Proc. 26th Intl. Parallel and Distributed Processing Symp. (IPDPS), 2012

Q. Lu, X. Gao, S. Krishnamoorthy, G. Baumgartner, J. Ramanujam, P. Sadayappan "Empirical Performance-Model Driven Data Layout Optimization and Library Call Selection for Tensor Contraction Expressions", Journal of Parallel and Distributed Computing, Vol. 72, No. 3, March 2012, pp. 338-352

Jiang Lin, Qingda Lu, Xiaoning Ding, Zhao Zhang, Xiaodong Zhang and P. Sadayappan "Enabling Software Management for Multicore Cache with Lightweight Hardware Support", 18th International Symposium on Super Computing (SC-2009), 2009

Qingda Lu, Jiang Lin, Xiaoning Ding, Zhao Zhang, Xiaodong Zhang and P. Sadayappan. "Soft-OLP: Improving Hardware Cache Performance Through Software-Controlled Object-Level Cache Partitioning", 18th International Symposium on Parallel Architectures and Compilation Techniques (PACT-18),2009

Qingda Lu, Christophe Alias, Uday Bondhugula, Thomas Henretty, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan, Yongjian Chen, Haibo Lin, and Tin-fook Ngai. "Data Layout Transformation for Enhancing Locality on NUCA Chip Multiprocessors", 18th International Symposium on Parallel Architectures and Compilation Techniques (PACT-18),2009

A. Hartono, Q. Lu, T. Henretty, S. Krishnamoorthy, H. Zhang, G. Baumgartner, D.E. Bernholdt, M. Nooijen, R.M. Pitzer, J. Ramanujam, P. Sadayappan. "Performance Optimization of Tensor Contraction Expressions for Many-Body Methods in Quantum Chemistry" Journal of Physical Chemistry A, Vol. 113, No. 45, 2009

Rubao Lee, Xiaoning Ding, Feng Chen, Qingda Lu, and Xiaodong Zhang, "MCC-DB: Minimizing Cache Conflicts in Multi-Core Processors for Databases", 35th International Conference on Very large Databases (VLDB 2009), 2009

Jiang Lin, Qingda Lu, Xiaoning Ding, Zhao Zhang, Xiaodong Zhang, and P. Sadayappan, "Gaining Insights into Multi-Core Cache Partitioning: Bridging the Gap between Simulation and Real Systems", 14th International Symposium on High-Performance Computer Architecture (HPCA) , 2008

Qingda Lu, Sriram Krishnamoorthy, P. Sadayappan, "Combining Analytical and Empirical Approaches in Tuning Matrix Transposition", the 15th international conference on Parallel architectures and compilation techniques (PACT),2006

Albert Hartono, Qingda Lu, Xiaoyang Gao, Sriram Krishnamoorthy, Marcel Nooijen, Gerald Baumgartner, David E. Bernholdt, Venkatesh Choppella, Russell M. Pitzer, J. Ramanujam, Atanas Rountev, P. Sadayappan. "Identifying Cost-Effective Common Subexpressions to Reduce Operation Count in Tensor Contraction Evaluations. ", International Conference on Computational Science (ICCS), 2006

Xiaoyang Gao, Swarup Kumar Sahoo, Qingda Lu, Gerald Baumgartner, Chi-Chung Lam, J. Ramanujam, P. Sadayappan, "Performance Modeling and Optimization of Parallel Out-of-Core Tensor Contractions", ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 2005.

G. Baumgartner, A. Auer, D.E. Bernholdt, A. Bibireata, V. Choppella, D. Cociorva, X. Gao, R.J. Harrison, S. Hirata, S. Krishnamoorthy, S. Krishnan, C. Lam, Q. Lu, M. Nooijen, R.M. Pitzer, J. Ramanujam, P. Sadayappan and A. Sibiryakov. "Synthesis of High-Performance Parallel Programs for a Class of Ab Initio Quantum Chemistry Models" Invited paper for special issue of Proceedings of the IEEE, January 2005.

Qingda Lu, Xiaoyang Gao, Sriram Krishnamoorthy, Gerald Baumgartner, J. Ramanujam, P. Sadayappan, "Empirical Performance-Model Driven Data Layout Optimization," Proceedings of the 17th International Workshop on Languages and Compilers for Parallel Computing (LCPC), September, 2004

Qingda Lu, Jiesheng Wu, Dhabaleswar K. Panda and P. Sadayappan. "Applying MPI Derived Datatypes to the NAS Benchmarks: A Case Study", 3rd Workshop on Compile and Runtime Techniques for Parallel Computing (CRTPC-3), held in conjunction with The 2004 International Conference on Parallel Processing (ICPP-2004) , August, 2004.