Hongyu Zhang

Primary Email Address: hongyujohn@gmail.com

Office URL Google Scholar DBLP

I am currently an Associate Professor at The University of Newcastle, Australia. Before that, I was a Lead Researcher at Microsoft Research, Beijing, China (2014-2016) and an Associate Professor at Tsinghua University, China (2006-2014). I received my PhD degree in Computer Science from the School of Computing, National University of Singapore, in 2003.

My research is in the area of software engineering, in particular, data-driven software engineering, software analytics, software testing and debugging, software maintenance, and software reuse. The main theme of my research is to improve software quality and productivity by mining and analyzing software data.

I am always open to collaboration!

Research Area:

My research area is software engineering, in particular:

  • software analytics, data-driven software engineering
  • intelligent software and service engineering
  • software testing, debugging, fault diagnosis
  • compiler testing and validation
  • software maintenance and reuse, software product line

The main theme of my research is to improve software quality and productivity by utilizing knowledge mined from software data. Over the years, a software organization could accumulate a large amount of data including source code, bug reports, execution logs, changes, metrics, documents, and so on. Data mining, machine learning, and information retrieval techniques can be applied to extract knowledge from the software data and solve software engineering problems. Together with my students and collaborators, I have published more than 110 research papers in reputable international journals and conferences.

Data-Driven Software Testing, Debugging, and Fault Diagnosis based on production logs, crash reports, and bug reports.

Data-Driven Software Development based on source code and APIs.

I also work on: Bug Analysis and Prediction (statistical analysis of bugs and prediction of bug-prone modules), Debugging and Testing (detecting, locating, and fixing bugs, compiler testing), Software Metrics (quantitative measurement of software product and process), and Software Reuse (reusing previously-written software).

Research Grants:

  • NSF China, Project “Software Crash Analysis”, Grant No. 61272089 (PI)
  • NSF China, Project “Software Defect Prediction Models and Applications”, Grant No. 61073006, 2011 – 2013. (PI)
  • NSF China, Project "Software Customization Techniques", Grant No. 60703060, 2008-2011. (PI)
  • NSF China, Project "Software Defect and Failure Prediction Techniques", Grant No. 90718022, 2008-2011. (PI)
  • National High-tech 863 Project No. 2007AA01Z122, 2008-2010. (Co-PI)
  • National High-tech 863 Project No. 2007AA01Z480, 2008-2010. (Co-PI)
  • The 3rd Tsinghua University Zi-Zhu Research Program, "Software Quality Measurement and Prediction", Project ID: 2010THZ0, 2011-2013. (PI)
  • The 6th Key Researcher Support Program, Tsinghua University, 2007-2009. (PI)

Tool Development:


Refereed conference and journal papers:

Scholarly book chapters:

  • Qingwei Lin, Jian-Guang Lou, Hongyu Zhang, Dongmei Zhang, “How to Tame Your Online Services”, Chapter 12 of the book Perspectives on Data Science for Software Engineering (editors: T. Menzies, L. Williams, T. Zimmermann), Morgan Kaufmann, 2016, ISBN 978-0128042069.
  • Zhitao Hou, Hongyu Zhang, Haidong Zhang, Dongmei Zhang, “Visual Analytics for Software Engineering Data”, Chapter 15 of the book Perspectives on Data Science for Software Engineering (editors: T. Menzies, L. Williams, T. Zimmermann), Morgan Kaufmann, 2016, ISBN 978-0128042069.
  • Hai Wang, Yuan Fang Li, Jing Sun, Hongyu Zhang, Jeff Z. Pan, “Towards a Consistent Feature Model using OWL”, a chapter of the book Semantic Web Enabled Software Engineering (editors: J. Pan, Y. Zhao), IOS Press, 2014. ISBN 978-1-61499-369-8.
  • Weishan Zhang, Stan Jarzabek, Hongyu Zhang, Neil Loughran, Awais Rashid, “Software evolution with XVCL”, Chapter VI of the book Software Evolution with UML and XML (editor H. Yang), Idea Group Publishing, 2005. ISBN 9781591404620.

Research Program Committee:

Program Organizations:

  • General co-chair, The 36th International Conference on Software Maintenance and Evolution (ICSME 2020)
  • Program co-chair, The 18th IEEE International Conference on Software Quality, Reliability, and Security (QRS 2018)
  • Program co-chair, The 25th Asia-Pacific Software Engineering Conference (APSEC 2018)
  • Tool Demonstration co-chair: The International Symposium on Software Testing and Analysis (ISSTA 2019)
  • Short Paper chair: 2018 Australian Software Engineering Conference (ASWEC 2018)
  • Co-organizer: Dagstuhl Seminar 17502 on "Testing and Verification of Compilers", Dec 2017, Germany.
  • Program co-chair, Early Research Achievements (ERA) track, ICSME’16
  • Program co-chair, The 12th International Conference on Predictive Models and Data Analytics in Software Engineering (PROMISE’16)
  • The International Conference on Predictive Models in Software Engineering (PROMISE), 2014-2017. (Steering Committee Member)
  • The Second International Workshop on Software Mining (SoftMine-2013, co-located with ASE'13), Silicon Valley, CA, November 2013. (co-organizer)
  • The 8th International Workshop on Advanced Modularization Techniques (AOAsia/Pacific 2013), a workshop at AOSD 2013, March 2013.
  • The First International Workshop on Software Mining (SoftMine-2012, co-located with KDD'12), Beijing, China, May 2012. (co-organizer)
  • The 12th International Conference on Quality Software (QSIC 2012), August 2012, Xi'an, China. (industry track co-chair)
  • The 26th European Conference on Object-Oriented Programming (ECOOP 2012), June 2012, Beijing, China. (local organisation co-chair)
  • ICSE 2014 Workshop on Emerging Trends in Software Metrics (WETSoM @ ICSE 2014), India, June 2014. (co-organizer)
  • ICSE 2012 Workshop on Emerging Trends in Software Metrics (WETSoM @ ICSE 2012), Zurich, Switzerland, June 2012. (co-organizer)
  • ICSE 2011 Workshop on Emerging Trends in Software Metrics (WETSoM @ ICSE 2011), May 2011, Honolulu, Hawaii, USA. (co-organizer)
  • ICSE 2010 Workshop on Emerging Trends in Software Metrics (WETSoM @ ICSE 2010), May 4, 2010, Cape Town, South Africa. (co-organizer)
  • The 1st International Symposium on Emerging Trends in Software Metrics (ETSM 2009), 26 May, 2009, Pula, Sardinia, Italy. (co-organizer)
  • 15th Asia-Pacific Software Engineering Conference (APSEC 2008), Beijing, China, Dec 2008 (publicity chair).

Journal Services:

I am on the Editorial Board of:

and on the Reviewer Board of the Journal of Empirical Software Engineering.

I am also a frequent reviewer for the following international journals: IEEE Transactions on Software Engineering, ACM Transactions on Software Engineering and Methodology, IEEE Software, IEEE Transactions on Knowledge and Data Engineering, Automated Software Engineering, Science of Computer Programming, Software Quality Journal, Software Practice & Experience, Journal of Software Maintenance and Evolution.

I also review proposals for the National Natural Science Foundation of China (NSFC) and the Natural Sciences and Engineering Research Council of Canada (NSERC).

Recent Invited Talks/Seminars:

  • Keynote: Intelligent Fault Diagnosis and Prediction through Data Analytics, The 6th International Workshop on Quantitative Approaches to Software Quality (QuASoQ 2018).
  • Invited: AI-Enabled Software and Service Engineering, The 2018 Computing in the 21st Century Conference & Asia Faculty Summit, Microsoft Research Asia, Nov 2018.
  • Invited: Log-based Fault Diagnosis for Large-Scale Software Systems, Asian-Pacific Workshop of Advanced Software Engineering, Gold Coast, Australia, Nov 2018.
  • Invited: Towards Intelligent Software Development, The First Yanqi Meeting on Automatic Software Engineering, Beijing, China, Oct 2018.
  • Invited: Towards Intelligent Code Reuse, 2017 China Software Engineering Research and Industry Summit, Sep 2017, Shanghai, China.
  • Keynote: Software Analytics: Data-Driven Software Engineering, The Fourth International Workshop on Software Mining, Nov 2015, Lincoln, Nebraska, USA (co-located with ASE 2015)
  • Invited: Code Search: Research and Practice, The 3rd Chinese forum of Software Engineering Research and Practice (SERP 2016), July 20, 2016, Beijing, China
  • Invited: Towards a Theory of Software Engineering, The 5th International Workshop on Theory-Oriented Software Engineering, May 15, 2016, Austin, Texas, USA (co-located with ICSE 2016)
  • Invited: Effective Bug Management via Software Analytics, 4th International Symposium on High Confidence Software (ISHCS 2015), Jan 2015, Beijing, China.
  • Invited: Monte Verita Symposium on Developer Support, Switzerland, March 2012.
  • Invited: MSR (Mining Software Repositories) Vision 2020, Canada, August 2012.
  • Invited: Symposium on Advanced Software Engineering Techniques, Shanghai Jiaotong University, 2012.
  • Invited: Symposium on Software Quality and Analysis, Nanjing University, 2012.
  • Seminar: at University of Texas at Dallas, Feb 2017.
  • Seminar: at University of Science and Technology Beijing, April 2016.
  • Seminar: at Chinese Academy of Sciences, April 2016.
  • Seminar: at Tsinghua University, May 2014.

Visiting Positions:

I was a visiting professor/scholar at the following organizations:

  • University of Cagliari, Italy (1/2011 – 3/2011)
  • Microsoft Research Asia (7/2012 – 8/2012)
  • Swinburne University of Technology, Australia (8/2012 – 9/2012)
  • The Hong Kong University of Science and Technology (10/2012 – 3/2013)
  • University of Toronto/University of Waterloo (5/2001 - 9/2001)


I taught the following courses to postgraduate and undergraduate students:

  • Software Verification and Validation
  • Software Measurement
  • Software Quality Engineering (This course was rated in the top 15% of all postgraduate courses offered at Tsinghua University in 2011)
  • Software Reuse
  • Java programming course
  • Object-Oriented Programming


I am grateful to have had the privilege of advising the following brilliant students/interns:

Liya Chakma, Rongxin Wu (now at HKUST), Jian Zhou (now at Baidu), Liang Gong (now at UC Berkeley), Jianxun Yang, Shuijin Lu, Jue Wang (now at Postal Bank), Shuai Chen (now at Facebook), Wei Li (now at Google), Jiangtao Gong (now at Tsinghua), Ke Ma, Bei Shi (now at CUHK), Lu Zhang (now at Virginia Tech), Zeqi Shen, Yu Cao, Bo Zhang...

Fei Lv (now at Alibaba), Galina Meyer (now at Stanford), Qing Ren (now at UCLA), Pinjia He, Sheng Tian, Wenhao Song, Senlan Yao (now at Google), Bonan Dong (now at Cornell), Xutong Chen, Wangsheng Hu, Hong Wu (now at Morgan Stanley), Jinbo Pan, Xiaodong Gu (Nanjing University), Wenxiang Hu (now at Microsoft), Chengxun Shu (now at 4Paradigm), Xingzhao Yue (now at Huawei), Chen Xia (now at UCLA)...

Note: If I have accidentally missed any of you, please do email me (and forgive me). Please also let me know your latest status.

My Erdős number is 4: Hongyu Zhang - Stanislaw Jarzabek - Tomasz Krawczyk - William T. Trotter, Jr. - Paul Erdős

I am a senior member of IEEE.

(Last updated: May 2019)

Psalm 67:1-3: May God be gracious to us and bless us, and make his face shine on us, so that your ways may be known on earth, your salvation among all nations.