Hongyu Zhang
 
Office URL    Google Scholar    DBLP    
 
I am currently an Associate Professor at The University of Newcastle, Australia. Before that, I was a Lead Researcher at Microsoft Research, Beijing, China (2014-2016), an Associate Professor at Tsinghua University, China (2006-2014), a Lecturer at RMIT University, Australia (2003-2006). I received my PhD degree in Computer Science from School of Computing, National University of Singapore in 2003.
 
My research is in the area of software engineering, in particular, data-driven software engineering, software analytics, software testing and debugging, software maintenance, and software reuse. The main theme of my research is to improve software quality and productivity by mining and analyzing software data.  
I am always open for collaborations!
 
Research Area:

My research area is software engineering, in particular:

  • software analytics, mining software repository, data-driven software engineering
  • intelligent software and service engineering
  • software testing, debugging, fault diagnosis
  • compiler testing and validation
  • software maintenance and reuse
The main theme of my research is to improve software quality and productivity by utilizing knowledge mined from software data. Over the years, a software organization could accumulate a large amount of data including source code, bug reports, execution logs, changes, metrics, documents, and so on. Data mining, machine learning, and information retrieval techniques can be applied to extract knowledge from the software data and solve software engineering problems. Together with my students and collaborators, I have published more than 110 research papers in reputable international journals and conferences.
 
Data-Driven Software Testing, Debugging, and Fault Diagnosis based on production logs crash reports, and bug reports.
Data-Driven Software Development based on source code and APIs.

I also work on: Bug Analysis and Prediction (statistical analysis of bugs and prediction of bug-prone modules), Debugging and Testing  (detecting, locating, and fixing bugs, compiler testing), Software Metrics (quantitative measurement of software product and process), and Software Reuse (reusing previously-written software).
 
Research Grants:
  • NSF China, Project “Software Crash Analysis”, Grant No. 61272089          (PI)
  • NSF China, Project “Software Defect Prediction Models and Applications”, Grant No. 61073006, 2011 – 2013. (PI)
  • NSF China, Project "Software Customization Techniques", Grant No. 60703060, 2008-2011. (PI)
  • NSF China, Project "Software Defect and Failure Prediction Techniques", Grant No. 90718022, 2008-2011. (PI)
  • National High-tech 863 Project No. 2007AA01Z122, 2008-2010. (Co-PI)
  • National High-tech 863 Project No. 2007AA01Z480, 2008-2010. (Co-PI)
  • The 3rd Tsinghua University Zi-Zhu Research Program, "Software Quality Measurement and Prediction", Project ID: 2010THZ0, 2011-2013. (PI)
  • The 6th Key Researcher Support Program, Tsinghua University, 2007-2009. (PI)
Tool Development:
 
Publications:
 
Referred conference and journal papers:
  • Shilin He, Qingwei Lin, Jian-Guang Lou, Hongyu Zhang, Michael R. Lyu and Dongmei Zhang,Identifying Impactful Service System Problems via Log Analysis, Proc. The ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2018), November 2018, Lake Buena Vista, Florida, USA.
  • Qingwei Lin, Ken Hsieh, Yingnong Dang, Hongyu Zhang, Kaixin Sui, Yong Xu, Jian-Guang Lou, Chenggang Li, Youjiang Wu, Randolph Yao, Murali Chintalapati and Dongmei Zhang,  Predicting Node Failure in Cloud Service Systems, Proc. The ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2018), November 2018, Lake Buena Vista, Florida, USA.
  • Qingwei Lin, Weichen Ke, Jian-Guang Lou, Hongyu Zhang, Kaixin Sui, Yong Xu, Ziyi Zhou, Bo Qiao and Dongmei Zhang, BigIN4: Instant, Interactive Insight Identification for Multi-Dimensional Big Data, Proc. The 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2018), Applied Data Science track, London, UK, August 2018.
  • Yong Xu, Kaixin Sui, Randolph Yao, Hongyu Zhang, Qingwei Lin, Yingnong Dang, Peng Li, Keceng Jiang, Wenchi Zhang, Jian-Guang Lou, Murali Chintalapati, Dongmei Zhang, Improving Service Availability of Cloud Systems by Predicting Disk Error, Proc. 2018 USENIX Annual Technical Conference (USENIX ATC'18), July 2018, Boston, MA, USA.
  • Jiajun Jiang, Yingfei Xiong, Hongyu Zhang, Qing Gao, and Xiangqun Chen, Shaping Program Repair Space with Existing Patches and Similar Code, Proc. ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2018),July 2018, Amsterdam, Netherlands.
  • Mohamad Barbar, Yulei Sui, Hongyu Zhang, Shiping Chen and Jingling Xue, Live Path Control Flow Integrity, In the 40th International Conference on Software Engineering (ICSE 2018), Gothenburg, Sweden, May 2018. (Poster)
  • Xiaodong Gu, Hongyu Zhang, Sunghun Kim, Deep Code Search, In Proc. the 40th International Conference on Software Engineering (ICSE 2018), Gothenburg, Sweden, May 2018. 
  • Rongxin Wu, Ming Wen, Shing-Chi Cheung, Hongyu Zhang, ChangeLocator: Locate Crash-Inducing Changes Based on Crash Reports, Empirical Software Engineering, Springer, Nov 2017, pp. 1-35. 
  • Zhiqiang Li, Xiao-Yuan Jing, Xiaoke Zhu, Hongyu Zhang, Baowen Xu, Shi Ying, On the Multiple Sources and Privacy Preservation Issues for Heterogeneous Defect Prediction, IEEE Transactions on Software Engineering, Dec 2017.
  • Xiaodong Gu, Hongyu Zhang, Dongmei Zhang, Sunghun Kim, DeepAM: Migrate APIs with Multi-modal Sequence to Sequence Learning, In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI 2017), Melbourne, Australia, August 19-25, 2017
  • Chengxun Shu and Hongyu Zhang, Neural Programming by Example, Proc. the 31st AAAI Conference on Artificial Intelligence (AAAI 2017), February 2017, San Francisco, California, USA.
  • Junjie Chen, Yanwei Bai, Dan Hao, Yingfei Xiong, Hongyu Zhang, and Bing Xie. Learning to Prioritize Test Programs for Compiler Testing. Proc. the 39th International Conference on Software Engineering (ICSE 2017), May 2017, Buenos Aires, Argentina.
  • Zhiqiang Li, Xiao-Yuan Jing, Xiaoke Zhu and Hongyu ZhangHeterogeneous Defect Prediction through Multiple Kernel Learning and Ensemble Learning, Proc. The 33th International Conference on Software Maintenance and Evolution (ICSME 2017), Shanghai, China, Sep 2017.
  • Hongyu Zhang, Anuj Jain, Gaurav Khandelwal, Chandrashekhar Kaushik, Scott Ge,  and Wenxiang Hu, Bing Developer Assistant – Improving Developer Productivity by Recommending Sample Code, Proc. the 24th ACM SIGSOFT International Symposium on the Foundations of Software Engineering (FSE 2016), industry track, Seattle, WA, USA, November 2016.
  • Xiaodong Gu, Hongyu Zhang, Dongmei Zhang, Sunghun Kim. Deep API Learning, in Proceedings of the 24th ACM SIGSOFT International Symposium on the Foundations of Software Engineering (FSE 2016). Seattle, WA, USA, November 13–18, 2016.
  • M. Zhou, X. Cheng, X. Guo, M. Gu, H. Zhang and X. Song, "Improving Failure Detection by Automatically Generating Test Cases Near the Boundaries," Proc. IEEE 40th Annual Computer Software and Applications Conference (COMPSAC), Atlanta, GA, 2016, pp. 164-173.
  • Qingwei Lin, Hongyu Zhang, Jian-Guang Lou, Yu Zhang, and Xuewei Chen, Log Clustering based Problem Identification for Online Service Systems, Proc. the 38th International Conference on Software Engineering (ICSE 2016), SEIP Track, May 14-22, 2016, Austin, TX.
  • Qingwei Lin, Jian-Guang Lou, Hongyu Zhang, and Dongmei Zhang, iDice: Problem Identification for Emerging Issues, Proc. the 38th International Conference on Software Engineering (ICSE 2016), May 14-22, 2016, Austin, TX.
  • Junjie Chen, Wenxiang Hu, Dan Hao, Yingfei Xiong, Hongyu Zhang, Lu Zhang, and Bing Xie, An Empirical Comparison of Compiler Testing Techniques, Proc. the 38th International Conference on Software Engineering (ICSE 2016), May 14-22, 2016, Austin, TX.
  • Junjie Chen, Yanwei Bai, Dan Hao, Yingfei Xiong, Hongyu Zhang, Lu Zhang, and Bing Xie, Test Case Prioritization for Compilers: A Text-Vector Based Approach, Proc. the International Conference on Software Testing, Verification and Validation (ICST 2016), April 10-15, 2016, Chicago, IL, USA.
  • Rongxin Wu, Xiao Xiao, S.C. Cheung, Hongyu Zhang, and Charles Zhang, Casper: An Efficient Approach to Call Trace Collection, in Proc. POPL’16, St. Petersburg, Florida, USA, Jan 2016.    

  • Fei Lv, Hongyu Zhang, Jian-guang Lou, Shaowei Wang, Dongmei Zhang, and Jianjun Zhao, "CodeHow: Effective Code Search based on API Understanding and Extended Boolean Model", in Proc. ASE 2015, Lincoln, Nebraska, Nov 2015.
  • Jieming Zhu, Pinjia He, Qiang Fu, Hongyu Zhang, Michael R. Lyu, Dongmei Zhang, Learning to Log: Helping Developers Make Informed Logging Decisions, Proc. ICSE 2015, Florence, Italy, May 2015.
  • Hucheng Zhou, Jian-Guang Lou, Hongyu Zhang, Haibo Lin, Haoxiang Lin and Tingting Qin, An Empirical Study on Quality Issues of Production Big Data Platform, Proc. ICSE 2015, SEIP Track, Florence, Italy, May 2015.
  • Rui Ding, Hucheng Zhou, Jian-Guang Lou, Hongyu Zhang, Qingwei Lin, Qiang Fu, Dongmei Zhang, Tao Xie, Log2: A Cost-Aware Logging Mechanism for Performance Diagnosis, The 2015 USENIX Annual Technical Conference (USENIX ATC'15), Santa Clara, CA, July 2015.
Scholarly book chapters:
  • Qingwei Lin, Jian-Guang Lou, Hongyu Zhang, Dongmei Zhang, “How to Tame Your Online Services”, Chapter 12 of the book Perspectives on Data Science for Software Engineering (editors: T. Menzies, L. Williams, T. Zimmermann), Morgan-Kaufmann, 2016, ISBN 978-0128042069.
  • Zhitao Hou, Hongyu Zhang, Haidong Zhang, Dongmei Zhang, “Visual Analytics for Software Engineering Data”, Chapter 15 of the book Perspectives on Data Science for Software Engineering (editors: T. Menzies, L. Williams, T. Zimmermann ), Morgan Kaufmann, 2016, ISBN 978-0128042069.
  • Hai Wang, Yuan Fang Li, Jing Sun, Hongyu Zhang, Jeff Z. Pan, “Towards a Consistent Feature Model using OWL”, a chapter of the book Semantic Web Enabled Software Engineering (editors. J. Pan, Y. Zhao), IOS Press, 2014.ISBN 978-1-61499-369-8.
  • Weishan Zhang, Stan Jarzabek, Hongyu Zhang, Neil Loughran, Awais Rashid, “Software evolution with XVCL”, Chapter VI of the book Software Evolution with UML and XML (editor H. Yang), Idea Group Publishing, 2005. ISBN 9781591404620.
 
Research Program Committee:

Program Organizations:

  • General co-chair, The 36th International Conference on Software Maintenance and Evolution (ICSME 2020)
  • Program co-chair, The 18th IEEE International Conference on Software Quality, Reliability, and Security (QRS 2018)
  • Program co-chair, The 25th Asia-Pacific Software Engineering Conference (APSEC 2018)
  • Tool Demonstration co-chair: The International Symposium of Software Testing and Analysis (ISSTA 2019) 
  • Short Paper chair: 2018 Australian Software Engineering Conference (ASWEC 2018)
  • Co-organizer: Dagstuhl Seminar 17502 on "Testing and Verification of Compilers", Dec 2017, Germany.
  • Program co-chair, Early Research Achievements (ERA) track, ICSME’16
  • Program co-chair, The 12th International Conference on Predictive Models and Data Analytics in Software Engineering (PROMISE’16)
  • The International Conference on Predictive Models in Software Engineering (PROMISE), 2014-Now.     (Steering Committee Member)
  • The Second International Workshop on Software Mining (SoftMine-2013, co-located with ASE'13),  Silicon Valley, CA, November 2013.  (co-organizers)
  • The 8th International Workshop on Advanced Modularization Techniques (AOAsia/Pacific 2013), a workshop at AOSD 2013, March 2013.
  • The First International Workshop on Software Mining (SoftMine-2012, co-located with KDD'12),  Beijing, China, May 2012.  (co-organizers)
  • The 12th International Conference on Quality Software (QSIC 2012), August 2012, Xi'an, China. (industry track co-chairs)
  • The 26th European Conference on Object-Oriented Programming (ECOOP 2012), June 2012, Beijing, China. (local organisation co-chairs)
  • ICSE 2014 Workshop on Emerging Trends in Software Metrics (WETSoM @ ICSE 2014), India, June 2014. (co-organizers)
  • ICSE 2012 Workshop on Emerging Trends in Software Metrics (WETSoM @ ICSE 2012), Zurich, Switzerland, June 2012. (co-organizers)
  • ICSE 2011 Workshop on Emerging Trends in Software Metrics (WETSoM @ ICSE 2011), May, 2011, Honolulu, Hawaii, USA. (co-organizers)
  • ICSE 2010 Workshop on Emerging Trends in Software Metrics (WETSoM @ ICSE 2010), May 4, 2010, Cape Town, South Africa. (co-organizers)
  • The 1st International Symposium on Emerging Trends in Software Metrics (ETSM 2009), 26 May, 2009, Pula, Sardinia, Italy. (co-organizers)
  • 15th Asia-Pacific Software Engineering Conference (APSEC 2008), Beijing, China, Dec 2008 (publicity chair). 

Journal Services:
I am on the Editorial Board of Journal of Systems and Software, and the Reviewer Board of the Journal of Empirical Software Engineering.
I am also a frequent reviewer for the following international journals: IEEE Transactions on Software Engineering, ACM Transactions on Software Engineering and Methodology, IEEE Software, IEEE Transactions on Knowledge and Data Engineering, Automated Software Engineering, Science of Computer Programming, Software Quality Journal, Software Practice & Experience,  Journal of Software Maintenance and Evolution. 
 
I also review proposals for Natural Science Foundation of China (NSFC) and Natural Sciences and Engineering Research Council of Canada (NSERC).
 
Recent Invited Talks/Seminars:
  • Keynote: Improving Software Quality through Data Analytics, The 7th International Workshop on Emerging Trends in Software Metrics, May 15, 2016, Austin, Texas, USA (co-located with ICSE 2016)
  • Keynote: Software Analytics: Data-Driven Software EngineeringThe Fourth International Workshop on Software Mining, Nov 2015, 2016, Lincoln, Nebraska, USA (co-located with ASE 2015)
  • Invited:  Code Search: Research and Practice, The 3rd Chinese forum of Software Engineering Research and Practice (SERP 2016), July 20, 2016, Beijing, China
  • Invited: Towards a Theory of Software Engineering, The 5th International Workshop on Theory-Oriented Software Engineering, May 15, 2016, Austin, Texas, USA (co-located with ICSE 2016)
  • Invited: Effective Bug Management via Software Analytics, 4th International Symposium on High Confidence Software (ISHCS 2015), Jan 2015, Beijing, China.
  • Invited: Monte Verita Symposium on Developer Support, Switzerland, March 2012.
  • Invited: MSR (Mining Software Repository) Vision 2020, Canada, August 2012.
  • Invited: Symposium on Advanced Software Engineering Techniques, Shanghai Jiaotong University, 2012.
  • Invited: Symposium on Software Quality and Analysis, Nanjing University, 2012.
  • Seminar: at University of Texas at Dallas, Feb 2017.
  • Seminar: at University of Science and Technology Beijing, April 2016. 
  • Seminar: at Chinese Academy of Science, April 2016.
  • Seminar: at Tsinghua University, May 2014.

Visiting Positions:

I was a visiting professor/scholar at the following organizations:

  • University of Cagliari, Italy (1/2011 – 3/2011)
  • Microsoft Research Asia (7/2012 – 8/2012)
  • Swinburne University of Technology, Australia (8/2012 – 9/2012)
  • The Hong Kong University of Science and Technology (10/2012 – 3/2013)
  • University of Toronto/University of Waterloo (5/2001 - 9/2001)  

Teaching:

I taught the following courses to postgraduate and undergraduate students: 

At Microsoft Research Asia:

At the School of Software, Tsinghua University:

  • Software Measurement and Process Improvement (first semester)
  • Software Quality Engineering (second semester. This course was evaluated top 15% among all postgraduate courses offered in Tsinghua University in 2011)

At the School of Computer Science and Information Technology, RMIT University:

  • ISYS1081/2368 "Software Reuse"  
  • COSC1073 "Programming Principles 1A" (a first-year Java programming course)
  • COSC1095/1295 "Programming Principles 2J"/"Java For Programmers"   
  • COSC1252/1264 “Object-Oriented Programming”   

Students:

I am grateful that I have the privilege to advise the following brilliant students/interns:

Liya Chakma, Rongxin Wu (now at HKUST), Jian Zhou (now at Baidu), Liang Gong (now at UC Berkeley), Jianxun Yang, Shuijin Lu, Jue Wang (now at Postal Bank), Shuai Chen (now at Facebook), Wei Li (now at Google), Jiangtao Gong (now at Tsinghua), Ke Ma, Bei Shi (now at CUHK), Lu Zhang (now at Virginia Tech), Zeqi Shen, Yu Cao, Bo Zhang...

Fei Lv (now at Alibaba), Galina Meyer (now at Stanford), Qing Ren (now at UCLA), Pinjia He, Sheng Tian, Wenhao Song, Senlan Yao (now at Google), Bonan Dong (now at Cornell), Xutong Chen, Wangsheng Hu, Hong Wu (now at Morgan Stanley), Jinbo Pan, Xiaodong Gu (Nanjing University), Wenxiang Hu (now at Microsoft), Chengxun Shu (now at 4Paradigm), Xingzhao Yue (now at Huawei), Chen Xia (now at UCLA)... 

Note: If I missed any of you accidently, please do email me (and forgive me). Please also let me know your latest status. 


My Erdös number is 4: Hongyu Zhang - Stanislaw Jarzabek - Tomasz Krawczyk - William T. Trotter, Jr. - Paul Erdös 

I am a senior member of IEEE.


(Last updated: May 2018)
 
Psalm 67:1-3: May God be gracious to us and bless us, and make his face shine on us, so that your ways may be known on earth, your salvation among all nations.