Publications
2024
[J30] SimClone: Detecting Tabular Data Clones using Value Similarity. Xu Yang, Gopi Krishnan Rajbahadur, Dayi Lin, Shaowei Wang, Zhen Ming (Jack) Jiang. ACM Transactions on Software Engineering and Methodology (TOSEM), 2024.
[J29] Studying and Recommending Information Highlighting in Stack Overflow Answers. Shahla Ahmed, Shaowei Wang, Xiaoxiang Zhang, Tse-Hsun Chen, Yuan Tian. Information and Software Technology (IST), 2024.
[C27] Towards Better Graph Neural Neural Network-based Fault Localization Through Enhanced Code Representation. Md Nakhla Raf, Dong Jae Kim, An Ran Chen, Tse-Hsun (Peter) Chen, Shaowei Wang. In Proceedings of the 32th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (FSE), 2024. Acceptance rate 25.5%.
[J28] Multi-Language Software Development: Issues, Challenges, and Solutions, Haoran Yang, Yu Nong, Shaowei Wang, and Haipeng Cai. IEEE Transactions on Software Engineering (TSE), 2024
[C26] On the Executability of R Markdown Files. Md Anaytul Islam, Muhammad Asaduzzaman, Shaowei Wang. In Proceedings of the 21th International Conference on Mining Software Repositories (MSR), 2024.
[C25] LLMParser: An Exploratory Study on Using Large Language Models for Log Parsing. Zeyang Ma, An Ran Chen, Dong Jae Kim, Tse-Hsun (Peter) Chen, Shaowei Wang. 46th IEEE/ACM International Conference on Software Engineering (ICSE), 2024.
2023
[J27] Study the Correlation Between the readme File of GitHub Projects and Their Popularity . Tianlei Wang, Shaowei Wang, se-Hsun (Peter) Chen. Journal of Systems & Software (JSS), 2023.
[J26] A Study of Update Request Comments in Stack Overflow Answer Posts. Mohammad Sadegh Sheikhaei, Yuan Tian, Shaowei Wang. Journal of Systems & Software (JSS), 2023.
[C24] Demystifying Issues, Challenges, and Solutions for Multilingual Software Development. Haoran Yang, Weile Lian, Shaowei Wang, Haipeng Cai. 45th IEEE/ACM International Conference on Software Engineering (ICSE), 2023. [pre-print]
[C23] Does data sampling improve deep learning-based vulnerability detection? Yeas! and Nays! Xu Yang, Shaowei Wang, Li Yi, Shaohua Wang. 45th IEEE/ACM International Conference on Software Engineering (ICSE), 2023. [pre-print]
[J25] An Empirical Study of Text-based Machine Learning Models for Vulnerability Detection. Kollin Napier, Tanmay Bhowmik, Shaowei Wang. Empirical Software Engineering Journal (EMSE), 2023.
2022
[J24] T-Evos: A Large-Scale Longitudinal Study on CI Test Execution and Coverage Evolution. An Ran Chen, Tse-Hsun Chen, Shaowei Wang. IEEE Transactions on Software Engineering (TSE), 2022.
[J23] An empirical study on the challenges that developers encounter when developing Apache Spark applications. Wang, Zehao, Tse-Hsun Peter Chen, Haoxiang Zhang, and Shaowei Wang. Journal of Systems and Software (JSS). 2022.
[C22] A First Look at Information Highlighting in Stack Overflow Answers. Shahla Ahmed, Shaowei Wang, Xiaoxiang Zhang, Tse-Hsun Chen, Yuan Tian. 38th International Conference on Software Maintenance and Evaluation (ICMSE), NIER, 2022. [PDF][Code]
[J22] Real World Projects, Real Faults: Evaluating Spectrum Based Fault Localization Techniques on Python Projects . Ratnadira Widyasari, Gede Artha, Azriadi Prana, Stefanus Agus Haryono, Shaowei Wang, David Lo. Empirical Software Engineering Journal (EMSE), 2022. [PDF]
[J21] Studying the Practices of Logging Exception Stack Traces in Open-Source Software Projects. Heng Li, Haoxiang Zhang, Shaowei Wang, Ahmed E Hassan. IEEE Transactions on Software Engineering (TSE), 2022.
[J20] Studying Donations and their Expenses in Open Source Projects: A Case Study of GitHub Projects Collecting Donations through Open Collectives. Jiayuan Zhou, Shaowei Wang, Yasutaka Kamei, Ahmed E. Hassan, Naoyasu Ubayashi. Empirical Software Engineering Journal (EMSE), 2021 [PDF].
2021
[C21] [Best paper award] Is reputation on Stack Overflow always a good indicator for users’ expertise? No! Shaowei Wang, Daniel M. German, Tse-Hsun Chen, Yuan Tian, Ahmed E. Hassan. 37th International Conference on Software Maintenance and Evaluation (ICMSE), NIER, 2021. [PDF]
[C20] IncBL: Incremental Bug Localization. Zhou Yang , Jieke Shi , Shaowei Wang , David Lo . The 36th IEEE/ACM International Conference on Automated Software Engineering (ASE), Tool demo, 2021.
[C19] Would You Like a Quick Peek? Providing Logging Support to Monitor Data Processing in Big Data Application. Zehao Wang, Haoxiang Zhang, Tse-Hsun (Peter) Chen, Shaowei Wang. In Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (FSE), 2021. [PDF]
[J19] Studying Backers and Hunters in Bounty Issue Addressing Process of Open Source Projects. Jiayuan Zhou, Shaowei Wang, Haoxiang Zhang, Tse-Hsun Chen, Ahmed E. Hassan. Empirical Software Engineering Journal (EMSE), 2021. [PDF]
[J18] Pathidea: Improving Bug Localization by Re-Constructing Execution Paths Using Logs. An Ran Chen, Tse-Hsun (Peter) Chen, Shaowei Wang. IEEE Transactions on Software Engineering (TSE), 2021 [PDF]
[J17] Demystifying the challenges and benefits of analyzing user-reported logs in bug reports. Chen, An Ran, Tse-Hsun Peter Chen, and Shaowei Wang. Empirical Software Engineering (EMSE), 2021. [PDF]
[J16] The impact of feature importance methods on the interpretation of defect classifiers. Rajbahadur, Gopi Krishnan, Shaowei Wang, Gustavo Ansaldi, Yasutaka Kamei, and Ahmed E. Hassan. IEEE Transactions on Software Engineering (TSE), 2021. [PDF]
[J15] A Study of C/C++ Code Weaknesses on Stack Overflow, Haoxiang Zhang, Shaowei Wang, Heng Li, THP Chen, Ahmed E. Hassan, IEEE Transactions on Software Engineering (TSE), 2021. [PDF]
2020
[J14] Are comments on Stack Overflow well organized for easy retrieval by developers? Haoxiang Zhang, Shaowei Wang, Tse-Hsun Chen, Ahmed E. Hassan, ACM Transactions on Software Engineering and Methodology (TOSEM), 2020. [PDF]
[J13] A Study of Bug Management Using the Stack Exchange Question and Answering Platform. Aaditya Bhatia, Shaowei Wang, Muhammad Asaduzzaman, and Ahmed E. Hassan, IEEE Transactions on Software Engineering (TSE), 2020. [PDF]
[J12] Studying the Association between Bountysource Bounties and the Issue-addressing Likelihood of GitHub Issue Reports. Jiayuan Zhou, Shaowei Wang, Cor-Paul Bezemer, Ying Zou, Ahmed E. Hassan, IEEE Transactions on Software Engineering (TSE), 2020. [PDF]
2019
[J11] Reading Answers on Stack Overflow: Not Enough! Haoxiang Zhang, Shaowei Wang, Tse-Hsun Chen, Ahmed, E. Hassan, IEEE Transactions on Software Engineering (TSE), 2019. [PDF]
[J10] Impact of Discretization Noise on Machine Learning Classifiers when Studying Software Engineering Datasets. Gopi Krishnan Rajbahadur, Shaowei Wang, Yasutaka Kamei, Ahmed E Hassan, IEEE Transactions on Software Engineering (TSE), 2019. [PDF]
[J9] Bounties on Technical Q&A Sites: A Case Study of Stack Overflow Bounties. Jiayuan Zhou, Shaowei Wang, Cor-Paul Bezemer, Ahmed E. Hassan, Empirical Software Engineering (EMSE), 2019. [PDF]
[J8] An Empirical Study of Obsolete Answers on Stack Overflow . Haoxiang Zhang, Shaowei Wang, Tse-Hsun Chen, Ying Zou, Ahmed, E. Hassan, IEEE Transactions on Software Engineering (TSE), 2019. [PDF]
2018
[J7] How Do Users Revise Answers on Technical Q&A Websites? A Case Study on Stack Overflow. Shaowei Wang, Tse-Hsun Chen, Ahmed, E. Hassan, IEEE Transactions on Software Engineering, 2018 (TSE). [PDF]
[J6] How Do Developers Utilize Source Code from Stack Overflow? Yuhao Wu, Shaowei Wang, Cor-Paul Bezemer, Katsuro Inoue. Empirical Software Engineering (EMSE), 2018. [PDF]
[J5] Studying the consistency of star ratings and reviews of popular free hybrid android and iOS apps, Hanyang Hu, Shaowei Wang, Cor-Paul Bezemer, Ahmed E. Hassan. Empirical Software Engineering (EMSE), 2018. [PDF]
2017
[J4] Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empirical Study on Four Stack Exchange Websites, Shaowei Wang, Tse-Hsun Chen, Ahmed E. Hassan, Empirical Software Engineering (EMSE), 2017. [Selected as a Journal-First paper at ICSE 2018] [PDF]
[J3] EnTagRec++: An Enhanced Tag Recommendation System for Software Information Sites, Shaowei Wang, David Lo, Bogdan Vasilescu, Alexander Serebrenik, Empirical Software Engineering (EMSE), 2017. [PDF]
[C18] The Impact of Using Regression Models to Build Defect Classifiers, Gopi Krishnan Rajbahadur, Shaowei Wang, Yasutaka Kamei, Ahmed E Hassan, In Proceedings of the 14th International Conference on Mining Software Repositories (MSR), 2017.
2016 and before
[J2] AmaLgam+: Composing Rich Information Sources for Accurate Bug Localization, Shaowei Wang, David Lo, Journal of Software: Evolution and Process, 2016 (JSEP). [preprint]
[C17] Query expansion via WordNet for effective code search, Meili Lu, Xiaobing Sun, Shaowei Wang, David Lo, Yucong Duan, 22nd IEEE International Conference on Software Analysis (SANER), Evolution, and Reengineering.
[C16] Active Semi-supervised Approach for Checking App Behavior against Its Description, Siqi Ma, Shaowei Wang, David Lo, Robert Huijie Deng, Cong Sun, 39th IEEE Annual Computer Software and Applications Conference (COMPSAC), 2015.
[C15] CodeHow: Effective Code Search based on API Understanding and Extended Boolean Model, Fei Lv, Hongyu Zhang, Jian-guang Lou, Shaowei Wang, Dongmei Zhang, Jianjun Zhao, 30th IEEE/ACM International Conference on Automated Software Engineering (ASE), 2015.
[C14] Scalable Parallelization of Specification Mining, Shaowei Wang, David Lo, Lingxiao Jiang, book chapter, The Art and Science of Analyzing Software Data, 2015.
[C13] Active Code Search: Incorporating User Feedback to Improve Code Search Relevance, Shaowei Wang, David Lo, Lingxiao Jiang, 29th IEEE/ACM International Conference on Automated Software Engineering (ASE), 2014.
[C12]Compositional Vector Space Models for Improved Bug Localization, Shaowei Wang, David Lo, Julia Lawall, 30th International Conference on Software Maintenance and Evaluation (ICSME), 2014. [code]
[C11] An Enhanced Tag Recommendation System for Software Information Sites, Shaowei Wang, David Lo, Bogdan Vasilescu, Alexander Serebrenik, 30th International Conference on Software Maintenance and Evaluation (ICSME), 2014. [code]
[C10][best paper nomination], Version History, Similar Report, and Structure: Putting Them Together for Improved Bug Localization, Shaowei Wang, David Lo, 22nd IEEE International Conference on Program Comprehension (ICPC), 2014. [Code]
[J1] AutoQuery: Automatic Construction of Dependency Queries for Code Search, Shaowei Wang, David Lo, Lingxiao Jiang, Journal of Automated Software Engineering (ASEJ), 2014.
[C9] An Empirical Study on Developer Interactions in StackOverflow, Shaowei Wang, David Lo, Lingxiao Jiang, 28th ACM SIGAPP Symposium On Applied Computing (SAC), 2013.
[C8] Automatic Recommendation of API Methods from Feature Requests, Ferdian Thung, Shaowei Wang, David Lo, and Julia Lawall, 28th IEEE/ACM International Conference on Automated Software Engineering (ASE), 2013.
[C7] Empirical Evaluation of Bug Linking, Tegawende F. Bissyande, Ferdian Thung, Shaowei Wang, David Lo, Lingxiao Jiang, Laurent Reveillere, 17th European Conference on Software Maintenance and Reengineering (CSMR), 2013.
[C6] Multi-Abstract Concern Localization, Tien-Duy B. Le, Shaowei Wang, and David Lo, 29th IEEE International Conference on Software Maintenance (ICSM), 2013.
[C5] Semantically Related Software Terms and Their Taxonomy By Leveraging Collaborative Tagging, Shaowei Wang, David Lo, Lingxiao Jiang, 28th IEEE International Conference on Software Maintenance (ICSM), 2012. [code]
[C4] [Most influencial paper]An Empirical Study of Bugs in Machine Learning Systems, Ferdian Thung, Shaowei Wang, David Lo, Lingxiao Jiang, 23rd IEEE International Symposium on Software Reliability Engineering (ISSRE), 2012. (Test of Time award)
[C3] Code Search via Topic-Enriched Dependency Graph Matching, Shaowei Wang, David Lo, Lingxiao Jiang, 18th IEEE Working Conference on Reverse Engineering (WCRE), 2011.
[C2] Concern Localization using Information Retrieval: An Empirical Study on Linux Kernel, Shaowei Wang, David Lo, Zhenchang Xing, Lingxiao Jiang, 18th IEEE Working Conference on Reverse Engineering (WCRE), 2011.
[C1] Search-Based Fault Localization, Shaowei Wang, David Lo, Lingxiao Jiang, LUCIA, Hoong Chuin Lau, 11/2011, 26th IEEE/ACM International Conference on Automated Software Engineering (ASE), 2011.