We employ NLP methods across diverse domains, encompassing healthcare, finance, and social media.
Our work in biomedical and clinical NLP includes pharmacovigilance event extraction from medical case reports, radiology report generation from X-ray images, biomedical QA, explainable drug-drug interaction extraction, health-related rumour veracity assessment, topical phrase extraction from clinical reports, and understanding patient reviews.
Zhaoyue Sun, Jun Wang, Linhai Zhang, Jiazheng Li, Gabriele Pergola, Lin Gui, Yulan He
Event-Centric Framework for Natural Language Understanding (Jan 2021-Dec 2025), Turing AI Fellowship, funded by the UKRI.
L. Zhang, Z. Gao, D. Zhou and Y. He. Explainable Depression Detection in Clinical Interviews with Personalized Retrieval-Augmented Generation. arXiv:2503.01315, 2025.
Z. Sun, J. Li, G. Pergola and Y. He. ExDDI: Explaining Drug-Drug Interaction Predictions with Natural Language. The 39th Annual AAAI Conference on Artificial Intelligence (AAAI), 2025.
A. Bobrov, D. Saltenis, Z. Sun, G. Pergola and Y. He. DrugWatch: A Comprehensive Multi-Source Data Visualisation Platform for Drug Safety Information. The 62nd Annual Meeting of the Association for Computational Linguistics (ACL), 2024. [Video]
Z. Sun, J. Li, G. Pergola, B.C. Wallace, and Y. He. Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study. The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2024.
J. Wang, A. Bhalerao, T. Yin, S. See, and Y. He. CAMANet: Class Activation Map Guided Attention Network for Radiology Report Generation, IEEE Journal of Biomedical and Health Informatics, to appear.
J. Wang, A. Bhalerao, L. Zhu, and Y. He. Can Prompt Learning Benefit Radiology Report Generation? arXiv:2308.16269, 2023.
J. Lu, J. Li, B.C. Wallace, Y. He and G. Pergola. NapSS: Paragraph-level Medical Text Simplification via Narrative Prompting and Sentence-matching Summarization. Findings of EACL, 2023.
Z. Sun, J. Li, G. Pergola, B.C. Wallace, B. John, N. Greene, J. Kim and Y. He. PHEE: A Dataset for Pharmacovigilance Event Extraction from Text. The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP), Dec. 2022.
J. Wang, A. Bhalerao and Y. He. Cross-modal Prototype Driven Network for Radiology Report Generation. The 17th European Conference on Computer Vision (ECCV), Tel Aviv, Israel, Oct. 2022.
L. Gui and Y. He. Understanding Patient Reviews with Minimum Supervision, Artificial Intelligence in Medicine, to appear.
G. Pergola, E. Kochkina, L. Gui, M. Liakata and Y. He. Boosting Low-Resource Biomedical QA via Entity-Aware Masking Strategies, The 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Apr. 2021.
D. Zhou, L. Miao and Y. He. Position-Aware Deep Multi-Task Learning for Drug-Drug Interaction Extraction. Artificial Intelligence in Medicine, 87:1-8, 2018.
G. Pergola, Y. He and D. Lowe. Topical Phrase Extraction from Clinical Reports by Incorporating both Local and Global Context, The 2nd AAAI Workshop on Health Intelligence, New Orleans, Louisiana, USA, Feb. 2018.
Work in NLP for Finance includes financial event extraction from financial statements, opinion mining of customer reviews, ESG report analysis, and causal inference from earnings call transcripts.
Event-Centric Framework for Natural Language Understanding (Jan 2021-Dec 2025), Turing AI Fellowship, funded by the UKRI.
Xinyu Wang, Yuxiang Zhou, Runcong Zhao, Lin Gui, Yulan He
X. Wang, L. Gui and Y. He. A Scalable Framework for Table of Contents Extraction from Complex ESG Annual Reports, The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), Singapore, Dec. 2023.
Y. Zhou and Y. He. Causal Inference from Text: Unveiling Interactions between Variables. Findings of EMNLP, 2023.
X. Wang, L. Gui and Y. He. Document-Level Multi-Event Extraction with Event Proxy Nodes and Hausdorff Distance Minimization. The 61st Annual Meeting of the Association for Computational Linguistics (ACL), Toronto, Canada, Jul. 2023.
R. Zhao, L. Gui and Y. He. CONE: Unsupervised Contrastive Opinion Extraction. The 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), July 2023.
R. Zhao, L. Gui, H. Yan and Y. He. Tracking Brand-Associated Polarity-Bearing Topics in User Reviews. Transactions of the Association for Computational Linguistics, Vol. 11, pp. 404-418, 2023.
In social media analysis, our research covers understanding microblog conversations, Twitter sentiment analysis, cyberbullying detection, event extraction and visualisation on Twitter, and analysis of persuasive argumentation in political debates.
Miguel Arana-Catania, Lin Gui, John Dougrez-Lewis, Wenjia Zhang, Lixing Zhu, Yulan He
Y. Zhang, Y. He and D. Zhou. Rehearse With User: Personalized Opinion Summarization via Role-Playing based on Large Language Models. arXiv:2503.00449, 2025.
Y. Zhang, Y. Lai, Z. Wang, P. Li, D. Zhou and Y. He. Opinions Are Not Always Positive: Debiasing Opinion Summarization With Model-Specific and Model-Agnostic Methods. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), 2024.
L. Zhu, R. Zhao, G. Pergola and Y. He. Disentangling Aspect and Stance via a Siamese Autoencoder for Aspect Clustering of Vaccination Opinions. Findings of ACL, 2023.
R. Zhao, L. Gui and Y. He. CONE: Unsupervised Contrastive Opinion Extraction. The 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), July 2023.
W. Zhang, L. Gui, R. Procter and Y. He. NewsQuote: A Dataset Built on Quote Extraction and Attribution for Expert Recommendation in Fact-Checking. The 17th International AAAI Conference on Web and Social Media: News Media and Computational Journalism Workshop, Jun. 2023.
R. Zhao, M. Arana-Catania, L. Zhu, E. Kochkina, L. Gui, A. Zubiaga, R. Procter, M. Liakata and Y. He. PANACEA: An Automated Misinformation Detection System on COVID-19, EACL system demonstration track, 2023.
R. Zhao, L. Gui, H. Yan and Y. He. Tracking Brand-Associated Polarity-Bearing Topics in User Reviews. Transactions of the Association for Computational Linguistics, Vol. 11, pp. 404-418, 2023.
L. Zhu, Z. Fang, G. Pergola, R. Procter and Y. He. Disentangled Learning of Stance and Aspect Topics for Vaccine Attitude Detection in Social Media. 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), Jul. 2022.
J. Dougrez-Lewis, M. Arana-Catania, E. Kochkina, M. Liakata and Y. He. PHEMEPlus: Enriching Social Media Rumour Verification with External Evidence. The 5th FEVER Workshop, co-located with ACL, May 2022.
S. Salawu, J. Lumsden and Y. He. A Mobile-Based System for Preventing Online Abuse and Cyberbullying. International Journal of Bullying Prevention, to appear.
W. Zhang, L. Gui and Y. He. Supervised Contrastive Learning for Multi-modal Unreliable News Detection in COVID-19 Pandemic, The 30th ACM International Conference on Information and Knowledge Management (CIKM), Nov. 2021.
J. Dougrez-Lewis, E. Kochkina, M. Liakata and Y. He. Learning Disentangled Latent Topics for Twitter Rumour Veracity Classification, Findings of ACL, 2021.
M. Arana-Catania, F. A. Van Lier, R. Procter, N. Tkachenko, Y. He, A. Zubiaga and M. Liakata. Citizen Participation and Machine Learning for a Better Democracy. Digital Government: Research and Practice, to appear.
S. Salawu, J. Lumsden and Y. He. A Large-Scale English Multi-Label Twitter Dataset for Cyberbullying and Online Abuse Detection. Proceedings of the 5th Workshop on Online Abuse and Harms, co-located with ACL, 2021. [dataset]
L. Zhu, Y. He and D. Zhou. Neural Temporal Opinion Modelling for Opinion Prediction on Twitter. The 58th Annual Meeting of the Association for Computational Linguistics (ACL), Jul. 2020.
L. Zhu, Y. He and D. Zhou. Neural Opinion Dynamics Model for the Prediction of User-Level Stance Dynamics. Information Processing and Management, 57(2):102031, 2020.
J. Zeng, J. Li, Y. He, C. Gao, M. Lyu and I. King. What Changed Your Mind: The Roles of Dynamic Topics and Discourse in Argumentation Process. The Web Conference (WWW), Taipei, Apr. 2020.
S. Salawu, Y. He and J. Lumsden. BullStop: A Mobile App for Cyberbullying Prevention, The 28th International Conference on Computational Linguistics (COLING), Dec. 2020. [demo]
J. Zeng, J. Li, Y. He, C. Gao, M. Lyu and I. King. What You Say and How You Say it: Joint Modeling of Topics and Discourse in Microblog Conversations. Transactions of the Association for Computational Linguistics (TACL), 7:267-281, 2019.
L. Zhu, Y. He and D. Zhou. Hierarchical Viewpoint Discovery from Tweets Using Bayesian Modelling. Expert Systems with Applications, 116:430-438, 2019.
U. Orizu and Y. He. Content-Based Conflict-of-Interest Detection on Wikipedia, The 11th International Conference on Language Resources and Evaluation (LREC), Miyazaki, Japan, May 2018.
W.X. Zhao, W. Zhang, Y. He, X. Xie and J.-R. Wen. Automatically Learning Topics and Difficulty Levels of Problems in Online Judge Systems. ACM Transactions on Information Systems, Vol. 36, No. 3, Article 27, 2018.