Jiaqi Wang[1], Xiaochen Wang[1], Yuan Zhong[1], Ziyi Yin[1], Aofei Chang[1], Cao Xiao[2], Fenglong Ma[1]
[1] The Pennsylvania State University [2] GE Healthcare
AAAI 2025
Location: Room: Room 115C, Philadelphia Convention Center, Philadelphia, PA USA
Slides: https://drive.google.com/file/d/12mWUCqdcicqHagu4nCeR1YGFz-vVjlcr/view?usp=sharing
Multimodal learning has demonstrated its potential to offer deeper insights from data that spans various modalities. In the healthcare domain, designing multimodal AI systems to better understand multimodal healthcare data—thereby providing more comprehensive medical support—remains an emerging and relatively unexplored area. This tutorial will focus on exploring multimodal healthcare data, presenting recent advances in multimodal data fusion, pertaining, generation, and privacy-preserving techniques in healthcare, and finally discussing open challenges and future research directions in this rapidly evolving field. This tutorial welcomes researchers, healthcare professionals, students, engineers, and practitioners. The tutorial is designed to balance introductory and advanced material, with 50% geared toward beginners and 50% aimed at intermediate and advanced learners. By the end, attendees will gain a comprehensive understanding of multimodal AI, multimodal health informatics, cutting-edge research, and emerging research directions in this field.
Part 1: Multimodal AI in Healthcare (15 min)
Part 2: Multimodal Pre-training for Healthcare (60 min)
Preliminaries: Multimodal Pre-training
Medical Contrastive Pre-training
Medical Generative Pre-training
Part 3: Multimodal Health Data Generation (60 min + 30 min break)
GAN-based Models
VAE-based Models
Transformer-based Models
Diffusion-based Models
Part 4: Privacy-preserving Multimodal Healthcare Mining (60 min)
Preliminaries: Federated Learning
Multimodal Federated Learning for Healthcare
Federated Medical Foundation Models
Part 5: Conclusion, Future Work, and Q&A (15 min)
Dr. Fenglong Ma is currently an Assistant Professor in the College of Information Science and Technology at the Pennsylvania State University (PSU), leading the PSU Data Science Lab. He received his Ph.D. from the Department of Computer Science and Engineering, University at Buffalo (UB) in 2019, and subsequently joined PSU. His research interests lie in data mining and machine learning, with an emphasis on mining health-related data. His research interests also include federated learning, multimodal learning, health informatics, natural language processing, and security. He has publications in top conferences and journals such as KDD, NeurIPS, WWW, AAAI, IJCAI, ACL, EMNLP, CIKM, WSDM, ICDM, SDM, and TKDE. He was honored to be recognized as the awardee of the NSF CAREER Award, Sony Research Award, and UB CSE 2019 Best Ph.D. Dissertation Award. He was also recognized as AI 2000 Most Influential Scholar Honorable Mention in Data Mining (2022 and 2023) and 2022 Global Top 50 Chinese Rising Stars in Data Mining. More information can be found at his website: https://fenglong-ma.github.io.