12th CVPR Workshop on Medical Computer Vision
June 3 , 2026
Colorado Convention Center, Room 605, Denver, Colorado
June 3 , 2026
Colorado Convention Center, Room 605, Denver, Colorado
Full Keynote Series
Overview
Medical computer vision is reshaping the future of healthcare by uniting computer vision, machine learning, and clinical imaging into a shared scientific endeavor. Progress in this field is driven by cross-disciplinary collaboration, spanning algorithm design, imaging physics, robotics, and clinical translation, and is advancing our ability to understand, diagnose, and treat disease through intelligent visual reasoning. This workshop brings together leading experts from academia, healthcare, and industry whose research collectively spans foundation models, surgical vision, embodied AI, and data-centric learning. Through their perspectives, the event will promote open dialogue on key challenges in scalability, reliability, interpretability, and clinical integration. The broader impact of this workshop lies in shaping a unified research vision for trustworthy, human-centered, and globally accessible medical AI systems. By bridging technical innovation with clinical practice, it will catalyze new collaborations, establish shared benchmarks, and accelerate the translation of computer vision from research to real-world healthcare impact.
Keynote Schedule
Keynote session I
9:15–9:45 am Dimitris Metaxas
9:45–10:15 am Daguang Xu
10:15–10:45 am Sharon X. Huang
10:45–11:15 am Mathias Unberath
11:15–11:45 am Archana Venkataraman
Keynote session II
1:00-1:30 pm Maddie Traverse
1:30–2:00 pm Hoifung Poon
2:00–2:30 pm Jeremias Sulam
2:30–3:00 pm Kayhan Batmanghelich
3:00–3:30 pm Ehsan Adeli
3:30–4:00 pm Mert Sabuncu
Agenda Details
Opening Remarks
9:00-9:15 am
Keynote session I
9:15-9:45 am Dimitris Metaxas
Explainability, Generation, Physics and Dynamics in ML for Biomedical Applications
Dimitris Metaxas is a Board of Governors and Distinguished Professor in the Computer and Information Sciences Department at Rutgers University. He is directing the CBIM Center and the NSF University-Industry Collaboration Center CARTA, and is also a member of the RAD Collaboratory. Dr. Metaxas has been conducting research in the general area of spatiotemporal intelligence. The focus is the development of novel methods and algorithms upon which AI, machine learning, computer vision, medical image analysis, language and graphics/generative methods can advance synergistically in the presence of dynamic spatio-temporal multimodal data and domain knowledge. In biomedical image analysis he developed Machine Learning and deformable model-based methods for material modeling and shape estimation of internal organs from MRI, SPAMM and CT data, explainable diagnosis methods, cancer and cell, analytics. Dr. Metaxas has published over 800 research articles in these areas and has graduated over 80 PhD students, who occupy academic and industry positions. His research has been funded by NIH, NSF, AFOSR, ARO, DARPA, HSARPA, and the ONR. Dr. Metaxas work has received many best paper awards and he has 10 patents. He was awarded a Fulbright Fellowship in 1986, is a recipient of an NSF Research Initiation and Career awards, and an ONR YIP. He is a Fellow of the American Institute of Medical and Biological Engineers, a Fellow of IEEE and a Fellow of the MICCAI Society. He has been involved with the organization of all major conferences in computer vision and medical images analytics ( IEEE ICCV 2031, IEEE CVPR 2026, IEEE CVPR 2014, ICCV 2011, IPMI 2025, DDDAS 24&26, MICCAI 2008; PC ICCV 2007, FIMH 2011 and SCA 2007).
9:45-10:15 pm Daguang Xu
Driving AI Innovation in Healthcare through Open Data and Foundation Models
Daguang Xu is a Senior Research Manager at NVIDIA, where he leads research in healthcare AI and medical imaging. His work focuses on deep learning, computer vision, federated learning, and AI infrastructure for clinical applications. He is a core contributor to the open-source MONAI and NVIDIA FLARE platforms. Daguang Xu received his Ph.D. in Electrical and Computer Engineering from Johns Hopkins University and has published extensively in medical AI and imaging analysis.
10:15-10:45 am Sharon X. Huang
Advancing Diagnostic Robustness and Privacy-Preserving Model Training with Medical Image and Video Generation
Sharon Xiaolei Huang is the David Reese Professor and Department Head in the College of Information Sciences and Technology at Pennsylvania State University. Her research focuses on medical AI, computer vision, biomedical image analysis, and machine learning, with contributions to image and video generation, segmentation, and computer-assisted diagnosis. She has published more than 200 scholarly articles and serves in leadership and editorial roles for major AI and medical imaging conferences and journals.
10:45-11:15 am Mathias Unberath
Digital Twins for Ambient and Embodied Surgical AI
Mathias Unberath is the John C. Malone Associate Professor of Computer Science at Johns Hopkins University. He leads the ARCADE Lab, where his research focuses on AI, computer vision, robotics, and mixed reality for computer-assisted medicine. His work develops human-centered technologies to support clinical decision-making and image-guided interventions. He has received multiple honors, including the NSF CAREER Award and NIH Trailblazer Award.
11:15-11:45 pm Archana Venkataraman
Lightweight and Interpretable AI as a New Window into Brain Dysfunction
Archana Venkataraman is an Associate Professor in the Department of Electrical and Computer Engineering at Boston University, with joint appointments in Biomedical Engineering and Computer Science. Her research focuses on machine learning, artificial intelligence, medical imaging, and computational neuroscience, particularly for understanding brain disorders and mental health. She develops data-driven methods to study brain connectivity and multimodal biomedical data, with applications in neuroscience and precision medicine. Dr. Venkataraman has received several honors, including the NSF CAREER Award and multiple NIH-funded research awards.
Keynote session II
1:00-1:30 pm Maddie Traverse
MedGemma: an open vision-language model for diverse medical applications
Maddie Traverse is a Senior Software Engineer at Google Research, specializing in the intersection of deep learning, computer vision, and healthcare. Since 2023, Madeleine has focused on Google’s medical initiatives. She drove the machine learning development for Google's CT Foundation model and is a key contributor to MedGemma.Her broader work at Google includes designing graph neural networks for population health, advancing geospatial foundation models, and leading large-scale machine learning pipelines for personalized recommendation systems. Maddie has a long-standing focus on computer vision, having built deep learning classifiers for radiological evaluation and previously researched 3D GANs at UC Berkeley.
1:30-2:00 pm Hoifung Poon
Learning the Language of Patients: Multimodal Generative AI for Precision Health
Hoifung Poon is General Manager at Health Futures in Microsoft Research and an affiliated faculty at the University of Washington Medical School. He leads biomedical AI research and incubation, with the overarching goal of structuring medical data to optimize delivery and accelerate discovery for precision health.
2:00-2:30 pm Jeremias Sulam
Flexible methods for uncertainty quantification in medical imaging
Jeremias Sulam is an assistant professor in the Department of Biomedical Engineering, and is also affiliated with the Mathematical Institute for Data Science and the Center for Imaging Science. He holds a secondary appointment in the Department of Applied Mathematics and Statistics and is a member of the Data Science and AI Institute. Jeremias’ research focuses on the foundations of machine learning and applications to biomedical imaging. He is interested in learning under parsimonious structures in data, robustness, and ethical implications of data-driven methods, as well as in the interpretability and auditing of the resulting models. His work is motivated by applications of diagnostic imaging in radiology, inverse problems, and biomarker discovery in neuroscience and digital pathology.
2:30-3:00 pm Kayhan Batmanghelich
Image-Text Foundational for Volumetric Image: Challenges and Opportunities
Kayhan Batmanghelich is an Assistant Professor in the Department of Electrical and Computer Engineering at Boston University. His research focuses on medical image analysis and the broader application of artificial intelligence in healthcare. Previously, he was a faculty member in the Department of Biomedical Informatics at the University of Pittsburgh. He received his PhD from the University of Pennsylvania. He is a recipient of the NSF CAREER Award and a Google Academic Research Award.
3:00-3:30 pm Ehsan Adeli
From Bedside to Living Room: Reimagining Care Through Ambient Intelligence
Ehsan Adeli is an Assistant Professor of Psychiatry and Behavioral Sciences at Stanford University, with courtesy appointments in Computer Science and Biomedical Data Science. He directs the Stanford Translational AI (STAI) in Medicine and Mental Health Lab. His research lies at the intersection of artificial intelligence, computer vision, computational neuroscience, and digital health, with applications in mental health, aging, neurodegenerative disorders, and human behavior analysis. Dr. Adeli develops AI methods for multimodal biomedical data, including imaging, video, and sensor-based health monitoring.
3:30-4:00 pm Mert Sabuncu
Prevalence Adjustment as a Way to Handle Distribution Shift in Medical Vision
Mert R. Sabuncu received a PhD degree in Electrical Engineering from Princeton University, where his dissertation focused on entropy-based approaches to image registration. Mert then moved to the Massachusetts Institute of Technology for a post-doc with Polina Golland at the Computer Science and Artificial Intelligence Lab, where he worked on a range of biomedical image analysis problems, including the segmentation of brain MRI scans. After his post-doc at MIT, Mert spent a few years at the A.A Martinos Center for Biomedical Imaging (Massachusetts General Hospital and Harvard Medical School) as a junior faculty member, where he built a research program on algorithmic tools for integrating genetics and medical imaging. Today, Mert is a Professor in Electrical and Computer Engineering at Cornell University and Cornell Tech, in New York City. He also holds a dual appointment in Radiology at Weill Cornell Medicine, where he serves as the Vice Chair of AI and Engineering Research. His group develops machine learning based computational tools for biomedical imaging applications. He is a recipient of an NSF CAREER Award (2018) and an NIH Early Career Development Award (2011).
Keynote Speakers
Rutgers University
Nvidia
Penn State University
Johns Hopkins University
Microsoft
Boston University
Johns Hopkins University
Boston University
Stanford University
Cornell University
General Chairs
Zongwei Zhou
Yucheng Tang
Chenyu You
Scientific Committee
Alan Yuille
Curtis Langlotz
Yuankai Huo
Yang Yang
Kang Wang
Dong Yang
Organizer Committee
Pedro RAS Bassi
Yunhe Gao
Wenxuan Li
Shanshan Zhu
Sijing Li
Pengfei Guo
Yufan He
Can Zhao
Yuting He