Schedule (June 11th 8:25am - 12:30pm)
Best Paper Award:
"Are Vision-Language Models Ready for Dietary Assessment? Exploring the Next Frontier in AI-Powered Food Recognition"
Sergio Romero Tapiador (Universidad Autonoma de Madrid); Ruben Tolosana (Universidad Autonoma de Madrid); Blanca Lacruz-Pleguezuelos (IMDEA Food Institute); Laura Judith Marcos-Zambrano (IMDEA Food Institute); Guadalupe X. Bazán (IMDEA Food Institute); Isabel Espinosa-Salinas (IMDEA Food Institute); Julian Fierrez (Universidad Autonoma de Madrid); Javier Ortega-Garcia (Universidad Autonoma de Madrid); Enrique Carrillo de Santa Pau (IMDEA Food Institute); Aythami Morales (Universidad Autonoma de Madrid)
Best Paper Runner-up Award:
"Stochastic-based Patch Filtering for Few-Shot Learning"
Javier Ródenas Cumplido (Universitat de Barcelona); Eduardo Aguilar (Universitat de Barcelona); Petia Radeva (Universitat de Barcelona)
Accepted Full Papers (Workshop Proceedings at https://openaccess.thecvf.com/CVPR2025_workshops/MTF)
Our workshop is assigned poster boards #55 - #75 in ExHall D from 10:30 to 12:00
Orals
#55. Are Vision-Language Models Ready for Dietary Assessment? Exploring the Next Frontier in AI-Powered Food Recognition [Poster]
Sergio Romero Tapiador (Universidad Autonoma de Madrid); Ruben Tolosana (Universidad Autonoma de Madrid); Blanca Lacruz-Pleguezuelos (IMDEA Food Institute); Laura Judith Marcos-Zambrano (IMDEA Food Institute); Guadalupe X. Bazán (IMDEA Food Institute); Isabel Espinosa-Salinas (IMDEA Food Institute); Julian Fierrez (Universidad Autonoma de Madrid); Javier Ortega-Garcia (Universidad Autonoma de Madrid); Enrique Carrillo de Santa Pau (IMDEA Food Institute); Aythami Morales (Universidad Autonoma de Madrid)
#56. Stochastic-based Patch Filtering for Few-Shot Learning
Javier Ródenas Cumplido (Universitat de Barcelona); Petia Radeva (Universitat de Barcelona); Eduardo Aguilar (Universitat de Barcelona)
Posters
#57. Synthetic Data Augmentation using Pre-trained Diffusion Models for Long-tailed Food Image Classification [Poster] [Arxiv]
GaYeon Koh (Korea University); Hyun-Jic Oh (Korea University); Jeonghyun Noh (Korea University); Won-Ki Jeong (Korea University)
#58. Extra-Lightweight AI-Based Privacy Preserving Framework for Egocentric Wearable Cameras [Poster]
Long Li (University of Alabama); Fengqing Zhu (Purdue University); Heather Eicher-Miller (Purdue University); Graham Thomas (Brown University); Yuning Huang (Purdue University); Edward Sazonov (University of Alabama)
#59. Privacy Preserving Ordinal-Meta Learning with VLMs for Fine-Grained Fruit Quality Prediction [Poster]
Riddhi Jain (TCS-Research); Manasi Patwardhan (TCS-Research); Aayush Mishra (TCS-Research); Parijat Deshpande (TCS-Research); Beena Rai (TCS-Research)
#60. Food Degradation Analysis Using Multimodal Fuzzy Clustering [Poster]
Julio Valdes (National Research Council Canada); Stephie Liu (National Research Council Canada); Shawn Yang (National Research Council Canada); Yuhao Chen (University of Waterloo); Alexander Wong (University of Waterloo); Pengcheng Xi (National Research Council Canada)
#61. VolTex: Food Volume Estimation using Text-Guided Segmentation and Neural Surface Reconstruction
Ahmad AlMughrabi (University of Barcelona); Umair Haroon (University of Barcelona); Ricardo Marques (Pompeu Fabra University); Petia Radeva (University of Barcelona)
#62. FoodVideoQA: A Novel Baseline Framework for Dietary Monitoring [Poster]
Siddharth Viswanath (University of Waterloo); Krish Shah (University of Waterloo); Pengcheng Xi (National Research Council Canada); Alexander Wong (University of Waterloo); Yuhao Chen (University of Waterloo)
#63. SAMJAM: Zero-Shot Video Scene Graph Generation for Egocentric Kitchen Videos [Poster] [Arxiv]
Joshua Li (University of Waterloo); Fernando Jose Pena Cantu (University of Waterloo); Emily Yu (University of Waterloo); Alexander Wong (University of Waterloo); Yuchen Cui (University of California Los Angeles); Yuhao Chen (University of Waterloo)
#64. Agro-Net: A Convolution-Attention Fusion based hyperspectral model for agro-food quality assessment
Ocean Monjur (University of Illinois Urbana Champaign); Md. Toukir Ahmed (University of Illinois Urbana Champaign); Md Wadud Ahmed (University of Illinois Urbana Champaign); Mohammed Kamruzzaman (University of Illinois Urbana Champaign)
#65. Decomposing Food Images for Better Nutrition Analysis: A Nutritionist-Inspired Two-Step Multimodal LLM Approach [Poster]
Pitikorn Khlaisamniang (AIAT); Kun Kerdthaisong (Faculty of Engineering, Thammasat School of Engineering, Thammasat University); Supasate Vorathammathorn (Department of Computer Engineering, Faculty of Engineering, King Mongkut’s University of Technology Thonburi); Teermade Thitseesaeng (National Health Security Office (NHSO)); Nutchanon Yongsatianchot (Faculty of Engineering, Thammasat School of Engineering, Thammasat University); Hirunkul Phimsiri (Computer Engineering and Digital Technology Department of Computer Engineering, Faculty of Engineering, Chulalongkorn University, PreceptorAI team, CARIVA); Amrest Chinkamol (Vidyasirimedhi Institute of Science and Technology, PreceptorAI team, CARIVA); Kanyakorn Veerakanjana (Faculty of Medicine Siriraj Hospital, Mahidol University, PreceptorAI team, CARIVA); Kaisorn Kachai (PreceptorAI team, CARIVA); Piyalitt Ittichaiwong (Faculty of Medicine Siriraj Hospital, Mahidol University, PreceptorAI team, CARIVA); Tossaporn Saengja (Faculty of Medicine Siriraj Hospital, Mahidol University, PreceptorAI team, CARIVA)
Accepted Extended Abstracts
#66. Explainable Zero-Shot Food Categorization Using Visual-Language Models [PDF]
János Horváth (Purdue University)
#67. FoodTrack: Estimating Handheld Food Portions with Egocentric Video [PDF] [Arxiv] [Poster]
Ervin Wang (University of Waterloo); Yuhao Chen (University of Waterloo)
#68. Food Type Recognition Using Vision Language Models [PDF]
Saeed Alahmari (Najran University); Tawfiq Salem (Purdue University); Yicheng Shi (Purdue University)
#69. Dietary Intake Estimation via Continuous 3D Reconstruction of Food [PDF] [Arxiv] [Poster]
Yin Hau Lee (University of Waterloo); Yuhao Chen (University of Waterloo)
#70. 6D Pose Estimation on Spoons and Hands [PDF] [Arxiv] [Poster]
Kevin Tan (University of Waterloo); Fan Yang (University of Waterloo); Yuhao Chen (University of Waterloo)
#71. Conversational Multimodal LLMs for Food Nutritional Information Retrieval: A Systematic Evaluation [PDF]
Gayatri Bhatambarekar (Virginia Tech); Abhijit Sarkar (Virginia Tech)