Regular papers accepted for oral or poster presentations will appear in the ICCV 2025 Workshops proceedings.
Oral presentations
OscNet v1.5: Energy Efficient Hopfield Network on CMOS Oscillators for Image Classification
Wenxiao Cai, Zongru Li, Iris Wang, Yu-Neng Wang, Thomas H. Lee
(Stanford University, Carnegie Mellon University)
Human Vision Constrained Super-Resolution
Volodymyr Karpenko, Taimoor Tariq, Jorge Condor, Piotr Didyk
(Universita della Svizzera Italiana)
From Neural Activity to Computation: Biological Reservoirs for Pattern Recognition in Digit Classification
Ludovico Iannello, Luca Ciampi, Fabrizio Tonelli, Gabriele Lagani, Lucio Maria Calcagnile, Federico Cremisi, Angelo Di Garbo, Giuseppe Amato
(CNR, School of Education Pisa)
Co-Visibility ReasONing on Sparse Image Sets of Indoor Scenes
Chao Chen, Nobel Dang, Juexiao Zhang, Wenkai Sun, Pengfei Zheng, Xuhang He, Yimeng Ye, Jiasheng Zhang, Taarun Srinivas, Chen Feng
(New York University, Clemson University)
Human-Inspired Summarization: Cluster Scene Videos into Diverse Frames
Chao Chen, Mingzhi Zhu, Yu Yan, Ankush Pratap Singh, Felix Juefei-Xu, Chen Feng
(New York University, New York Institute of Technology, Meta)
SeeEEG: Semantic-aware EEG-based Multi-Modal Retrieval-Augmented Generation for High-Fidelity Visual Brain Decoding
Jun-Mo Kim, Woohyeok Choi, Sang-Jun Park, Keun-Soo Heo, Young-Han Son, Ji-Hye Oh, Dong-Hee Shin, Tae-Eui Kam
(Korea University)
Poster presentations
Long-Tailed Data Classification by Increasing and Decreasing Neurons During Training
Taigo Sakai, Kazuhiro Hotta
(Meijo University)
Towards Human-Like Invariance: Self-Supervised Learning with Feature-Level Rotation Alignment
Sangjun Han, Woojin Cheong, Chang hoon Song, Myungjoo Kang
(Seoul National University)
Using Human Perception to Regularize Transfer Learning
Steve Cruz, Justin Dulay, Walter Scheirer
(University of Notre Dame, Struction)
HiSS: Human-inspired Semantic Segmentation for Vehicle Interior Scene Understanding
Aleksander Kostuch, Joanna Jaworek-Korjakowska
(AGH University of Science and Technology)
AttZoom: Attention Zoom for Better Visual Features
Daniel DeAlcala, Aythami Morales, Julian Fierrez, Ruben Tolosana
(Universidad Autónoma de Madrid)
Who Walks With You Matters: Perceiving Social Interactions with Groups for Pedestrian Trajectory Prediction
Ziqian Zou, Conghao Wong, Beihao Xia, Xinge You
(Huazhong University of Science and Technology)
Exploring Human-Model Alignment in Visual Social Attention During Help-and-Hinder Social Interaction Classification
Lucia Schiatti, Guido Vallarino, Sabrina Megan Lopez, Yen-Ling Kuo, Matteo Moro, Mengmi Zhang, Monica Gori, Alessio Del Bue, Boris Katz, Andrei Barbu
(Istituto Italiano di Tecnologia, University of Genoa, University of Virginia, Nanyang Technological University, Massachusetts Institute of Technology)
Extended abstracts accepted for poster presentations will be accessible via their link reported for each contribution below.
CAVIS: Context-Aware Video Instance Segmentation
Seunghun Lee, Jiwan Seo, Kiljoon Han, Minwoo Choi, Sunghoon Im
(Daegu Gyeongbuk Institute of Science and Technology)
Neural Ganglion Sensors: Learning Task-specific Event Cameras Inspired by the Neural Circuit of the Human Retina
Haley M. So, Gordon Wetzstein
(Stanford University)
Concept-guided Image-to-Image Retrieval via conditioned similarity in Vision-Language Model
Sohwi Lim, Lee Hyoseok, Tae-Hyun Oh
(Pohang University of Science and Technology, Korea Advanced Institute of Science & Technology)
A Cortically-Inspired Recurrent Network for Dynamic Saliency Prediction
Bayron Jossue Serrano Mena
(Universidad de Costa Rica)
Make Me Happier: Evoking Emotions Through Image Diffusion Models
Qing Lin, Jingfeng Zhang, Yew-Soon Ong, Mengmi Zhang
(A*STAR, RIKEN, Nanyang Technological University, )
Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction
Giuseppe Cartella, Vittorio Cuculo, Alessandro D'Amelio, Marcella Cornia, Giuseppe Boccignone, Rita Cucchiara
(University of Modena and Reggio Emilia, University of Milan)
Divided Attention: Unsupervised Multi-object Discovery by Motion with Contextually Separated Slots
Dong Lao, Zhengyang Hu, Francesco Locatello, Yanchao Yang, Stefano Soatto
(Louisiana State University, Hong Kong University, Institute of Science and Technology, Amazon Web Services)
Better, But Not Sufficient: Testing Video ANNs Against Macaque IT Dynamics
Matteo Dunnhofer, Christian Micheloni, Kohitij Kar
(York University, University of Udine)
Human-Prior Correction: Post-hoc Calibration that Aligns Vision Models with Human Uncertainty
Anitej Thamma, Shreyas Krishnan
(Harvard University)
Objects in Focus: Predicting Object-Based Attention from Spatial Features
Elizabeth H Hall, Zoe Loh
(University of California - Davis, University of California - Merced)
Learning Physics Like Humans: Uncovering Physical Laws from Monocular Videos
Mohamed Rayan Barhdadi, Hussein Alnuweiri, Hasan Kurban
(Texas A&M University, Hamad Bin Khalifa University)