CVPR Dual
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos [Spotlight & Best Paper]
Wenbo Hu, Xiangjun Gao, Xiaoyu Li, Sijie Zhao, Xiaodong Cun, Yong Zhang, Long Quan, Ying Shan
Open Ad-hoc Categorization with Contextualized Feature Learning
Zilin Wang, Sangwoo Mo, Stella Yu, Sima Behpour, Liu Ren
STPro: Spatial and Temporal Progressive Learning for Weakly Supervised Spatio-Temporal Grounding [Spotlight]
Aaryan Garg, Akash Kumar, Yogesh S Rawat
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation [Spotlight]
Claudia Cuttano, Gabriele Trivigno, Gabriele Rosi, Carlo Masone, Giuseppe Averta
Accepted (Archival)
Hierarchical Semantic Segmentation with Autoregressive Language Modeling [Spotlight]
Josh Myers-Dean, Brian Price, Yifei Fan, Danna Gurari
Prompt-Guided Attention Head Selection for Focus-Oriented Image Retrieval
Yuji Nozawa, Yu-Chieh Lin, Kazumoto Nakamura, Youyang Ng
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements
Mustafa Arda Aydın, Efe Mert Çırpar, Elvin Abdinli, Gozde Unal, Yusuf H Sahin
Show or Tell? A Benchmark To Evaluate Visual and Textual Prompts in Semantic Segmentation [Spotlight]
Gabriele Rosi, Fabio Cermelli
Accepted (Non-Archival)
PixFoundation: Are We Heading in the Right Direction with Pixel-level Vision Foundation Models? [Paper]
Mennatullah Siam
Describe Anything: Detailed Localized Image and Video Captioning [Paper]
Long Lian, Yifan Ding, Yunhao Ge, Sifei Liu, Hanzi Mao, Boyi Li, Marco Pavone, Ming-Yu Liu, Trevor Darrell, Adam Yala, Yin Cui
Visually Consistent Hierarchical Image Classification [Paper]
Seulki Park, Youren Zhang, Stella Yu, Sara Beery, Jonathan Huang