Poster & Demos
Poster Presentations
Liang Zheng: Label-free model evaluation
Bryan Russell: Language-guided music recommendation for video via prompt analogies
Yong Jae Lee: GLIGEN: Open-Set Grounded Text-to-Image Generation
Jonghyun Choi: Visual Recognition in Practical Scenarios
Efstratios Gavves: Causal Deep Learning for Foundational Embodied AI
Lamberto Ballan: Exploiting Proximity-Aware Tasks for Embodied Social Navigation
Seunghoon Hong: Universal Few-shot Learning of Dense Visual Prediction
Yannis Kalantidis: Fake it till you make it: Learning transferable representations from synthetic ImageNet clones
Ishan Misra: Cut and Learn for Unsupervised Object Detection and Instance Segmentation
Lu Jiang: The Dull Child Paradox in Foundation Models
Rohit Girdhar: ImageBind: One Embedding to Bind Them All
John Collomosse: Image Provenance: To Authenticity and Beyond!
Shaodi You: Physics Modeling for Outdoor Computer Vision
Tianfu Wu: ArtiHippo: Learning to Grow Artificial Hippocampi in Vision Transformers for Resilient Lifelong Learning
Varun Jampani: 3D of Everything
Vittorio Ferrari: Connecting Vision and Language with Video Localized Narratives
Wanli Ouyang: FengWu: Pushing the Skillful Global Medium-range Weather Forecast beyond 10 Days Lead
Michael Wray: Understanding Video-Text Retrieval in Long-Form Videos
Jian Wang: Computational imaging at Snap
Saurabh Gupta: Scaling up Robot Learning by Understanding Videos
Xinlei Chen: R-MAE: Regions Meet Masked Autoencoders
Matthew Blaschko: Surrogate Model Extension (SME): A Fast and Accurate Weight Update Attack on Federated Learning
Miaomiao Liu: Scalable 3D Object Centric Learning
Michael Maire: Unsupervised Segmentation with Diffusion Models
Neill Campbell: Structured Uncertainty in Generative Models
Fernando De la Torre: Zero-Shot Model Diagnosis
Zsolt Kira: Continual Open-World Learning in the Era of Foundation Models
Fuxin Li: AutoFocusFormer: Image Segmentation off Grid
Hiroshi Kawasaki: Challenges on Extreme 3D sensing
Piotr Koniusz: Contrastive learning with GNNs
Yezhou Yang: Decentralized User Attribution and Latent Fingerprinting in Image Generative Models
Boqing Gong: On the evaluation and calibration of vision foundation models
Alex Toshev: On Robustness in Multimodal Learning
Andrew Owens: EXIF as Language: Learning Cross-Modal Associations Between Images and Camera Metadata
Andre Araujo: Local-Features++: Deformable, 3D-aware, Scalable retrieval
Demo Presentations
Mike Zheng Shou: Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Qifeng Chen: A Toolkit for Personalized Text-to-Video Generation and Editing
Preparation Instructions
Posters will be the same size as for the main conference: https://cvpr2023.thecvf.com/Conferences/2023/PosterPrintingInformation