14.00 - 14.10: Opening
14.10 - 14.40: Gul Varol
Towards Open-Vocabulary Sign Language Recognition
14.40 - 15.00: Spotlights (Presentation Papers)
VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space
Guénolé Fiche, Simon Leglaive, Xavier Alameda-Pineda, Antonio Agudo, Francesc Moreno-Noguer
GestSync: Determining who is speaking without a talking head
Sindhu B Hegde, Andrew Zisserman
Multi-HMR: Multi-Person Whole-Body Human Mesh Recovery in a Single Shot
Fabien Baradel, Matthieu Armando, Salma Galaaoui, Romain Brégier, Philippe Weinzaepfel, Grégory Rogez, Thomas Lucas
PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation
Ginger Delmas, Philippe Weinzaepfel, Francesc Moreno-Noguer, Grégory Rogez
SPAMming Labels: Efficient Annotations for the Trackers of Tomorrow
Orcun Cetintas, Tim Meinhardt, Guillem Brasó, Laura Leal-Taixé
15.00 - 15.30: Cristian Sminchisescu
15.30 - 16.30: Coffee Break and Poster Session
Leveraging key-points Encoded Human Pose Images for Human Activity Recognition
Gaia Dobici, Luca Minutillo, Ermanno Cordelli, Paolo Soda, Francesco Chirico, Goffredo Foglia
Enhancing Thermal MOT: A Novel Box Association Method Leveraging Thermal Identity and Motion Similarity
Wassim El Ahmar, Dhanvin Kolhatkar, Farzan Nowruzi, Robert Laganiere
Multi-Camera Industrial Open-Set Person Re-Identification and Tracking
Federico Cunico, Marco Cristani
Enhanced Action Quality Assessment with Two-Stream Pose and Video Feature Integration
Yanting Zhang, Li Xia, Wenguang Zeng, Shuai Yu, Zijian Wang, Zhijun Fang
PAFUSE: Part-based Diffusion for 3D Whole-Body Pose Estimation
Nermin Samet, Cédric Rommel, David Picard, Eduardo Valle
Enhancing Gait Recognition: Data Augmentation via Physics-Based Biomechanical Simulation
Mritula Chandrasekaran, Jarek M Francik, Dimitrios Makris
Depth-based Privileged Information for Boosting 3D Human Pose Estimation on RGB
Alessandro Simoni, Francesco Marchetti, Guido Borghi, Federico Becattini, Davide Davoli, Lorenzo Garattoni, Gianpiero Francesca, Lorenzo Seidenari, Roberto Vezzani
VATE: a Large Scale Multimodal Spontaneous Dataset for Affective Evaluation
Francesco Agnelli, Giuliano Grossi, Alessandro D'Amelio, Raffaella Lanzarotti, Marco De Paoli
Guidelines for Query and Gallery Image Extraction in Person Re-Identification Systems
Rita Delussu, Lorenzo Putzu, Giorgio Fumera
Pose-independent 3D Anthropometry from Sparse Data
David Bojanić, Stefanie Wuhrer, Tomislav Petković, Tomislav Pribanic
Exploring 3D Face Reconstruction and Fusion Methods for Face Verification: A Case-Study in Video Surveillance
Simone Maurizio La Cava, Sara Concas, Ruben Tolosana, Roberto Casula, Giulia Orrù, Martin Drahansky, Julian Fierrez, Gian Luca Marcialis
FlexControl: Flexible and Efficient Full-Body Controllable Text-to-Motion Generation
Qingyuan Liu, Ke Lu, Zehai Niu, Kun Dong, Jian Xue, Jinbao Wang, Xiaoyu Qin
Coarse to Fine Human Mesh Recovery with Transformers
Vatsal Agarwal, Mara Levy, Max Ehrlich, Youbao Tang, Ning Zhang, Abhinav Shrivastava
Real-time Motion Reconstruction via Human Anatomy Diffusion from Sparse Tracking
Zehai Niu, Ke Lu, Kun Dong, Jian Xue, Xiaoyu Qin, Jinbao Wang
A vision-based framework for human behavior understanding in industrial assembly lines
Konstantinos Papoutsakis, Nikolaos Bakalos, Konstantinos Fragkoulis, Athena Zacharia, Georgia Kapetadimitri, Maria Pateraki
Boosting Pose Estimators via Cross-Representation Distillation
Kang Liu, Zhendong Yang, Jingyun Zhang, Jun Wang, Shaoming Wang, Chun Yuan, Rizen Guo
Upper-Body Pose-based Gaze Estimation for Privacy-Preserving 3D Gaze Target Detection
Andrea Toaiari, Vittorio Murino, Marco Cristani, Cigdem Beyan
16.30 - 17.20: Oral Presentations
EPOCH: Jointly Estimating the 3D Pose of Cameras and Humans
Nicola Garau, Giulia Martinelli, Niccolò Bisagno, Denis Tome, Carsten Stoll
HybridFormer: Bridging Local and Global Spatio-Temporal Dynamics for Efficient Skeleton-Based Action Recognition
Zeyun Zhong, Tianrui Li, Manuel Martin, Mickael Cormier, Chengzhi Wu, Frederik Diederichs, Jürgen Beyerer
MPL: Lifting 3D Human Pose from Multi-view 2D Poses
Seyed Abolfazl Ghasemzadeh, Alexandre Alahi, Christophe De Vleeschouwer
THP3D: Text-Driven Multi-Granularity 3D Human Parsing
Keito Suzuki, Bang Du, Kunyao Chen, Runfa Li, Truong Nguyen
ROMEO: Revisiting Optimization Methods for Reconstructing 3D Human-Object Interaction Models From Images
Alexey Gavryushin, Yifei Liu, Daoji Huang, Yen-Ling Kuo, Julien Valentin, Luc Van Gool, Otmar Hilliges, Xi Wang
17.20 - 17.30: Best Paper Award and Closing
Research Scientist and Engineering Manager
Google DeepMind
Professor
Lund University