Accepted Work
Oral Presentations
The works selected for posters will be a part of CVPR workshop proceedings. Authors of accepted posters will present their posters in person.
Sign Language Translation for Instructional Videos. Laia Tarrés, Gerard I. Gállego, Amanda Duarte, Jordi Torres, Xavier Giró-i-Nieto.
Underwater Moving Object Detection using an End-to-End Encoder-Decoder Architecture and GraphSage with Aggregator and Refactoring. Meghna Kapoor, Suvam Patra, Badri Narayan Subudhi, Vinit Jakhetiya, Ankur Bansal.
Dense Multitask Learning to Reconfigure Comics. Deblina Bhattacharjee, Sabine Süsstrunk,Mathieu Salzmann.
Perception Over Time: Temporal Dynamics for Robust Image Understanding. Maryam Daniali, Edward Kim.
Nonverbal Communication Cue Recognition: A Pathway to More Accessible Communication. Zoya Shafique, Haiyan Wang, YingLi Tian.
A Light-Weight Human Eye Fixation Solution for Smartphone Applications. Sudha Velusamy, Rakesh Radarapu, Anandavardhan Hegde, Narayan Kothari.
Poster Presentations
The works selected for posters will not be a part of CVPR workshop proceedings. Authors of accepted posters will present their posters in person.
A Dataset and System for Automated Rose Growth Monitoring. Risa Shinoda, Ko Motoki, Kensho Hara, Hirokatsu Kataoka, Ryohei Nakano, Tetsuya Nakazaki, Ryozo Noguchi.
A Multi-Institutional Open-Source Benchmark Dataset for Breast Cancer Clinical Decision Support using Synthetic Correlated Diffusion Imaging Data. Chi-en A Tai, Hayden Gunraj, Alexander Wong.
Transformers for Mobile Gait Biometrics. Paula Delgado-Santos, Ruben Tolosana, Richard M Guest, Ruben Vera-Rodriguez, Julian Fierrez.
Improving Data-Efficient Fossil Segmentation via Model Editing. Indu Panigrahi, Ryan A Manzuk, Adam C Maloof, Ruth C Fong.
Cancer-Net BCa-S: Breast Cancer Grade Prediction using Volumetric Deep Radiomic Features from Synthetic Correlated Diffusion Imaging. Chi-en A Tai, Hayden Gunraj, Alexander Wong.
Design a Delicious Lunchbox in Style. Yutong Zhou.
PWR-Align: Leveraging Part-Whole Relationships for Part-wise Rigid Point Cloud Registration in Mixed Reality Applications. Manorama Jha, Bhaskar Banerjee.
A Good Sampling Is All You Might Need. Mariona Carós, Ariadna Just, Santi Seguí, Jordi Vitria.
Labeling Interface for Perceptions of Street Quality. Emily Muller, Emily Gemmell, Ismam Choudhury, Ricky S Nathvani, Antje Barbara Metzler, James Bennett, Emily Denton, Seth Flaxman, Majid Ezzati.
Parametric Regularization Loss in Super-Resolution Reconstruction. Supatta Viriyavisuthisakul, Natsuda Kaothanthong, Parinya Sanguansat, Nguyen Le Minh, Choochart Haruechaiyasak, Toshihiko Yamasaki.
Rethinking Matching-based Few-shot Action Recognition. Juliette LD Bertrand, Yannis Kalantidis, Giorgos Tolias.
An Accurate AI-based Automatic Meter Reading Framework for Real-Time Gas Consumption Monitoring and Calculation. Nastaran Enshaei, Patrick Paul, Stéphane Tremblay, Farnoosh Naderkhani, Ashkan Ebadi.
Leveraging Improved Triplet Loss for Robust Action Segmentation. Elena Belén Bueno Benito, Mariella Dimiccoli.
Hierarchical Explanations for Video Action Recognition. Sadaf Gulshad, Teng Long, Nanne van Noord.
Are Facial Region Localization Models Biased? Surbhi Mittal, Kartik Thakral, Richa Singh, Mayank Vatsa.
Im2Hands: Learning Attentive Implicit Representation of Interacting Two-Hand Shapes. Jihyun Lee, Minhyuk Sung, Honggyu Choi, Tae-Kyun (T-K) Kim.
Overcoming Bias in Pretrained Models by Manipulating the Finetuning Dataset. Angelina Wang, Olga Russakovsky.
NutritionVerse-3D: A 3D Food Model Dataset for Nutritional Intake Estimation. Chi-en A Tai, Matthew E Keller, Mattie Kerrigan, Yuhao Chen, Saeejith Nair, Pengcheng Xi, Alexander Wong.
Towards Robust Image-in-Audio Deep Steganography. Jaume Ros Alonso, Margarita Geleta, Jordi Pons, Xavier Giro-i-Nieto.
Learning Deformable Templates for Brain MRI. Marianne Rakic, John Guttag, Adrian V Dalca.
Using Foundational Models to Improve Classification of Clinical Images in Dermatology. Emily Mu, Kathleen M Lewis, John Guttag.
ShapeShift: Superquadric-based Object Pose Estimation for Robotic Grasping. E. Zhixuan Zeng, Yuhao Chen, Alexander Wong.
RangeViT: Towards Vision Transformers for 3D Semantic Segmentation in Autonomous Driving. Angelika Ando, Spyros Gidaris, Andrei Bursuc, Gilles Puy, Alexandre Boulch, Renaud Marlet.
LightNet: Generative Model for Enhancement of Low-Light Images. Chaitra D Desai, Nikhil Akalwadi, Amogh Mukund Joshi, Sampada M Malagi, Chinmayee P Mandi, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi.
Positive-Augmented Constrastive Learning for Image and Video Captioning Evaluation. Sara Sarto, Manuele Barraco, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara.
Towards Implicit Representation of a Pose: a New Improved Pipeline of Augmented Autoencoder. Elena Govi, Davide Sapienza, Carmelo Scribano, Giorgia Franchini, Marko Bertogna.
FLEX: Full-Body Grasping Without Full-Body Grasps. Purva Tendulkar, Dídac Surís, Carl Vondrick.
Schrödinger's Camera: First Steps Towards a Quantum-Based Privacy Preserving Camera. Hannah M Kirkland, Sanjeev Koppal.
PRISM: Probabilistic Interactive Segmentation for Medical Images. Hallee E Wong, John Guttag, Adrian V Dalca.
Localized Contrastive and Attention Based Multiple Instance Learning for Automatic Staging of Histopathology Images. Narmada Naik, Angela Crabtree, Carlo Bifulco, Brian Piening, Ganapati Srinivasa, Kevin L Matlock.
GoalieNet: A Multi-Stage Network for Joint Goalie, Equipment, and Net Pose Estimation in Ice Hockey. Fatemeh Shahi, David A Clausi, Alexander Wong.
Compositional Learning for Attribute and Objects. Nirat Saini.
A Compact Deep Learning Image Classification Model with Feature Selection for Generating Effective and Explainable Heatmaps. Luna M Zhang.
AI Assisted Silicosis Detection Among Stone Workers. Yasmeena Akhter, Rishabh Ranjan, Richa Singh, Mayank Vatsa.