List of Accepted Papers

Oral Presentations


PARASOL: Parametric Style Control for Diffusion Image Synthesis


Extending global-local view alignment for self-supervised learning with remote sensing imagery


RetinaLiteNet: A Lightweight Transformer based CNN for Retinal Feature Segmentation


ABC-CapsNet: Attention based Cascaded Capsule Network for Audio Deepfake Detection


GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic Hand Gesture Recognition


Unsupervised Domain Adaptation for Weed Segmentation Using Greedy Pseudo-labelling


Extended Abstracts


Translating Imaging to Genomics: Leveraging Transformers for Predictive Modeling


3D Change Detection by 2D Segmentation Masks


COVIDx CXR-4: An Expanded Multi-Institutional Open-Source Benchmark Dataset for Chest X-ray Image-Based Computer-Aided COVID-19 Diagnostics


Optimizing Synthetic Correlated Diffusion Imaging for Breast Cancer Tumour Delineation


Cancer-Net PCa-Gen: Synthesis of Realistic Prostate Diffusion-Weighted Imaging Data via Anatomic-Conditional Controlled Latent Diffusion


Motion Diversification Networks


Optimizing Split Points for Error-Resilient SplitFed Learning


Anomaly Score: Evaluating Generative Models and Individual Generated Images based on Complexity and Vulnerability


Tell me how far: Leveraging Depth in Pre-training for Semantic Segmentation


Test-Time Zero-Shot Temporal Action Localization


ST-Gait++: Leveraging spatio-temporal convolutions for gait-based emotion recognition on videos


Gaze-LLE: Simplifying Gaze Target Estimation by Leveraging an Already-Learned Encoder


ProTeCt: Prompt Tuning for Taxonomic Open Set Classification


Enhancing Clinically Significant Prostate Cancer Prediction in T2-weighted Images through Transfer Learning from Breast Cancer


Improving Breast Cancer Grade Prediction with Multiparametric MRI Created Using Optimized Synthetic Correlated Diffusion Imaging


Using Multiparametric MRI with Optimized Synthetic Correlated Diffusion Imaging to Enhance Breast Cancer Pathologic Complete Response Prediction


SEA-GWNN: Simple and Effective Adaptive Graph Wavelet Neural Network


MVPSNet: Fast Generalizable Multi-view Photometric Stereo


Learning the 3D Fauna of the Web


InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse Diffusion


LEAD: Latent Realignment for Human Motion Diffusion


A Unified Hierarchical Feature Learning and Cross-Modal Alignment for Audio-Visual Scene Recognition


Skin malignancy classification using patients’ skin images and meta-data: Multimodal fusion for improving fairness


Color-cued Efficient Densification Method for 3D Gaussian Splatting


Harnessing Self-Supervised Learning and Vision Transformers for Attention Based Multi-modal Survival Analysis


ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Biomedical Image


Learning Multi-Frame Image Restoration from Synthetic Data


Multimodal Learning for Detecting Stress under Missing Modalities


ViD: Vision in Dark


Audio-visual integration in neural network and human brain


IrrNet: Spatio-Temporal Segmentation guided Classification for Irrigation Mapping


Mocap Everyone Everywhere: Lightweight Motion Capture with Smartwatches and a Head-Mounted Camera


HouseCat6D - A Large-Scale Multi-Modal Category Level 6D Object Perception Dataset with Household Objects in Realistic Scenarios


Analysis of Learned Features and Framework for Potato Disease Detection


What Appears Appealing May Not be Significant! - A Clinical Perspective of Diffusion Models


A Cross-Dataset Study for Text-based 3D Human Motion Retrieval


Towards a Perceptual Evaluation Framework for Lighting Estimation


Tyche: Stochastic In-Context Learning for Medical Image Segmentation


A Multi-Spectral Camera Network for Real-Time Bleeding Detection


HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation


Temporal Coarse to Fine to Finer Difference Spotting for Action Recognition


Action-conditioned video data improves predictability


Hybrid Multiplicative and Fourier Disparity Layers based Light Field Coding for Autostereoscopic Displays