List of Accepted Papers
Oral Presentations
PARASOL: Parametric Style Control for Diffusion Image Synthesis
Extending global-local view alignment for self-supervised learning with remote sensing imagery
RetinaLiteNet: A Lightweight Transformer based CNN for Retinal Feature Segmentation
ABC-CapsNet: Attention based Cascaded Capsule Network for Audio Deepfake Detection
GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic Hand Gesture Recognition
Unsupervised Domain Adaptation for Weed Segmentation Using Greedy Pseudo-labelling
Extended Abstracts
Translating Imaging to Genomics: Leveraging Transformers for Predictive Modeling
3D Change Detection by 2D Segmentation Masks
COVIDx CXR-4: An Expanded Multi-Institutional Open-Source Benchmark Dataset for Chest X-ray Image-Based Computer-Aided COVID-19 Diagnostics
Optimizing Synthetic Correlated Diffusion Imaging for Breast Cancer Tumour Delineation
Cancer-Net PCa-Gen: Synthesis of Realistic Prostate Diffusion-Weighted Imaging Data via Anatomic-Conditional Controlled Latent Diffusion
Motion Diversification Networks
Optimizing Split Points for Error-Resilient SplitFed Learning
Anomaly Score: Evaluating Generative Models and Individual Generated Images based on Complexity and Vulnerability
Tell me how far: Leveraging Depth in Pre-training for Semantic Segmentation
Test-Time Zero-Shot Temporal Action Localization
ST-Gait++: Leveraging spatio-temporal convolutions for gait-based emotion recognition on videos
Gaze-LLE: Simplifying Gaze Target Estimation by Leveraging an Already-Learned Encoder
ProTeCt: Prompt Tuning for Taxonomic Open Set Classification
Enhancing Clinically Significant Prostate Cancer Prediction in T2-weighted Images through Transfer Learning from Breast Cancer
Improving Breast Cancer Grade Prediction with Multiparametric MRI Created Using Optimized Synthetic Correlated Diffusion Imaging
Using Multiparametric MRI with Optimized Synthetic Correlated Diffusion Imaging to Enhance Breast Cancer Pathologic Complete Response Prediction
SEA-GWNN: Simple and Effective Adaptive Graph Wavelet Neural Network
MVPSNet: Fast Generalizable Multi-view Photometric Stereo
Learning the 3D Fauna of the Web
InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse Diffusion
LEAD: Latent Realignment for Human Motion Diffusion
A Unified Hierarchical Feature Learning and Cross-Modal Alignment for Audio-Visual Scene Recognition
Skin malignancy classification using patients’ skin images and meta-data: Multimodal fusion for improving fairness
Color-cued Efficient Densification Method for 3D Gaussian Splatting
Harnessing Self-Supervised Learning and Vision Transformers for Attention Based Multi-modal Survival Analysis
ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Biomedical Image
Learning Multi-Frame Image Restoration from Synthetic Data
Multimodal Learning for Detecting Stress under Missing Modalities
ViD: Vision in Dark
Audio-visual integration in neural network and human brain
IrrNet: Spatio-Temporal Segmentation guided Classification for Irrigation Mapping
Mocap Everyone Everywhere: Lightweight Motion Capture with Smartwatches and a Head-Mounted Camera
HouseCat6D - A Large-Scale Multi-Modal Category Level 6D Object Perception Dataset with Household Objects in Realistic Scenarios
Analysis of Learned Features and Framework for Potato Disease Detection
What Appears Appealing May Not be Significant! - A Clinical Perspective of Diffusion Models
A Cross-Dataset Study for Text-based 3D Human Motion Retrieval
Towards a Perceptual Evaluation Framework for Lighting Estimation
Tyche: Stochastic In-Context Learning for Medical Image Segmentation
A Multi-Spectral Camera Network for Real-Time Bleeding Detection
HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation
Temporal Coarse to Fine to Finer Difference Spotting for Action Recognition
Action-conditioned video data improves predictability
Hybrid Multiplicative and Fourier Disparity Layers based Light Field Coding for Autostereoscopic Displays