Portable OCT devices are much cheaper and easier to operate, but suffers from low SNR, small field of view and limited resolution.
High-quality commercial optical coherence tomography (OCT) systems can cost over $50K and are bulky, weighing over 50 pounds.
We have developed an image processing pipeline to super-resolve images captured from a low-cost, portable Optical Coherence Tomography device, enabling downstream diagnosis such as Age-related Macular Degeneration detection.
We propose OCTDiff, a bridged diffusion model designed to enhance image resolution and quality from portable OCT devices. Our image-to-image diffusion framework addresses key challenges in the conditional generation process of denoising diffusion probabilistic models (DDPMs). We introduce Adaptive Noise Aggregation (ANA), a novel module to improve denoising dynamics within the reverse diffusion process. Additionally, we integrate Multi-Scale Cross-Attention (MSCA) into the U-Net backbone to capture local dependencies across spatial resolutions.
To address overfitting on small clinical datasets and to preserve fine structural details essential for retinal diagnostics, we design a customized loss function guided by clinical quality scores. OCTDiff outperforms convolutional baselines and standard DDPMs, achieving state-of-the-art performance on clinical portable OCT datasets. Our model and its downstream applications have the potential to generalize to other medical imaging modalities and revolutionize the current workflow of ophthalmic diagnostics.
Read more here (presented at MICCAI FAIR 2022, Best Paper Award earned):