عafifi

اللهم اغفر لأبي وارحمه، وتجاوز عن سيئاته، واجعل قبره روضة من رياض الجنة -- O Allah, forgive my father and have mercy on him.

عafifi

Mahmoud Afifi

Ex-Research Scientist, Ex-Camera Engineer

PhD in Computer Science

Email: m.3[last name][at]gmail[dot]com

━━━━━━━━━━━━━━━━━━━━━━━━━━━━

Experience

Staff Research Scientist / Research Intern: Led the development of a modular AI-based ISP framework with support for multiple picture styles, including dataset creation, an AWB module, a modular neural photofinishing module, and an interactive photo-editing tool, and contributed to post-processing mapping for cross-camera preference-aware WB. Also built the Raw-JPEG Adapter pipeline, which enables user-controllable post-capture re-rendering with new styles by embedding full-resolution raw data with under 2 MB per image. As an intern, I worked on My Filters and developed AI-based color and exposure correction methods (Deep WB & MSEC).

Camera Software Engineer / Student Researcher / Research Intern: Worked with the Pixel team on color correction of Pixel phone cameras. Developed a camera-independent spatially varying AWB correction framework, a chromaticity mapping method for cross-camera transfer without traditional color-chart calibration, and a dual-exposure feature (DEF) for HDR-based illuminant estimation. As a side project, I collaborated with the Gemini team to explore zero-shot image classification using multimodal large language models (WDYS). As a student researcher, I developed a cross-camera AWB method (C5) with the Gcam team.

Machine Learning / Camera Algorithms Engineer: Worked with the Camera ISP Algorithm team on color correction for iPhone cameras. Contributed to spatially varying AWB correction of iPhone cameras and conducted proof-of-concept designs for spectral light sensors.

Computer Vision R&D Engineer / Consultant: Developed an ML algorithm for skin color correction and consulted on hairstyle editing and hair color matching used in LUXY HAIR virtual demo.

Consultant: Worked on image harmonization.

Research Engineer / Consultant: Developed the color correction module in NUDEMETER and consulted on skin tone analysis.

Postdoc Visitor: Worked on camera raw-to-raw mapping, image stylization, and spatially varying auto white balance correction.

Research

I am interested in low-level computer vision and computational photography, with a focus on color processing, editing, and photographic quality enhancement. Below are selected examples of my work. For a full list of publications, please click here.

RawGen: Learning Camera Raw Image Generation

Dongyoung Kim, Junyong Lee*, Abhijith Punnappurath*, Mahmoud Afifi*, Sangmin Han, Alex Levinshtein, and Michael S. Brown

arXiv, 2026

AI Center-Toronto, Samsung Electronics

A diffusion-based method for generating realistic camera raw images. RawGen produces a latent representation of linear CIE XYZ images, conditioned on an sRGB image or a text prompt. The latent is decoded to CIE XYZ and mapped to arbitrary target camera raw spaces.

* Equal contribution

arXiv | Project Page

Modular Neural Image Signal Processing

Mahmoud Afifi, Zhongling Wang, Ran Zhang, and Michael S. Brown

arXiv, 2025

AI Center-Toronto, Samsung Electronics

Unlike most neural ISPs that treat the entire imaging pipeline as a single black-box network, we decompose the process into standard, interpretable, modular learning-based components. This design requires no manual tuning and enables easier debugging, seamless scaling, cross-camera generalization, and fine-grained customization. It also gives users complete control over every stage of the pipeline and supports unlimited post-editable re-rendering by storing compact raw data within the final image. In addition, it enables learning diverse picture styles with minimal memory overhead and integrates naturally with external image-processing functions.

arXiv | Code & Executables | GPU-Accelerated Bilateral Solver | Video

Raw-JPEG Adapter: Efficient Raw Image Compression with JPEG

Mahmoud Afifi, Ran Zhang, and Michael S. Brown

arXiv, 2025

AI Center-Toronto, Samsung Electronics

A lightweight, learnable pre-processing pipeline that adapts camera raw images before standard JPEG compression using spatial-domain and optionally frequency-domain transforms. The operations are fully invertible, with parameters fitting within the JPEG file’s comment (COM) segment (<64 KB), enabling accurate reconstruction of the original raw image after JPEG decoding. This yields high-fidelity raw-to-JPEG storage with significant size reduction.

arXiv | Code

Time-Aware Auto White Balance in Mobile Photography

Mahmoud Afifi*, Luxi Zhao*, Abhijith Punnappurath, Mohamed A. Abdelsalam, Ran Zhang, and Michael S. Brown

ICCV 2025

24% acceptance rate

AI Center-Toronto, Samsung Electronics

Timestamp and geolocation, combined with capture metadata, provide strong cues for estimating scene illuminants in smartphone camera white balancing. Our method leverages this data, along with color information, using a lightweight learnable model (~5K parameters) that runs efficiently on a flagship mobile DSP (0.25 ms) and CPU (0.80 ms), achieving high accuracy. We also introduce a large dataset (~3.2K raw images) from the S24 Ultra, containing ground-truth illuminants (neutral and user-preference-based) and capture metadata.

* Equal contribution

Multispectral Demosaicing via Dual Cameras

SaiKiran Tedla*, Junyong Lee*, Beixuan Yang, Mahmoud Afifi, and Michael S. Brown

ICCV 2025 (Highlight)

24% acceptance rate

AI Center-Toronto, Samsung Electronics in collaboration with York University

A method for multispectral (MS) image demosaicing designed for dual-camera setups that leverages co-captured high-fidelity RGB images to guide the reconstruction of lower-fidelity MS images. We also provide a large dataset of paired RGB and MS mosaiced images with ground-truth demosaiced outputs.

* Equal contribution

CCMNet: Leveraging Calibrated Color Correction Matrices for Cross-Camera Color Constancy

Dongyoung Kim, Mahmoud Afifi, Dongyun Kim, Michael S. Brown, and Seon Joo Kim

ICCV 2025

24% acceptance rate

AI Center-Toronto, Samsung Electronics in collaboration with Yonsei University

By leveraging pre-calibrated color correction matrices (CCMs) existing in camera ISPs, we generate a compact camera fingerprint embedding to adapt our method to new cameras. Our method achieves state-of-the-art performance in color constancy across diverse cameras, while remaining lightweight.

Color Matching Using Hypernetwork-Based Kolmogorov-Arnold Networks

Artem Nikonorov*, Georgy Perevozchikov*, Andrei Korepanov, Nancy Mehta, Mahmoud Afifi, Egor Ershov, and Radu Timofte

ICCV 2025

24% acceptance rate

In collaboration with the University of Würzburg, Samara University, and IITP RAS

A color-matching framework that employs a hypernetwork to generate spatially adaptive weights for controlling KAN’s nonlinear splines. The method achieves state-of-the-art color matching between diverse source and target distributions in both supervised and unsupervised settings.

* Equal contribution

Paper | Supplementary Materials | arXiv | Code & Data

What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models

Abdelrahman Abdelhamed*, Mahmoud Afifi*, and Alec Go

arXiv, 2024

Google

With some prompt engineering, multimodal LLMs (e.g., Gemini) can perform zero-shot image classification. However, they may not consistently produce accurate target dataset labels. Our approach leverages multimodal LLMs & cross-modal embedding encoders to produce initial class prediction feature & image description feature alongside image feature, improving zero-shot image classification accuracy without the need for dataset-specific prompts. Our method outperforms prior methods across various datasets, achieving a 6.8% increase in accuracy on ImageNet.

* Equal contribution

arXiv | Code & Data | Patent Application | Google Research Post

Optimizing Illuminant Estimation in Dual-Exposure HDR Imaging

Mahmoud Afifi, Zhenhua Hu, and Liang Liang

ECCV 2024

27.9% acceptance rate

Google

Utilizing the chromatic distortion present between long and short exposure frames of HDR photography, we introduce a compact guiding feature for illuminant estimators. Processed by just ~300 learnable parameters, our method achieves results that match or surpass previous methods relying on thousands or even millions of parameters.

Paper | Supplementary Materials | arXiv | Poster | Patent Application

Rawformer: Unpaired Raw-to-Raw Translation for Learnable Camera ISPs

Georgy Perevozchikov, Nancy Mehta, Mahmoud Afifi, and Radu Timofte

ECCV 2024

27.9% acceptance rate

In collaboration with the University of Würzburg

Rawformer, an unsupervised Transformer-based encoder-decoder model for raw-to-raw mapping, enables the utilization of learnable camera ISP trained on a specific camera's raw images to render raw images taken by new cameras with different characteristics.

Paper | Supplementary Materials | arXiv | Code

Auto White-Balance Correction for Mixed-Illuminant Scenes

Mahmoud Afifi, Marcus A. Brubaker, and Michael S. Brown

WACV 2022

35% acceptance rate

York University

Mixed/single-illuminant scene white balancing does not necessarily require illuminant estimation. Instead, the problem could be bounded by a small set of predefined white-balance settings. Given that, we locally blend a set of small images rendered with different white-balance settings to generate the final corrected image.

Improving Single-Image Defocus Deblurring: How Dual-Pixel Images Help Through Multi-Task Learning

Abdullah Abuolaim, Mahmoud Afifi, and Michael S. Brown

WACV 2022

35% acceptance rate

York University

Jointly learning to predict the two DP views from a single blurry input image improves the network’s ability to learn to deblur the image. Generating high-quality DP views can be used for other DP-based applications, such as reflection removal.

PDF | Supplementary Materials | arXiv | Code & Dataset | Patent

Cross-Camera Convolutional Color Constancy

Mahmoud Afifi, Jonathan T. Barron, Chloe LeGendre, Yun-Ta Tsai, and Francois Bleibel

ICCV 2021 (Oral Presentation)

25.9% acceptance rate | 3% oral presentation acceptance rate

Google Research

A self-calibration method for cross-camera color constancy through the lens of transductive inference: additional (unlabeled) images are provided as input to the model at test time, which allows the model to calibrate itself to the spectral properties of the test-set camera during inference.

HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color Histograms

Mahmoud Afifi, Marcus A. Brubaker, and Michael S. Brown

CVPR 2021

23.4% acceptance rate

York University

HistoGAN is the first work to control colors of GAN-generated images based on features derived directly from color histograms. Our method learns to transfer the color information encapsulated in histogram features to the colors of a GAN-generated images (HistoGAN) or real input images (ReHistoGAN). As color histograms provide an abstract representation of image color that is decoupled from spatial information, our HistoGAN and ReHistoGAN are less restrictive and suitable across arbitrary domains.

Learning Multi-Scale Photo Exposure Correction

Mahmoud Afifi, Konstantinos G. Derpanis, Björn Ommer, and Michael S. Brown

CVPR 2021

23.4% acceptance rate

SAIC, Samsung Research America in collaboration with Heidelberg University

A single coarse-to-fine deep learning model with adversarial training to correct both over- and under-exposed photographs.

CIE XYZ Net: Unprocessing Images for Low-Level Computer Vision Tasks

Mahmoud Afifi, Abdelrahman Abdelhamed, Abdullah Abuolaim, Abhijith Punnappurath, and Michael S. Brown

TPAMI 2021

Impact factor: 20.8 (2023)

York University

Learning accurate camera-rendering linearization gives a significant improvement for different computer vision tasks (e.g., denoising, deblurring, and image enhancement).

Paper | arXiv | Supplementary Materials | External Link | Code & Dataset

Semi-Supervised Raw-to-Raw Mapping

Mahmoud Afifi and Abdullah Abuolaim

BMVC 2021

York University

A semi-supervised method to map between two different camera-raw spaces. Training requires an unpaired set of images besides a very small set of paired images taken by these two camera models.

arXiv | Dataset | Presentation

Deep White-Balance Editing

Mahmoud Afifi and Michael S. Brown

CVPR 2020 (Oral Presentation)

22.1% acceptance rate | 5.7% oral presentation acceptance rate

SAIC, Samsung Research America

A multi-task deep learning model for post-capture white-balance correction and editing

Modeling Defocus-Disparity in Dual-Pixel Sensors

Abhijith Punnappurath, Abdullah Abuolaim*, Mahmoud Afifi*, and Michael S. Brown

ICCP 2020

York University

A symmetry property of dual-pixel kernels for unsupervised depth estimation

* Equal contribution

Paper | Code & Dataset | Talk

Interactive White Balancing for Camera-Rendered Images

Mahmoud Afifi and Michael S. Brown

CIC 2020 (Oral Presentation)

York University

A simple method to link the nonlinear white-balance correction functions, introduced in our CVPR'19 work, to the user's selected colors to allow interactive white-balance manipulation

arXiv | Code | Presentation

System and Method for Color Matching

Atima Lui, Nyalia Lui, Mahmoud Afifi, and Ariadne Bazigos

US Patent 2020

My Nudest Inc

A system for analyzing user input, combining user's image(s) and query responses to provide tailored color outputs. Through color correction and comparison to predetermined color identifiers, it delivers accurate results and product recommendations.

Patent | PDF

What Else Can Fool Deep Learning? Addressing Color Constancy Errors on Deep Neural Network Performance

Mahmoud Afifi and Michael S. Brown

ICCV 2019

25% acceptance rate

York University

Deep learning models can be fooled by white-balance errors and their accuracy can be improved by augmented images with different white-balance settings.

When Color Constancy Goes Wrong: Correcting Improperly White-Balanced Images

Mahmoud Afifi, Brian Price, Scott Cohen, and Michael S. Brown

CVPR 2019

25.2% acceptance rate

York University in collaboration with Adobe Research

The first work to directly address the problem of incorrectly white-balanced images; requires a small memory overhead and it is fast.

Image Recoloring Based on Object Color Distributions

Mahmoud Afifi, Brian Price, Scott Cohen, and Michael S. Brown.

Eurographics 2019 (Short Papers)

York University in collaboration with Adobe Research

A fully automated image recoloring without the need for target/reference images

Sensor-Independent Illumination Estimation for DNN Models

Mahmoud Afifi and Michael S. Brown

BMVC 2019 (Oral Presentation)

28% acceptance rate | 5% oral presentation acceptance rate

York University

Learning a new canonical space in an unsupervised manner allows us to train a single deep model on multiple camera sensors and perform accurate illuminant estimation for images captured by new unseen camera sensors in the inference phase.

Color Temperature Tuning: Allowing Accurate Post-Capture White-Balance Editing

Mahmoud Afifi, Abhijith Punnappurath, Abdelrahman Abdelhamed, Hakki Can Karaimer, Abdullah Abuolaim, and Michael S. Brown

CIC 2019 (Oral Presentation)

Best paper award

York University

With a small modification to existing camera ISPs, we can achieve accurate post-capture white balance editing by embedding a set of mapping coefficients in the JPEG metadata.

Paper | Project Page | Code | Presentation | Patent Application

AFIF4: Deep Gender Classification Based on AdaBoost-Based Fusion of Isolated Facial Features and Foggy Faces

Mahmoud Afifi and Abdelrahman Abdelhamed

JVCI 2019

Impact factor: 2.6 (2023)

Assiut University

Gender classification can be improved using different facial features; a user study validates our finding.

arxiv | Dataset

11K Hands: Gender Recognition and Biometric Identification Using a Large Dataset of Hand Images

Mahmoud Afifi

MTA 2019

Impact factor: 3.0 (2023)

Assiut University

Hand images can be used for gender recognition and biometric identification; a large dataset of hand images enables us to train our two-stream deep model.

arxiv | Project Page & Dataset | Code

Tensor Methods for Group Pattern Discovery of Pedestrian Trajectories

Abdullah Sawas*, Abdullah Abuolaim*, Mahmoud Afifi, and Manos Papagelis

MDM 2018

Best paper award

York University

Efficient discovery of evolving groups of pedestrians; a new group pattern is introduced in the journal version.

* Equal contribution

Project Page | Poster | Journal Version

MPB: A Modified Poisson Blending Technique

Mahmoud Afifi and Khaled F. Hussain

CVM 2015

Impact factor: 17.3 (2023)

Assiut University

Bleeding artifacts caused by Poisson image editing can be reduced by a simple two-stage blending approach.

Paper | Project Page | Code | Video

Conference version: Paper | Project Page | Code | Video

Fast Video Completion Using Patch-Based Synthesis and Image Registration

Mahmoud Afifi, Khaled F. Hussain, Hosny M. Ibrahim, and Nagwa M. Omar

ISPACS 2014

Assiut University

In many scenarios, image registration and blending can help to get a fast video completion.

External Link | Code

Honors

Dissertation Award

- CS-Can|Info-Can Canadian Computer Science Distinguished Dissertation Award, 2021 | Lassonde post
- CIPPRS John Barron Doctoral Dissertation Award, 2021 | Lassonde post
- Nominated by EECS for the Best Doctoral Dissertation Prize at York University, 2021

Best Paper Award

- The 27th IS&T Color and Imaging Conference, 2019 (CIC27)
- IEEE International Conference on Mobile Data Management, 2018 (MDM'18)

Outstanding Reviewer

- IEEE/CVF International Conference on Computer Vision, 2025 (ICCV'25)
- IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025 (CVPR'25)
- European Conference on Computer Vision, 2024 (ECCV'24)
- IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024 (CVPR'24)
- IEEE/CVF International Conference on Computer Vision, 2021 (ICCV'21)
- IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020 (CVPR'20)
- Honourable Mention at British Machine Vision Conference, 2019 (BMVC'19)

Challenges

- Runner-Up Award overall tracks in AIM 2020 Challenge on Scene Relighting and Illumination Estimation at ECCV'20
- Best Short CG Film in the Fourth Forum of Egyptian Faculties of Computer and Information Science, 2010

Others

- Named in the Stanford's list of the world’s top 2% scientists, 2022, 2023, 2024, 2025
- Ontario Graduate Scholarship (OGS), 2020-2021
- Hadi and Ozra Arjomandi Scholarship, 2018
- Canadian Institute for Advanced Research (CIFAR) Scholarship for DLRL, 2018
- York Graduate Scholarship (YGS), 2017-2021

Professional Services

Area Chair: WACV'24, WACV'25
Google Advocate for the African Computer Vision Summer School (ACVSS) in Kenya, 2024
Program Committee Member:
- NTIRE: New Trends in Image Restoration and Enhancement workshop and challenges in conjunction with CVPR 2026
- AIM: Advances in Image Manipulation workshop in conjunction with ICCV 2025
- NTIRE: New Trends in Image Restoration and Enhancement workshop and challenges in conjunction with CVPR 2025
- AIM: Advances in Image Manipulation workshop in conjunction with ECCV 2024
- NTIRE: New Trends in Image Restoration and Enhancement workshop and challenges in conjunction with CVPR 2024
- NTIRE: New Trends in Image Restoration and Enhancement workshop and challenges in conjunction with CVPR 2023
- NTIRE: New Trends in Image Restoration and Enhancement workshop and challenges in conjunction with CVPR 2022 | Apple post
- NTIRE: New Trends in Image Restoration and Enhancement workshop and challenges in conjunction with CVPR 2020
- Student Representative at Tenure & Promotion Adjudicating Committee, EECS, York University, 2020
Reviewer:
- Conferences: ECCV'26, CVPR'26, WACV'26, ICCV'25, CVPR'25, ECCV'24, CVPR'24, ACCV'24, LIM'24, SIGGRAPH Asia'23, ICCV'23, CVPR'23, SIGGRAPH Asia'22, CVPR'22, WACV'22, SIGGRAPH Asia'21, SIGGRAPH'21, ICCV'21, CVPR'21, BMVC'21, WACV'21, MASCOTS'21, CVPR'20, WACV'20, ACCV'20, MASCOTS'20, CIC28, CRV'20, WACV'19, CIC27, BMVC'19, BMVC'18
- Journals: T-PAMI, T-IP, IJCV, T-CI, T-MM, CVM, VTM, T-ITS, T-CSVT, T-HMS, T-MECH, T-CE, Neurocomputing, IEEE Access, TOMM, TVCJ, MTAP, COL, Displays, JVCI, MULT, MVAP, SIVP, JRTIP, IET IP, IET CV, IET SP, IET EL, SPIE JEI, CIN, JHE, OPENCS
Vice-Chair: ACM Assiut Student Chapter

Page updated

Google Sites

Report abuse