Federated Learning Research Track

Overview

This page is for students who want to study from deep learning fundamentals to federated learning (FL) and eventually work on research topics such as:

The page is designed for self-study. Students are recommended to read the papers, run small-scale experiments, and contact me after they have built enough depth.


How to use this page


What I am looking for

I am looking for students who can gradually become capable of writing papers on FL-related topics.

Good candidates usually:

Part I. What I recommend students do first

Option 1. Easiest and most practical path

Option 2. Best path for publishable ideas

Option 3. Best path for students interested in current trends

Part II. Suggested first mini-projects

Students should not begin with a huge project. Start with a small project that can realistically be finished.

Project idea A — domain shift

Compare:

on a small multi-domain benchmark. Then ask whether a simple client-specific module improves unseen-domain performance.

Project idea B — personalization

Compare:

on small image datasets under non-IID partition. Then test whether a simple prompt/local-head design improves personalization.

Project idea C — continual / drift

Compare:

under a small class-incremental setting. Then study forgetting vs communication vs privacy constraints.

Part III. What to avoid at the beginning

Do not begin with:

The goal is to build depth, not to fail because the setup is too large.

Part IV. When to contact me

Contact me after you satisfy some of the following.

Part V. Core FL

These papers are the minimum background. Read them first.

1. FedAvg (AISTATS 2017)

2. FedProx (MLSys 2020)

3. SCAFFOLD (ICML 2020)

Part VI. Track A — FL under Domain Shift / Domain Generalization

This track is recommended before personalization. Students often understand FL much more naturally once they first build intuition about distribution shift in the centralized setting.

Typical question:

If different clients correspond to different domains, what kind of model should we learn, and what should generalize across domains?

Why start with this track

This track helps students build intuition for:

It is also close to realistic deployment settings:

Recommended order

A0. First build intuition in the centralized setting

Before jumping into FL-domain papers, start with one tutorial/survey and one simple domain-generalization paper.

A1. FedDG-GA (CVPR 2023)

A2. CCST (WACV 2023)

A3. FDG Survey (Proceedings of the IEEE 2025)

What students should reproduce first in this track

Recommended progression:

Good starter datasets for this track

Avoid very large domain benchmarks at the beginning.

Part VII. Track B — Personalized Federated Learning

Once students understand that different clients may correspond to different domains or data distributions, personalization becomes much easier to motivate.

Typical question:

Why is a single global model insufficient when each client has a different data distribution?

Why this track is good

This track is suitable for students who want:

Recommended order

B1. SGPT (CVPR 2024)

B2. FedPAC (ICLR 2023) — strong recommendation

B3. pFedMoAP (ICLR 2025) — advanced trend

What students should reproduce first in this track

Choose one:

Then use one advanced paper as follow-up reading:

Good starter datasets for this track

Avoid very large-scale setups in the beginning.

Part VIII. Track C — FL + Incremental / Continual Learning / Concept Drift

This track is highly promising for publishable work. It combines temporal shift with client heterogeneity.

Typical questions:

Why this track is attractive

Recommended order

C1. Federated Class-Incremental Learning (CVPR 2022)

C2. TARGET (ICCV 2023)

C3. LANDER (CVPR 2024)

C4. LoRM (ICLR 2025)

C5. FedCCFA (NeurIPS 2024) — optional extension

What students should reproduce first in this track

Choose one:

Then move to:

Use FedCCFA as follow-up reading for extension ideas.

Good starter datasets for this track

Avoid large-scale continual benchmarks at the beginning.

Part IX. Recommended code / benchmark libraries

These libraries are useful because students should not waste too much time building everything from scratch.

1. PFLlib (strongly recommended for personalization)

2. DomainBed (recommended for domain-shift intuition)

3. FCIL official code

4. TARGET official code

5. Awesome Personalized FL list

6. Awesome-FL