SCHEDULE
Hybrid Event: Onsite & Online
A few days before the event starts, we will send over the link to access the platform by email.
A few days before the event starts, we will send over the link to access the platform by email.
9.30 AM - 10:10 AM CET
Marco Lorenzi (INRIA Sophia Antipolis)
Abstract
The application of FL to typical biomedical data analysis scenarios is currently limited by several factors. Model aggregation is often sub-optimal when datasets are heterogeneous, for example when classes and views are not uniformly represented across data centres (i.e. non-iid distributed). Furthermore, although FL avoids data sharing across centers, sharing model parameters may still open up the possibility of information leakage and privacy breaking in presence of malicious clients. Finally, there is currently a limited availability of production ready FL schemes that can be readily used in real-life multi-centric data analysis applications.
This talk aims at providing a comprehensive illustration of FL, with a particular focus in healthcare applications. I will present our current efforts in defining FL schemes robust to data heterogeneity among clients. I will introduce novel theoretical investigations of the statistical properties of client aggregation mechanisms, and will introduce novel Bayesian paradigms for the modelling of clients variability. Finally, I will highlight Fed-BioMed, an open-source and secure federated learning framework for applications in healthcare. I will present the basic paradigms for software components for clients and central node, and illustrate the workflow for deploying models in typical FL scenarios. "
10:10 AM - 10.50 AM CET
Martin Jaggi (EPFL)
Federated learning is enabling many promising new applications for machine learning while respecting users' privacy. In this talk, we will discuss the two aspects of 1) robustness to potentially malicious participants and faulty data, and 2) personalization of the trained ML models to each participant, in the realistic setting of heterogeneous data. We will employ tools from stochastic gradient descent algorithms and discuss both theoretical and practical implications in the federated setting, as well as advocating for removing central coordinators to take steps towards fully decentralized collaborative learning.
10:50 AM - 11.10 AM CET
11:10 AM - 11:50 AM CET
Aurélien Bellet (INRIA)
Federated learning (FL) is a machine learning paradigm where several participants collaboratively train a model while keeping their data decentralized. However, the model parameters or gradients exchanged during the FL training process may leak information about the data. In this talk, I will show how to use the notion of Differential Privacy (DP) to design FL algorithms that provably ensure privacy and confidentiality. In particular, I will present two approaches (one for server-orchestrated FL and one for fully decentralized FL) that nearly match the privacy-utility trade-off of the centralized setting without relying on a trusted curator or complex secure computation primitives.
Related papers:
11:50 AM - 12:30 PM CET
Laurent Massoulié (Microsoft Research, Inria Joint Center)
Abstract
The training of models over distributed data calls for distributed optimization schemes. This has motivated research on distributed convex optimization, leading to the identification of lower bounds on convergence speed, and distributed optimization algorithms with convergence speed potentially matching these lower bounds, for a variety of settings.
In this talk we focus on the important setting of asynchronous operation, for which we propose optimization algorithms with optimal speed. We next consider systems with heterogeneous communication and computation delays, for which we propose fast asynchronous algorithms adapted to these heterogeneous delays.
12:30 PM - 2:00 PM CET
Lunch will be provided.
2:00 PM - 2:40 PM CET
Pascal Paillier (Zama)
2:40 PM - 3:20 PM CET
Yann Favier (University Savoie Mont-Blanc)
Abstract
Health data, qualified as sensitive data, have a special status in the legal regime of data protection defined both in the European framework of the General Data Protection Regulation (GDPR) of April 27, 2016 and by specific laws end others regulations. While public data is being opened up, health data requires an adapted legal treatment that is all the more necessary as artificial intelligence plays a vital role in medicine and clinical research.
3:20 PM - 3:40 PM CET
3:40 PM - 4:00 PM CET
From Me to We: Breaking down silos for new collaboration style
Laetitia Kameni, AI R&D Manager, Accenture Labs
4:00 PM - 4:20 PM CET
Federated Learning Applications in Healthcare: Challenges and Opportunities
Mathieu Andreux, Federated Learning Group Lead - Owkin