My research addresses challenges related to data preparation in data-intensive and data-driven systems. It spans data integration, data quality, scientific workflows, data provenance, and knowledge graphs, with applications in life sciences, biomedicine, biodiversity, and environmental sciences.
Data Quality and Data Profiling
I conduct research on the discovery and maintenance of data quality rules in dynamic datasets.My work focuses on data quality assessment, profiling, and dependency discovery, with particular attention to data used and generated within data preparation pipelines and data-driven workflows.
Scientific Workflows and Computational Reproducibility
My work investigates the design, execution, maintenance, and long-term preservation of scientific workflows. I contribute methods and tools that support reproducibility, workflow evolution, and the validation of data-driven experiments.
Data Provenance and Lineage
I conduct research on the capture, management, querying, and anonymization of provenance information. This includes techniques for scalable provenance ingestion, provenance-based debugging, and privacy-preserving provenance.
Knowledge Graphs and Semantic Technologies
I work on the construction, maintenance, and exploitation of large-scale and dynamic knowledge graphs. My research addresses incremental maintenance, provenance-aware reasoning, and the traceability of inferred knowledge.
ShareFAIR (PEPR Santé Numérique, 2023–2028)
FAIR sharing of reliable protocols and workflows, with applications to neurovascular pathologies. Lead of the architecture work package.
R2P2 (CNRS-funded)
Reproducible and reusable protocols for multimodal data analysis in complex pathologies.
SecSky (CNRS PEPS)
Privacy-preserving integration of multi-source data in data-intensive environments.
COST Action KeyStone
Keyword search over heterogeneous and structured data sources.
I collaborate with researchers from Université Paris Dauphine, PSL, CNRS, INRIA, Institut Pasteur, University of Idaho, University of Manchester, University of Lyon, and several international institutions. My research is conducted within the LAMSADE laboratory, where I co-head the Data Science research pole.