home
Khalid Belhajjame is an associate professor (maître de conférences) with HDR (habilitation to direct research) at the University Paris-Dauphine, where he is a member of the LAMSADE research laboratory. Before moving to Paris, he was a research fellow for several years at the University of Manchester, and before that, a PhD student at the University of Grenoble. His research focuses on information and knowledge management. In particular, he has made significant contributions in the areas of data preparation, data privacy and protection, eScience, scientific workflow management, provenance tracking and exploitation, and knowledge graphs. He has published over 70 papers on the above topics. Most of his research proposals have been validated by real-world applications in the life sciences. He has participated in numerous European, French and UK funded projects, and has been an active member of the W3C Provenance working group, the NSF-funded DataONE Working Group on Science Workflows and Provenance. Since 2018, he is a member of the steering committee of the GDR MaDICS , a national research network for interdisciplinary data science research, and since January 2022, co-leader of the Database Working Group for Remote Sensing Data in the Earth Science Informatics (IEEE GRSS).
News
The slides of some of my talks are available on slideshare.
We have now a blog in which we report on practical aspects provenance. You can also find a blog that I wrote by analyzing W3C PROV implementations here.
Note for prospective students
I am interested in high quality PhD students. Look at my DBLP entry or google scholar profile for the research I am interested in. If you want to pursue a PhD in this area, contact me.
If your are interested in getting some experience with a research environment through an internship in the broad areas of information and knowledge management, then you can contact me.
Research interests
Data Preparation
Scientific Workflows
Data and Process Provenance
Privacy Preserving Data Integration
Knowledge Graphs
eScience
Research projects
ShareFAIR : Sharing reliable protocols to transform datasets into gold standards: Application to Neuro-Vascular Pathologies. PEPR Santé Numérique.
ReProVirtuFlow: I am a member of the MaDICS working group (Action) on "Reproducibility of scientific experiments".
CAIR: Contextual and Aggregrated Information Retrieval. Funded by ANR, 2015-2017.
Privacy4SmartCPs : Une approche de partage et d’intégration de données préservant la vie privée dans des environnements cyber-physiques. PEPS, 2015.
Workflow Driven Data Integration in Cloud Environments. Funded by Campus France, Utique, 2015-2017.
Wf4ever: This project investigates scientific workflow preservation. I am responsible for the design of the research object model that is used for bundling workflow definitions with auxiliary information necessary for workflow sustainability. I am also responsible for coordinating the provenance task force in this project.
myGrid platform: I am in charge of knowledge transfer. In particular, I work closely with software developers to make sure that the research results produced in the Wf4Ever project are implemented in myGrid tools, in particular Taverna and myExperiment, and that latest developments in provenance and annotation standards are adopted.
On-Demand Data Integration: Dataspaces by Refinement (July 2008 - June 2011). I was responsible for investigating the pay-as-you-go feature of dataspaces. I proposed methods and algorithms showing that schema mappings can be incrementally annotated, selected and refined based on user feedback.
FuGE, a Standard Format for Representing Functional Genomics Experiment Data (November 2007 - June 2008). I designed and implemented a web portal for querying and displaying FuGE proteomics data.
iSPIDER, In Silico Proteome Integrated Data Environment Resource (November 2004 - October 2007). I was responsible for designing and implementing a portal that provides integrated access to four proteomics data sources. I have also conducted research on semantic web services. Specifically, I showed how semantic annotations can be used to identify mismatches between connected operations in a workflow, and how such annotations can be inferred.
Publications
Contact details
Dr. Khalid Belhajjame
Université Paris Dauphine
Place du Maréchal de Lattre de Tassigny
75775 Paris Cedex 16, France
email: kbelhajj (at) googlemail (dot) com
Tel: +33 1 44054309
My office is P 213 on the second floor
My twitter account is kbelhajj