I joined Sapienza in 2021 as a tenure-track assistant professor (RTD-B) in the Department of Statistical Sciences. My research covers several aspects of data management, including entity resolution, data quality, algorithmic methods for large-scale datasets and responsible data science. I received my Ph.D. in Computer Science and Engineering from Sapienza University and I was a visiting student at AT&T Labs and Rutgers University. Before joining Sapienza, I was an assistant professor (RTD-A) in the Department of Engineering at Roma Tre University.
Publications
Recent selected papers:
SIGMOD 2022, "Hierarchical Entity Resolution using an Oracle", with Sainyam Galhotra, Barna Saha and Divesh Srivastava
SIGMOD 2022, "Explaining Link Prediction Systems based on Knowledge Graph Embeddings", with Andrea Rossi, Paolo Merialdo and Tommaso Teofili
ICDE 2022, "Effective Explanations for Entity Resolution Models", with Tommaso Teofili, Nick Koudas, Paolo Merialdo and Divesh Srivastava
VLDB Journal 2021, "Efficient and Effective ER with Progressive Blocking", with Sainyam Galhotra, Barna Saha and Divesh Srivastava
SIGMOD 2018, "Robust Entity Resolution using Random Graphs", with Sainyam Galhotra, Barna Saha and Divesh Srivastava
KDD 2017, "Fast Enumeration of Large k-Plexes", with Alessio Conte, Caterina Mordente, Maurizio Patrignani and Riccardo Torlone
VLDB 2016, "Online Entity Resolution Using an Oracle", with Barna Saha and Divesh Srivastava
Full list of papers: Google Scholar, DBLP
Service
Recent conference/workshop organization:
2025: PC member of SIGMOD, PVLDB, ALENEX
2024: PC member of SIGMOD, SEA, IRCDL, IEEE BigData, GUIDE-AI@SIGMOD, ADBIS Workshop, BDA, SEBD, Proceedings co-Chair of EDBT, Program Chair of aiDM@SIGMOD
2023: PC member of EDBT, SIGMOD, SEA, IRCDL, SEBD, AIxIA, ESA, AI4CH@AIxIA, Program Chair of aiDM@SIGMOD
2022: PC member of SEBD, AI4CH@AIxIA and KG@ICDM, Program Chair of aiDM@SIGMOD
2021: PC member of EDBT, co-Chair of SIGMOD programming contest and Program Chair of PIE@EDBT
2020: PC member of EDBT, aiDM@SIGMOD, SEBD and LSGDA@VLDB, Program Chair of PIE@EDBT, PC member and Poster Chair of ICDE, co-Chair of SIGMOD programming contest and Program Chair of DI2KG@VLDB
2019: PC member of CIKM and ICDE, Program Chair of DI2KG@KDD and PIE@CAiSE, Scholarship Chair of ACM WomENcourage. Sponsor for the Django Girls workshop.
2018: PC member of SEBD
Editorial:
2020-today: Associate Editor for ACM Journal of Data and Information Quality
2021-2022: Guest Editor for ACM JDIQ, Special Issue on "Data Quality and Ethics"
2020-2021: Guest Editor for Frontiers in Big Data in Data Mining and Management, Special Issue on "Big Data Management in Industry 4.0"
Teaching
A.A. 2023-24: Data Cleaning and Integration in Official Statistics, Master Degree in Statistical Methods and Applications, Sapienza University.
A.A. 2021-22, 2022-23, 2023-24: Informatica, Laurea in Statistica, economia, finanza e assicurazioni e Statistica gestionale, Sapienza University. (in Italian)
A.A. 2022-23: Introduction to Big Data Integration, Ph.D. program in Computer and Data Science, Modena and Reggio Emilia University.
A.A. 2021-22, 2022-23: Data Management, Master Degree in Statistical Methods and Applications, Sapienza University.
A.A. 2019-20: Modern approaches to Entity Resolution, Ph.D. program in Computer Science, Roma Tre University.
A.A. 2020-21: Elementi di Informatica, Laurea in Ingegneria Meccanica, Roma Tre University. (in Italian)
A.A. 2017-18, 2018-19, 2019-20: Fondamenti di Informatica, Laurea in Ingegneria Elettronica, Roma Tre University. (in Italian)
Advising
Lorenzo Balzotti, post-doc in Statistical Sciences at Sapienza University
Flavia Tagliafierro, Ph.D. in Statistical Sciences at Sapienza University (expected 2026)
Jerin George Mathew, National Ph.D. in Artificial Intelligence at Sapienza University (expected 2025)
Tommaso Teofili, Ph.D. in Computer Science and Automation at Roma Tre University (2023)
Andrea Rossi, Ph.D. in Computer Science and Automation at Roma Tre University (2022)
Elena Nieddu, Ph.D. in Computer Science and Automation at Roma Tre University (2021)
Projects
Recent research projects:
2024-2027: INTEND "Intent-based data operation in the computing continuum". HORIZON Research and Innovation Actions 101135576. (Unit Leader)
2024-2026: Performing arts, economics, and cultural policies. New interpretative paradigms between aesthetics and social sciences. PRIN National Research Project 2022P749MT
2023: FAIR - Spoke 5
2023-2025: "Trustworthy Technologies for Augmenting Knowledge Graphs". Sapienza Research Project B83C22007180001. (Principal Investigator)
2021-2024: FLOWER "Frontiers in Linking records: knOWledge graphs, Explainability and tempoRal data". SEED PNR Project. (Principal Investigator)
Recent industry/government projects:
Master data management of administrations, with Dipartimento della funzione pubblica. The project aims at providing a unified view over different administration indices, such as IPA and ISTAT, and a collection of tools for semi-automatic data integration and data quality management.
Data extraction from fiscal documents, with LAMBO. The project aimed at prototyping advanced OCR tools for fiscal documents, with high performance on mobile devices (possibly without Internet access). Read the blog article on our results (in Italian).
Data management of fiscal documents, with Mediatica. The project aimed at prototyping indexing and image processing tools for fiscal documents, in order to recognize their main layout features efficiently and speed up their manual processing.
Awards
2021: DL4KG Best Paper Award "Knowledge Graph Embeddings or Bias Graph Embeddings? A Study of Bias in Link Prediction Models", with Andrea Rossi and Paolo Merialdo
2019: IEEE ICWS Best Paper Award "On Computing Throttling Rate Limits in Web APIs through Statistical Inference", with Francesco Leotta and Massimo Mecella
2018: SIGMOD Reproducibility Award for the paper "Robust Entity Resolution using Random Graphs", with Sainyam Galhotra, Barna Saha and Divesh Srivastava
Other
Member of the Incoming Student Orientation Committee, Department of Statistical Sciences. (Coordination of outreach initiatives with high schools)
Member of the Internship Day organizing committee https://www.dss.uniroma1.it/en/didattica/internships, Department of Statistical Sciences. (Initiatives connecting master's degree students with stakeholders)
Instructor for the G4GRETA project https://g4greta.di.uniroma1.it/home (Third Mission)
Contacts
name.lastname at uniroma1.it