November, 2025
Attended the AIES Conference in Madrid and presented our paper: Disciplinary Practices in the Generation of Text Synthetic Data: A Critical Discourse Analysis. In this work, we reported on a Critical Discourse Analysis through which we identified three recurring disciplinary practices of practitioners when generating text synthetic data that led to establishing and reinforcing Cultural Scarcity, and propose a set of recommendations to counteract it.
Our paper: Online Safety for All: Sociocultural Insights from a Systematic Review of Youth Online Safety in the Global South was presented at CSCW by Ozioma Oguine. In this article, we report on a systematic literature review of 66 youth online safety research studies (2014–2024) with emphasis on the Global South, identifying methodological gaps, underrepresented youth populations, and culturally specific risk factors.
October, 2025
I am happy to share that insights from the data work research I have been leading have been incorporated into the IBM AI Risk Atlas. We translated empirical research into risks that are specifically associated with the use of synthetic data, and risks that are increased or more likely to occur due to the use of synthetic data.
September, 2025
Excited to share the recent publication of our latest POV on synthetic data, exploring its benefits, risk mitigations, and offering actionable guidance on best practices for its generation and responsible use. This work is the result of a dedicated workstream of the IBM Responsible Technology Board, focused solely on synthetic data and data work, which I am proud to co-lead with Alina Glaubitz, alongside an exceptional group of contributors whose insights made this possible.
May, 2025
From May 26 to 30, I had the opportunity to participate in the third edition of the Summer School in Responsible AI and Human Rights, organized by Mila and Université de Montréal.
April, 2025
Attended CHI and presented our paper "Emerging Data Practices: Data Work in the Era of Large Language Models", in which we report on an interview study with AI practitioners to examine how data practices evolve across the LLM development lifecycle, identifying key ethical challenges and actionable opportunities for human-centered support in generative AI development.
Excited to share the publication of the Guidance for Inclusive AI: Practicing Participatory Engagement. This was the result of the wonderful effort of the Global Task Force for Inclusive AI, which I have been the luck to be part of, convened by Partnership on PAI.
January, 2025
I am excited to announce that our paper, “What Knowledge Do We Produce from Social Media Data and How?” won a Best Paper Award at GROUP this year.
Our paper, “Emerging Data Practices: Data Work in the Era of Large Language Models” was accepted at CHI. I will be attending the conference in person in Yokohama from 26 April to 1 May.