GOOD-DATA @ AAAI 2025

1st Workshop on Preparing Good Data for Generative AI: Challenges and Approaches

3 March 2025

Location: Philadelphia, PA, USA
Room TBD

motivation

Foundation models highly depend on the data they are trained on. While self-supervised learning is one of their promises, it is clear that the carefully processed datasets lead to better models. While datasets and models are frequently released by the community, the data preparation recipes are relatively nascent and not fully open. In this workshop, we invite contributions and collaborations in data preparation recipes for creating and using foundation models and generative AI applications, including (but not limited to) pre-training, alignment, fine-tuning, and in-context learning. Data preparation spans data acquisition, cleaning, processing, mixtures, quality assessments, value of data, ablation studies, safety, and governance. 

Participation

We encourage submissions that are under one of the topics of interest, but also we welcome other interesting and relevant research for preparing good data.

Papers will be peer-reviewed under a double-blind policy, and the submission deadline is November 15th, 2024. Accepted papers will be presented at the poster session, some as oral presentations, and one paper will be awarded as the best paper.

Please see the call for papers page for more details about paper submission.


Registration: To register for the workshop, please follow the workshop registration guidelines of AAAI 2025.

Contact

For questions, you can contact us at: good-data-aaai-25@googlegroups.com


Sponsors