Data-driven decision making powered by Artificial Intelligence (AI) algorithms is changing the global economy and has a profound positive impact on our daily life. With the exception of very large companies that have both the data and the skills to develop powerful AI-driven services, the large majority of provably possible ML services, from e-health, to transportation and predictive maintenance, to name just a few, still remain at the idea (or prototype) level, for the simple reason that data, the skills to manipulate them, and the business models to bring them to market, seldom co-exist under the same roof. The value of data comes from its contextualisation and combination with other data. Indeed, this can give way to many new services and products. Furthermore, data must be combined with AI and business skills that can unleash its full power for society and economy. This landscape has given rise to the highly dynamic sector of Data Economy, involving Data Providers/Controllers, Data Intermediaries, oftentimes in the form of Data Marketplaces or Personal Information Management Systems for end-users to control and even monetise their personal data. Despite its huge potential and observed initial growth, the Data Economy is still at its nascent phase and faces several challenges and a broad range of technical issues across multiple disciplines of Computer Science including databases, machine learning, distributed systems, security, privacy, and human-computer interaction. The Data Economy workshop aims at bringing together all the data management and CS skills required for helping the Data Economy liftoff by addressing a range of technical challenges
Santiago Andrés Azcoitia (Universidad Politécnica de Madrid, Spain)
George Konstantinidis (University of Southampton, United Kingdom)
To participate in the half-day workshop, please register through the online portal.
Note that the event is co-located with VLDB 2025 and will be hosted in London
Paul Groth is Professor of Algorithmic Data Science at the University of Amsterdam where he leads the Intelligent Data Engineering Lab (INDElab). He is scientific director of the UvA’s Data Science Center, and co-scientific director of two Innovation Center for Artificial Intelligence (ICAI) labs.
Paul was co-chair of the W3C Provenance WG and contributed to the FAIR data principles. He is co-author of “Provenance: an Introduction to PROV” and “The Semantic Web Primer: 3rd Edition” as well as numerous academic articles.
More detailed bio here.
In this talk, I describe our experience in creating a new data provider - longform.ai - premised on the capabilities of AI. longform.ai is a startup focused on creating curated datasets from conversations such as podcasts and webinars. I talk about how AI capabilities change the economics of provisioning and curating data. The importance of data provenance and quality. The surprising importance of user interfaces in making data available and the way AI changes interfaces to data providers. Lastly, I reflect on implications for research.
7.30 - 8.30 – Registration
8.30 – 8.45 – Welcome and Introduction
8.45 – 9.40 – Invited Talk by Paul Groth: "Building a new Data Provider based on AI"
9.40 – 10 – Paper 1. Soulmaz Gheisari, Jaime Osvaldo Salas, Semih Yumusak, and George Konstantinidis. MINiDM: Multi-Issue Negotiation in Decentralised DataMarketplaces.
10 -10.30 – Coffee Break
10.30 – 10.50 – Paper 2. S. Andrés Azcoitia and Alicia Cabrero Jiménez. An Interpretable Market-based Data Price Prediction Tool
10.50 – 11.10 – Paper 3. Hajar Baghcheband, Carlos Soares, and Luis Paulo Reis. UxV-DPN: Utility-vs-Value Data Pricing and Negotiation Mechanism in Machine Learning Data Marketplace
11.10 – 11.30 – Paper 4. Yizhou Ma, Xikun Jiang, Wenbo Wu, Zhuoqin Yang, and Luis-Daniel Ibáñez. Mixture-of-Experts based Model Market
11.30 – 11.50 – Paper 5. Shanshan Jiang, Sondre Sørbø, Phil Tinn, Shang Ferheng Karim, and Dumitru Roman. LLMDap: LLM-based Data Profiling and Sharing.
11.50 – 12 – Closure
12 - 13.30 – Lunch break & networking
Feb 14: The website is online!
Apr 30: Deadline extended -> Jun 9
May 1: Paul Groth confirmed as keynote speaker
June 20: The list of accepted papers is out
July 18: Camera-ready papers in
Aug 1: Program is out
Aug 29: Program updated to include registration and lunch break / networking
Submission Deadline: Jun 9, 2025, 23.59 AoE
Notification: June 20, 2025
Camera Ready: July 18, 2025
Workshop Date: September 5, 2025
The Third Data Economy Workshop will be co-located with the 2025 International Conference on eVery Large Data Bases (VLDB 2025). The workshop and the conference will be held at in the Queen Elizabeth II Center in the heart of Westminster, London.
The first edition of the workshop took place on December 8th 2022 in Rome, co-located with the ACM CoNEXT conference.
The second edition of the workshop took place on June 18th 2023 in Seattle, co-located with ACM SIGMOD/PODS conference.