Incentives for Collaborative Learning and Data Sharing
August 13-15
TTIC Summer Workshop (Chicago, IL)
Incentives for Collaborative Learning and Data Sharing
August 13-15
TTIC Summer Workshop (Chicago, IL)
Machine Learning (ML) has achieved remarkable milestones in recent years, but its future hinges on a robust, ethically grounded, and well-incentivized data infrastructure. The next era of AI innovation requires specialized data that is scarce, proprietary, or sensitive (e.g., medical records, autonomous vehicle data, pharmaceutical test results). Collecting such data is cost and labor-intensive and subject to strict privacy and legal constraints. Simultaneously, the era of “free” or lightly regulated data (e.g., scraped from the public internet) is drawing to a close as copyright and user consent concerns rise — evidenced by legal challenges to large generative models and platform restrictions on web scraping. Taken together, these developments underscore the need for a new AI paradigm — one in which data rights are respected, data owners receive fair compensation, and valuable datasets are shared and reused to unlock novel applications. This raises the central question we want to address in our workshop
"How can we design incentive mechanisms to promote data sharing while safeguarding privacy, fairness, and intellectual property?"
To make progress towards the above question, our workshop would focus on themes such as:
Data valuation and attribution: Methods to determine data's value, detect bad/fake data, credit contributors, and establish fair compensation.
Mechanism design for data and model markets: theoretic approaches to foster honest participation, data quality, and fair profit-sharing.
Data privacy and autonomy: designing federated/distributed learning mechanisms and effectively utilizing personalization and differential privacy tools that respect data ownership and autonomy.
Collaboration among competitors: managing conflicts of interest between participants when they are competing for finite resources (e.g., firms compete for market share, autonomous vehicles compete for road usage)
Fairness and efficiency: study the potential definitions of fairness and the tension between designing fair systems and socially optimal efficiency.
Legal perspectives: Address data/model ownership, data market regulations, and alternatives such as data co-ops.
Practical algorithms: Develop implementations that can sustain multiple rounds of collaboration.
Empirical studies: Simulate multi-agent environments to demonstrate the need for or value of incentives.
CONFIRMED SPEAKERS LIST
UChicago
Harvard
Tel Aviv University
UC Berkeley
CMU
Tentative Schedule
9-9:30 Breakfast/coffee
9:30-9:45 Opening remarks
9:45-10:45 Talk
10:45-11:15 Coffee break
11:15-12:15 Talk
12:15-1:15 Lunch
1:15-2:30 Spotlight talks
2:30-3 Break
3-4 Talk #3
4-5 Poster session #1
5-6 Poster session #2
9-9:30 Breakfast/coffee
9:30-10:30 Talk
10:30-11 Coffee break
11-12 Talk
12-1 Lunch
1-2 Spotlight talks
2-2:30 Break
3-4 Talk
4-5:30 Open Problems Discussion
6-onwards Social!
9-9:30 Breakfast/coffee
9:30-10:30 Talk
10:30-11 Coffee break
11-12:30 Talks
12:30-2 Boxed lunch + speed-"dating"
2-3 Panel
2-2:30 Break
2:30-3:30 Open Problems Discussion or
Additional Spotlights
3:30 Closing remarks
VENUE
Safety: Look for places above 63rd St and East of Cottage Grove
Some very close by options:
Hyatt Place: (We are still negotiating with them to see if they can offer a better rate.)
SOPHY - Hyde Park: Promo Code: UNCH. (We are still negotiating with them to see if they can offer a better rate.)
Some cheaper options:
Quadrangle Club (Queen $179.00/night, King $189.00/night, King Suite $204.00/night): Make reservations by contacting them at quadclub@uchicago.edu or phone: 773.702.2550.
HI USA: There is a direct bus (line 6) and train (ME) connecting the hotel and the venue.
Options a bit further (14 min Uber to TTIC):
We offer financial aid for travel & accommodation, subject to availability.
Sponsored by