Program
Session 1: 10:30 to 12:40
Session 1: 10:30 to 12:40
- 10:30 to 11:30:
- Keynote: Data Profiling for Data Integration, Prof Felix Naumann, HPI / Potsdam University
- 11:30 to 12:00:
- Noise Reduction in Distant Supervision for Relation Extraction using Probabilistic Soft Logic, Birgit Kirsch, Zamira Niyazova, Stefan Rüping and Michael Mock (full paper)
- 12:00 to 12:40:
- Privacy-Preserving Record Linkage to Identify Fragmented Electronic Medical Records in the All of Us Research Program, Abel Kho, Jingzhi Yu, Molly Scannell Bryan, Charon Gladfelter, Howard Gordon, Shaun Grannis, Margaret Madden, Eneida Mendonca, Vesna Mitrovic, Raj Shah, Umberto Tachinardi and Bradley Taylor (short paper)
- Data integration for the development of a seismic loss prediction model for residential buildings in New Zealand, Samuel Roeslin, Quincy Ma and Joerg Wicker (short paper)
Lunch: 12:40 to 14:00
Lunch: 12:40 to 14:00
Session 2: 14:00 to 16:00
Session 2: 14:00 to 16:00
- 14:00 to 15:00:
- Industry keynote: Record Linkage At Amazon Scale Using Deep Siamese Networks, Yoni Lev and Grant Galloway, Amazon Development Centre, Edinburgh
- 15:00 to 16:00:
- Linking IT Product Records, Katsiaryna Mirylenka, Paolo Scotton, Christoph Miksovic and Salah-Eddine Bariol Alaoui (full paper)
- Pharos: Query-driven schema inference for the Semantic Web, David Haller and Richard Lenz (full paper)
Coffee break: 16:00 to 16:20
Coffee break: 16:00 to 16:20
Session 3: 16:20 to 18:00
Session 3: 16:20 to 18:00
- 16:20 to 17:20:
- Informativeness-Based Active Learning for Entity Resolution, Victor Christen, Erhard Rahm and Peter Christen (full paper)
- Encoding hierarchical classification codes for Privacy-preserving Record Linkage using Bloom filters, Rainer Schnell and Christian Borgs (full paper)
- 17:20 to 18:00:
- Wrap up and discussion