Call for Papers

Data are at the core of research in many domains outside of computer science, such as healthcare, social sciences, and business. Combining diverse sources of data provides potentially very useful and powerful data, but it is also a challenging research problem. There are a multitude of challenges in data integration: the data collections to be integrated may come from different sources; the collections may have been created by different groups; their characteristics can be different (different schema, different data types); and the data may contain duplicates. Solving these challenges requires substantial effort and domain experts need to be involved. In the era of Big Data, with organizations scaling up the volume of their data, it is critical to develop new and scalable approaches to deal with all these challenges. In addition, it is important to properly assess the quality of the source data as well as the integrated data. As a consequence, the quality of the source data will drive the methods needed for its integration. Data integration is an important phase in the KDD process, by creating new and enriched records from a multitude of sources. These new records can be queried, searched, mined and analyzed for discovering new, interesting and useful patterns.

The goal of this workshop is to bring together computer scientists with researchers from other domains and practitioners from businesses and governments to present and discuss current research directions on multi source data integration and its application. The workshop will provide a forum for original high-quality research papers on record linkage, data integration, population informatics, mining techniques of integrated data, and applications, as well as multidisciplinary research opportunities.

Topics of interest include (but are not limited to):

Data Integration Methodologies

  • Automating data cleaning and pre-processing
  • Algorithms and techniques for data integration
  • Entity resolution, record linkage, data matching, and duplicate detection
  • Big Data integration
  • Integrating complex data

Evaluation, Quality and Privacy

  • Evaluation of linkage/matching/data integration methods
  • Data quality evaluation for source data and/or integrated data
  • Bias and quality of longitudinal data
  • Preserving privacy in data integration

Population Informatics

  • Algorithms and techniques for managing, processing, analyzing, and mining large population databases
  • Requirements analysis for population informatics
  • Models and algorithms for population informatics
  • Architectures and frameworks for population informatics
  • Research case studies of population informatics in health, demographics, ecology, economics, the social sciences, and other research domains

Integrated Data and Longitudinal Data Applications

  • Mining and analysis of longitudinal data
  • Applications of population informatics in governments and businesses
  • Data integration applications for healthcare, social sciences, digital humanities, bioinformatics, genomics, etc.

Key Dates (All deadlines are at AoE time zone (UTC - 12))

  • Due date for workshop papers: Sunday, June 16, 2019
  • Notification of workshop papers acceptance to authors: Friday, July 19, 2019
  • Camera-ready deadline for accepted papers: Monday, July 26, 2019
  • Workshop date: September 20, 2019

Paper Submission and Publications

Papers submitted to this workshop must not be under review or accepted for publication elsewhere. All submitted papers will be reviewed and selected by the program committee on the basis of originality, technical quality, relevance to the workshop and presentation quality. Accepted papers will be included in the LNCS proceedings.

Papers must be written in English and formatted according to the Springer LNCS guidelines. Author instructions, style files and the copyright form can be downloaded here. The maximum length of papers is 16 pages (including references) in this format. Short papers of up-to eight pages are welcome.

All papers should be submitted via the workshop submission system. Detailed instructions and submission link are available on the workshop Submission page.

Conference Attendance

For each accepted paper, at least one author must register and attend the conference and present the paper. Please make sure to make early travel arrangements and take care of possible immigration requirements (e.g., visa).

Proceedings

The conference proceedings will be published by Springer in the Lecture Notes in Computer Science Series (LNCS). The proceedings will be published after the conference and will only include papers that were presented at the conference. Online versions of the papers will be available at the time of the conference.

Workshop Organizers:

Luiza Antonie, University of Guelph, Canada

Peter Christen, The Australian National University, Australia

Erhard Rahm, University of Leipzig, Germany

Osmar Zaïane, University of Alberta, Canada