Citation Extraction and Parsing

Final programme below

28./29. May 2026 | Frankfurt a. M. (Germany)

About the Workshop

Registration

Workshop programme

Topics

Participation and Participation fees

Submission guidelines

About the Workshop

Accurate and open citation data are essential foundations for transparent, reproducible, and networked scholarship. The Workshop on Citation Extraction and Parsing (CiteX 2026) provides an interdisciplinary forum for researchers, developers, and practitioners to explore current advances in the automated identification, structuring, and dissemination of bibliographic references.

Building upon the growing momentum around open citations and initiatives such as WikiCite, WOOC, and the Frankfurt Workshop series, CiteX 2026 aims to foster dialogue across communities working on both the methodological and infrastructural aspects of citation data. The workshop welcomes contributions addressing technical innovations—ranging from traditional rule-based and machine-learning methods to state-of-the-art approaches using large language models (LLMs)—as well as practical and conceptual discussions about the creation, quality, and integration of open citation datasets.

CiteX 2026 invites participation from all disciplines interested in the extraction, parsing, and reuse of citation information. By bringing together perspectives from research, infrastructure, and applied contexts, the workshop seeks to strengthen collaboration, identify shared challenges, and promote the development of interoperable and openly accessible citation ecosystems.

Registration

Registration is now open! Registrate via https://t1p.de/p6e0s.

Workshop programme

The workshop proceedings are available here: Proceedings of the Workshop on Citation Extraction and Parsing (CiteX)

Topics

Automated extraction and parsing of references
Creation and sharing of gold standards and test datasets
Standardization and interoperability of citation data
Quality assessment and validation of extracted references
Provision and integration of open citation data into repositories and search systems
Citation practices across disciplines
Data linking between scholarly works, datasets, and other research outputs
Annotation and enrichment of citation data
Prompt engineering and fine-tuning of LLMs (e.g., GPT-4, LLaMA) for citation tasks
Comparison of LLM-based and tool-based (e.g., GROBID, Anystyle, Cermine) extraction pipelines
In-text citation extraction and context analysis using LLMs
Use of open web search APIs or LLMs for source retrieval

Participation and Participation fees

Participation is as well possible without contribution. We will charge 36 Euro for on-site-participation to cover the costs for organisation. Online participation is free of charge.

Submission guidelines

Please submit an extended abstract (1250-1500 words excl. references) for any contribution format.
Only submissions in English will be considered.
Accepted extended abstract will be published under a Zenodo community.
Please indicate in the submission form, if you are interested in a Special Issue.

Important Dates

Submission deadline: 01. February 2026
Camera-ready version: 31.03.2026
Workshop dates: 28./ 29. May 2026

Venue

The Workshop will take place at DIPF | Leibniz Institute of Research and Information in Education, Rostocker Straße 6, 60323 Frankfurt a. M.

Submission & Contact

Please submit your contribution via Google Forms: Submission – Workshop on Citation Extraction and Parsing (CiteX 2026)
For any question, please contact us via e-mail: workshop.cep.2026@gmail.com

Programme Committee

Tamara Heck (Leibniz Institute for Research and Information in Education)
Angelo Di Iorio (University of Bologna)
Philipp Mayr-Schlegel (Leibniz Institute for the Social Sciences)
Marta Soricetti (University of Bologna)
Matteo Romanello (Odoma)
Silvio Peroni (OpenCitations)

Christoph Schindler (Leibniz Institute for Research and Information in Education)
Stephan Stahlschmidt (German Centre for Higher Education Research and Science Studies)
Christian Boulanger (Max Planck Society for the Advancement of Science)
Andreas Wagner (Max Planck Society for the Advancement of Science)

Daniel Mietchen (FU Berlin)
Muhammad Ahsan Shahid (GESIS)
Verena Weimer (Leibniz Institute for Research and Information in Education)

Page updated

Google Sites

Report abuse