Jannik Strötgen - Publications
Book
Jannik Strötgen, Michael Gertz: Domain-Sensitive Temporal Tagging.
Synthesis Lectures on Human Language Technologies. Morgan & Claypool Publishers, 2016.
This book covers the topic of temporal tagging, the detection of temporal expressions and the normalization of their semantics to some standard format. It places a special focus on the challenges and opportunities of domain-sensitive temporal tagging. After providing background knowledge on the concept of time, the book continues with [link]
Theses
[pdf] Jannik Strötgen: Domain-sensitive Temporal Tagging for Event-centric Information Retrieval. PhD thesis, Institute of Computer Science, Heidelberg University, 2015.
[pdf] Jannik Strötgen: UTEMPL - Aufbau und Evaluierung einer UIMA-basierten Textmining Pipeline für biomedizinische Literatur. Magisterarbeit, Department of Computational Linguistics, Heidelberg University, 2009.
Publications
[Google Scholar] [DBLP] [ACM] [ACL] [MPG PuRe] [Semantic Scholar] [arXiv]
2024
Better Call SAUL: Fluent and Consistent Language Model Editing with Generation Regularization. Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schütze. Findings of EMNLP'24. [arXiv]
Learn it or Leave it: Task Representation-Guided Module Composition and Pruning for Continual Learning. Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schütze. RepL4NLP'24. [pdf] [arXiv]
Rehearsal-Free Modular and Compositional Continual Learning for Language Models. Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schütze. NAACL'24. [pdf] [arXiv]
Discourse-Aware In-Context-Learning for Temporal Expression Normalization. Akash Kumar Gautam, Lukas Lange, Jannik Strötgen. NAACL'24. [pdf] [arXiv]
2023
GradSim: Gradient-Based Language Grouping for Effective Multilingual Training. Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schütze. EMNLP'23. [pdf] [arXiv]
TADA – Efficient Task-Agnostic Domain Adaptation for Transformers. Chia-Chien Hung, Lukas Lange, Jannik Strötgen. Findings of the ACL'23. [pdf] [arXiv]
NLNDE at SemEval-2023 Task 12: Adaptive Pretraining and Source Language Selection for Low-Resource Multilingual Sentiment Analysis. Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schütze. SemEval@ACL'23. [pdf] [arXiv]
Multilingual Normalization of Temporal Expressions with Masked Language Models. Lukas Lange, Jannik Strötgen, Heike Adel, Dietrich Klakow. EACL'23. [pdf] [arXiv]
2022
Three Real-World Datasets and Neural Computational Models for Classification Tasks in Patent Landscaping. Subhash C. Pujari, Jannik Strötgen, Mark Giereth, Michael Gertz, Annemarie Friedrich. EMNLP'22. [pdf]
CLIN-X: pre-trained language models and a study on cross-task transfer for concept extraction in the clinical domain. Lukas Lange, Heike Adel, Jannik Strötgen, Dietrich Klakow. Oxford Bioinformatics. [publisher version]
A Study on Entity Linking Across Domains: Which Data is Best for Fine-Tuning? Hassan Soliman, Heike Adel, Mohamed H. Gad-Elrab, Dragan Milchevski, Jannik Strötgen. RepL4NLP'22. [pdf]
Evaluating Neural Multi-Field Document Representations for Patent Classification. Subhash Chandra Pujari, Fryderyk Mantiuk, Mark Giereth, Jannik Strötgen, Annemarie Friedrich. BIR'22. [pdf]
Enhancing Knowledge Bases with Quantity Facts. Vinh Thinh Ho, Daria Stepanova, Dragan Milchevski, Jannik Strötgen, Gerhard Weikum. WWW'22. [pdf]
2021
To Share or not to Share: Predicting Sets of Sources for Model Transfer Learning. Lukas Lange, Jannik Strötgen, Heike Adel, Dietrich Klakow. EMNLP'21. [pdf] [arxiv] [code]
FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations. Lukas Lange, Heike Adel, Jannik Strötgen, Dietrich Klakow. EMNLP'21. [pdf] [arxiv] [code]
RobertNLP at the BioCreative VII - LitCovid track: Neural Document Classification Using SciBERT. Subhash Pujari, Tim Tarsi, Annemarie Friedrich, Jannik Strötgen. BioCreative'21. [pdf]
Boosting Transformers for Job Expression Extraction and Classification in a Low-Resource Setting. Lukas Lange, Heike Adel, Jannik Strötgen. IberLEF'21. [pdf] [arxiv]
A Survey on Recent Approaches for Natural Language Processing in Low-Resource Scenarios. Michael A. Hedderich, Lukas Lange, Heike Adel, Jannik Strötgen, Dietrich Klakow. NAACL'21. [pdf] [arxiv]
A Multi-Task Approach to Neural Multi-Label Hierarchical Patent Classification using Transformers. Subhash Pujari, Annemarie Friedrich, Jannik Strötgen. ECIR'21. [author version] [publisher version] [code]
2020
Adversarial Learning of Feature-based Meta-Embeddings. Lukas Lange, Heike Adel, Jannik Strötgen, Dietrich Klakow. arXiv 2020. [arxiv]
NLNDE at CANTEMIST: Neural Sequence Labeling and Parsing Approaches for Clinical Concept Extraction. Lukas Lange, Xiang Dai, Heike Adel, Jannik Strötgen. IberLEF@SEPLN'20. [pdf] [arxiv]
Closing the Gap: Joint De-Identification and Concept Extraction in the Clinical Domain. Lukas Lange, Heike Adel, Jannik Strötgen. ACL'20. [pdf] [arxiv]
Adversarial Alignment of Multilingual Models for Extracting Temporal Expressions from Text. Lukas Lange, Anastasiia Iurshina, Heike Adel, Jannik Strötgen. RepL4NLP@ACL'20. [pdf] [arxiv]
On the Choice of Auxiliary Languages for Improved Sequence Tagging. Lukas Lange, Heike Adel, Jannik Strötgen. RepL4NLP@ACL'20. [pdf] [arxiv]
Fast Computation of Explanations for Inconsistency in Large-Scale Knowledge Graphs. Trung Kien Tran, Mohamed H. Gad-Elrab, Daria Stepanova, Evgeny Kharlamov, Jannik Strötgen. WWW'20. [pdf] [ACM]
2019
"A Buster Keaton of Linguistics": First Automated Approaches for the Extraction of Vossian Antonomasia. Michel Schwab, Robert Jäschke, Frank Fischer, Jannik Strötgen. EMNLP'19. [pdf][ACL]
NLNDE: Enhancing Neural Sequence Taggers with Attention and Noisy Channel for Robust Pharmacological Entity Detection. Lukas Lange, Heike Adel, Jannik Strötgen. BioNLP-OST@EMNLP'19. [pdf][ACL]
Towards the Bosch Materials Science Knowledge Base. Jannik Strötgen, Trung Kien Tran, Annemarie Friedrich, Dragan Milchevski, Federico Tomazic, Anika Marusczyk, Heike Adel, Daria Stepanova, Felix Hildebrand, Evgeny Kharlamov. ISWC'19 Industry Track. [pdf]
NLNDE: The Neither-Language-Nor-Domain-Experts' Way of Spanish Medical Document De-Identification. Lukas Lange, Heike Adel, Jannik Strötgen. IberLEF@SEPLN'19. [pdf]
winner of shared task
Generating Semantic Aspects for Queries. Dhruv Gupta, Klaus Berberich, Jannik Strötgen, Demetrios Zeinalipour-Yazti. ESWC'19. [pdf]
The Power of Temporal Features for Classifying News Articles. Lukas Lange, Omar Alonso, Jannik Strötgen. TempWeb@WWW'19. [pdf] [ACM]
Epitaph or Breaking News? Analyzing and Predicting the Stability of Knowledge Base Properties. Ioannis Dikeoulias, Jannik Strötgen, Simon Razniewski. TempWeb@WWW'19. [pdf] [ACM]
2018
[pdf] Zhen Jia, Abdalghani Abujabal, Rishiraj Saha Roy, Jannik Strötgen, Gerhard Weikum: TEQUILA: Temporal Question Answering over Knowledge Bases. CIKM'18.
[pdf] Prabal Agarwal, Jannik Strötgen, Luciano del Corro, Johannes Hoffart, Gerhard Weikum: diaNED: Time-Aware Named Entity Disambiguation for Diachronic Corpora. ACL'18.
[pdf] Jannik Strötgen, Rosita Andrade, Dhruv Gupta: Putting Dates on the Map: Harvesting and Analyzing Street Names with Date Mentions and their Explanations. JCDL'18.
[pdf] Dhruv Gupta, Klaus Berberich, Jannik Strötgen, Demetrios Zeinalipour-Yazti: Generating Semantic Aspects for Queries. JCDL'18.
[pdf] Jannik Strötgen, Anne-Lyse Minard, Lukas Lange, Manuela Speranza, Bernardo Magnini: KRAUTS: A German Temporally Annotated News Corpus. LREC'18.
[pdf] Kashyap Popat, Subhabrata Mukherjee, Jannik Strötgen, Gerhard Weikum: CredEye: A Credibility Lens for Analyzing and Explaining Misinformation. WWW'18.
[pdf] Zhen Jia, Abdalghani Abujabal, Rishiraj Saha Roy, Jannik Strötgen, Gerhard Weikum: TempQuestions: A Benchmark for Temporal Question Answering. HQA@WWW'18.
[pdf] Andreas Spitz, Jannik Strötgen, Michael Gertz: Predicting Document Creation Times in News Citation Networks. TempWeb@WWW'18.
[pdf] Evelyn Gius, Nils Reiter, Jannik Strötgen, Marcus Willand: SANTA: Systematische Analyse Narrativer Texte durch Annotation. DHd'18.
2017
[pdf] Stefania Degaetano-Ortlieb, Jannik Strötgen: Diachronic Variation of Temporal Expressions in Scientific Writing through the Lens of Relative Entropy. GSCL'17.
[pdf] Nils Reiter, Evelyn Gius, Jannik Strötgen, Marcus Willand: A Shared Task for a Shared Goal: Systematic Annotation of Literary Texts. DH'17.
[pdf] Natalia Boldyrev, Marc Spaniol, Jannik Strötgen, Gerhard Weikum: SESAME: European Statistics Explored via Semantic Alignment onto Wikipedia. WWW'17.
[pdf] Rosita Andrade, Jannik Strötgen: All Dates Lead to Rome: Extracting and Explaining Temporal References in Street Names. WWW'17.
[pdf] Prabal Agarwal, Jannik Strötgen: Tiwiki: Searching Wikipedia with Temporal Constraints. TempWeb@WWW'17.
[pdf] Kashyap Popat, Subhabrata Mukherjee, Jannik Strötgen, Gerhard Weikum: Where the Truth Lies: Explaining the Credibility of Emerging Claims on the Web and Social Media. WWW'17 (Web Science Track).
[pdf] Robert Jäschke, Jannik Strötgen, Elena Krotova, Frank Fischer: "Der Helmut Kohl unter den Brotaufstrichen". Zur Extraktion vossianischer Antonomasien aus großen Zeitungskorpora. DHd'17.
[pdf] Jannik Strötgen: Multilingual and Domain-sensitive Temporal Tagging with HeidelTime. DGfS'17 (Computational linguistics poster session).
2016
[pdf] Jannik Strötgen: Domänen-sensitives Temporal Tagging für Event-zentriertes Information Retrieval. Ausgezeichnete Informatikdissertationen 2015. Lecture Notes in Informatics, GI-Edition, 2016. (invited paper)
[pdf] Kashyap Popat, Subhabrata Mukherjee, Jannik Strötgen, Gerhard Weikum: Credibility Assessment of Textual Claims on the Web. CIKM'16.
[pdf] Dhruv Gupta, Jannik Strötgen, Klaus Berberich: EventMiner: Mining Events from Annotated Documents. ICTIR'16.
[pdf] Dhruv Gupta, Jannik Strötgen, Klaus Berberich: DigitalHistorian: Search & Analytics Using Annotations. HistoInformatics@DH'16.
[link] Thomas Bögel, Evelyn Gius, Janina Jacke, Jannik Strötgen: From Order to Order Switch. Mediating between Complexity and Reproducibility in the Context of Automated Literary Annotation. DH'16.
[pdf] Leon Derczynski, Jannik Strötgen, Diana Maynard, Mark A. Greenwood, Manuel Jung: GATE-Time: Extraction of Temporal Expressions and Events. LREC'16.
[pdf] Erdal Kuzey, Jannik Strötgen, Vinay Setty, Gerhard Weikum: Temponym Tagging: Temporal Scopes for Textual Phrases. TempWeb@WWW'16.
[pdf] Erdal Kuzey, Vinay Setty, Jannik Strötgen, Gerhard Weikum: As Time Goes By: Comprehensive Tagging of Textual Phrases with Temporal Scopes. WWW'16.
2015
[url] Thomas Bögel, Michael Gertz, Evelyn Gius, Janina Jacke, Jan-Christoph Meister, Marco Petris, Jannik Strötgen: Collaborative Text Annotation Meets Machine Learning: heureCLÉA, a Digital Heuristic of Narrative. DHCommons journal, 2015.
[pdf] Johanna Geiß, Andreas Spitz, Jannik Strötgen, Michael Gertz: The Wikipedia Location Network - Overcoming Borders and Oceans. GIR'15.
[pdf] Frank Fischer, Jannik Strötgen: Un calendario de la literatura española (aplicación para Android e iOS). HDH'15.
best long submission award
[pdf] Leon Derczynski, Jannik Strötgen, Ricardo Campos, Omar Alonso: Time and Information Retrieval: Introduction to the Special Issue. Information Processing & Management (IPM), 2015. Elsevier.
[pdf] Jannik Strötgen, Michael Gertz: A Baseline Temporal Tagger for all Languages. EMNLP'15.
[pdf] Thomas Bögel, Jannik Strötgen, Michael Gertz: A Hybrid Approach to Extract Temporal Signals from Narratives. GSCL'15.
[link] Evelyn Gius, Janina Jacke, Jan-Christoph Meister, Thomas Bögel, Jannik Strötgen: Beyond Pragmatics: Disciplinary Profits of Interdisciplinary Approaches. DH'15.
[pdf] Frank Fischer, Jannik Strötgen: When Does German Literature Take Place? - On the Analysis of Temporal Expressions in Large Corpora. DH'15.
[pdf] Bilel Moulahi, Jannik Strötgen, Michael Gertz, Lynda Tamine-Lechani: HeidelToul: A Baseline Approach for Cross-document Event Ordering. SemEval@NAACL'15.
[pdf] Andreas Spitz, Jannik Strötgen, Thomas Bögel, Michael Gertz: Terms in Time and Times in Context: A Graph-based Term-Time Ranking Model. TempWeb@WWW'15.
[pdf] Frank Fischer, Jannik Strötgen: Wann findet die deutsche Literatur statt? Zur Untersuchung von Zeitausdrücken in großen Korpora. DHd'15.
[pdf] Thomas Bögel, Michael Gertz, Evelyn Gius, Janina Jacke, Jan-Christoph Meister, Marco Petris, Jannik Strötgen: Gleiche Textdaten, unterschiedliche Erkenntnisziele? Zum Potential vermeintlich widersprüchlicher Zugänge zu Textanalyse. DHd'15.
[pdf] Thomas Bögel, Marco Petris, Jannik Strötgen, Michael Gertz: An End-to-End Integration of Automatic Annotations into CATMA. DHd'15.
2014
[pdf] Giulio Manfredi, Jannik Strötgen, Julian Zell, Michael Gertz: HeidelTime at EVENTI: Tuning Italian Resources and Addressing TimeML's Empty Tags. EVALITA@AI*IA'14.
winner of the temporal tagging subtask
[pdf] Thomas Bögel, Jannik Strötgen, Michael Gertz: Computational Narratology: Extracting Tense Clusters from Narrative Texts. LREC'14.
[pdf] Jannik Strötgen, Thomas Bögel, Julian Zell, Ayser Armiti, Tran Van Canh, Michael Gertz: Extending HeidelTime for Temporal Expressions Referring to Historic Dates. LREC'14.
[pdf] Hui Li, Jannik Strötgen, Julian Zell, Michael Gertz: Chinese Temporal Tagging with HeidelTime. EACL'14.
[pdf] Jannik Strötgen, Ayser Armiti, Tran Van Canh, Julian Zell, Michael Gertz: Time for More Languages: Temporal Tagging of Arabic, Italian, Spanish, and Vietnamese. Transactions on Asian Language Information Processing (TALIP), 2014. ACM.
[pdf] Thomas Bögel, Jannik Strötgen, Christoph Mayer, Michael Gertz: A Flexible NLP Pipeline for Computational Narratology. DHd'14.
2013
[pdf] Jannik Strötgen, Michael Gertz: Proximity²-aware Ranking for Textual, Temporal, and Geographic Queries. CIKM'13. [extended version]
[pdf] Jannik Strötgen, Michael Gertz: Multilingual and Cross-domain Temporal Tagging. Language Resources and Evaluation, 2013. Springer.
[pdf] Jannik Strötgen, Julian Zell, Michael Gertz: HeidelTime: Tuning English and Developing Spanish Resources for TempEval-3. SemEval@NAACL'13.
winner of temporal tagging subtask for English and Spanish
[pdf] Christian Kapp, Jannik Strötgen, Michael Gertz: EvenPers: Event-based Person Exploration and Correlation. BTW'13.
2012
[pdf] Britta Keller, Jannik Strötgen, Michael Gertz: Event-centric Document Similarity for Biomedical Literature. SMBM'12.
[pdf] Jannik Strötgen, Michael Gertz: Event-centric Search and Exploration in Document Collections. JCDL'12.
nominated for Best Student Paper
[pdf] Jannik Strötgen, Michael Gertz: Temporal Tagging on Different Domains: Challenges, Strategies, and Gold Standards. LREC'12.
[pdf] Jannik Strötgen, Omar Alonso, Michael Gertz: Identification of Top Relevant Temporal Expressions in Documents. TempWeb@WWW'12.
[pdf] Jannik Strötgen, Omar Alonso, Michael Gertz: Retro: Time-based Exploration of Product Reviews. ECIR'12.
2011
[pdf] Jannik Strötgen, Michael Gertz: WikiWarsDE: A German Corpus of Narratives Annotated with Temporal Expressions. GSCL'11.
[pdf] Jannik Strötgen, Michael Gertz, Conny Junghans: An Event-centric Model for Multilingual Document Similarity. SIGIR'11.
[pdf] Omar Alonso, Jannik Strötgen, Ricardo Baeza-Yates, Michael Gertz: Temporal Information Retrieval: Challenges and Opportunities. TempWeb@WWW'11.
2010
[pdf] Jannik Strötgen, Michael Gertz: TimeTrails: A System for Exploring Spatio-Temporal Information in Documents. VLDB'10.
[pdf] Jannik Strötgen, Michael Gertz: HeidelTime: High Quality Rule-based Extraction and Normalization of Temporal Expressions. SemEval@ACL'10.
winner of temporal tagging subtask for English
[pdf] Jannik Strötgen, Michael Gertz, Pavel Popov: Extraction and Exploration of Spatio-Temporal Information in Documents. GIR'10.
2009
[pdf] Jannik Strötgen, Juliane Fluck, Anke Holler: Dependenz-basierte Relationsextraktion mit der UIMA-basierten Text-Mining Pipeline UTEMPL. GSCL'09.