Publications (and patents)


2024

Vincent Jung and Lonneke van der Plas. 2024. Understanding the effects of language-specific class imbalance in multilingual fine-tuning. Findings of the Association for Computational Linguistics: EACL 2024



2023

Alessandro Miani, Lonneke van der Plas, and Adrian Bangerter. 2023. Loose and tight: Creative formation but rigid use of nominal compounds in conspiracist texts. The Journal of Creative Behavior. 


Molly Petersen and Lonneke van der Plas. 2023. Can language models learn analogical reasoning? Investigating training objectives and comparisons to human performance. EMNLP main papers.


Colin Layfield, René Zandbergen, Lisa Fagin Davis, John Abela, Claire Bowern, Michael Rosner, Lonneke van der Plas. 2023. International Conference on the Voynich Manuscript 2022. In proceedings of HistoCrypt. https://doi.org/10.3384/ecp195. 


Milind Agarwal, Sweta Agrawal, Antonios Anastasopoulos, Luisa Bentivogli, Ondřej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, Mingda Chen, William Chen, Khalid Choukri, Alexandra Chronopoulou, Anna Currey, Thierry Declerck, Qianqian Dong, Kevin Duh, Yannick Estève, Marcello Federico, et al.. 2023. FINDINGS OF THE IWSLT 2023 EVALUATION CAMPAIGN. In Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), pages 1–61, Toronto, Canada (in-person and online). Association for Computational Linguistics.


Aiden Williams, Kurt Abela, Rishu Kumar, Martin Bär, Hannah Billinghurst, Kurt Micallef, Ahnaf Mozib Samin, Andrea DeMarco, Lonneke van der Plas, and Claudia Borg. 2023. UM-DFKI Maltese Speech Translation. In Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), pages 433–441, Toronto, Canada (in-person and online). Association for Computational Linguistics.


2022

Andreas Marfurt, Ashley Thornton, David Sylvan, Lonneke van der Plas, and James Henderson. A Corpus and Evaluation for Predicting Semi-Structured Human Annotations. In Proceedings of the 2nd Workshop on Natural Language Generation, Evaluation, and Metrics (GEM), pages 262–275, Abu Dhabi, United Arab Emirates (Hybrid). Association for Computational Linguistics.


Kurt Micallef, Albert Gatt, Marc Tanti, Lonneke van der Plas and Claudia Borg

Pre-training Data Quality and Quantity for a Low-Resource Language: New Corpus and BERT Models for Maltese

In Proceedings of the workshop on Deep Learning for Low-Resource NLP


Inga Lang, Lonneke van der Plas, Malvina Nissim and Albert Gatt

Visually Grounded Interpretation of Noun-Noun Compounds in English

In Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics, Association for Computational Linguistics


Kevin Farrugia, Colin Layfield, Lonneke van der Plas

Demystifying the Scribes behind the Voynich Manuscript using Computational Linguistic Techniques.

Proceedings of the 1st International Conference on the Voynich Manuscript 2022


2021

Marc Tanti, Lonneke van der Plas, Claudia Borg, Albert Gatt

On the Language-specificity of Multilingual BERT and the Impact of Fine-tuning

In Proceedings of the Workshop on Analyzing and Interpreting Neural Networks for NLP (Blackbox NLP)


2020

Michele Loi, Eleonora Viganò, and Lonneke van der Plas

The societal and ethical relevance of computational Creativity

In Proceedings of the International Conference on Computational Creativity


Yunyao Li,  Carlos Diez Sanchez, and  Lonneke van der Plas

Patent P201900541US01: Using a joint distributional semantic system to correct redundant semantic verb frames

holder: IBM US


Patrick Ziering and Lonneke van der Plas 

Compound or phrase or in between? Testing Linguistic Criteria for Compoundhood. 

In Compounds Between Phrases and Words. Special issue Word Structure. 


Michele Loi and Lonneke van der Plas

A blind spot of AI ethics: anti-fragility in statistical prediction

Poster at the Swiss Conference on Data Science (SDS2020)

ELSI best paper award!!


Gianina Iordăchioaia, Lonneke van der Plas, Glorianna Jagfeld

Compositionality in English deverbal compounds: The role of the head

In Sabine Schulte im Walde  and Eva Smolka (eds.) The role of constituents in multiword expressions: An interdisciplinary, cross-lingual perspective. Phraseology and Multiword Expressions. Language Science Press.


Stavros Assimakopoulos, Rebecca Vella Muskat, Lonneke van der Plas, Albert Gatt

Annotating for Hate Speech: The MaNeCo Corpus and Some Input from Critical Discourse Analysis

Proceedings of The 12th Language Resources and Evaluation Conference


Carlos Daniel Hernandez Mena, Albert Gatt, Andrea DeMarco, Claudia Borg, Lonneke van der Plas, Amanda Muscat, Ian Padovani

MASRI-HEADSET: A Maltese Corpus for Speech Recognition

Proceedings of The 12th Language Resources and Evaluation Conference


Colin Layfield, Lonneke van der Plas, Michael Rosner, John Abela

Word Probability Findings in the Voynich Manuscript

Proceedings of LT4HALA 2020 - 1st Workshop on Language Technologies for Historical and Ancient Languages


Automatic Removal of Identifying Information in Official EU Languages for Public Administrations: The MAPA Project (Gianola, Lucie; Ajausks, Ēriks; Arranz, Victoria; Bendahman, Chomicha; Bié, Laurent; Borg, Claudia; Cerdà, Aleix; Choukri, Khalid; Cuadros, Montse; Gibert, Ona de; Degroote, Hans; Edelman, Elena; Etchegoyhen, Thierry; Torres, Ángela Franco; Hernandez, Mercedes García; Pablos, Aitor García; Gatt, Albert; Grouin, Cyril; Herranz, Manuel; Kohan, Alejandro Adolfo; Lavergne, Thomas; Melero, Maite; Paroubek, Patrick; Rigault, Mickaël; Rosner, Mike; Rozis, Roberts; Plas, Lonneke van der; Vīksna, Rinalds and Zweigenbaum, Pierre), In Proceedings of the 33rd International Conference on Legal Knowledge and Information Systems (JURIX'20), IOS Press, 2020.




2019

Prajit Dhar and Lonneke van der Plas

Learning to predict novel noun-noun-compounds

Proceedings of the Joint Workshop  on  Multiword  Expressions  and  WordNet (MWE-WN 2019).


Prajit Dhar, Janis Pagel and Lonneke van der Plas

Measuring the compositionality of noun-noun compounds over time

Proceedings of the 1st International Workshop on Computational Approaches to Historical Language Change 2019


2018

Agata Savary, Marie Candito, Verginica Barbu Mititelu, Eduard Bejček, Fabienne Cap, Slavomir Čéplö, Silvio Ricardo Cordeiro, Gülşen Eryiğit, Voula Giouli, Maarten van Gompel, Yaakov HaCohen-Kerner, Jolanta Kovalevskaitė, Simon Krek, Chaya Liebeskind, Johanna Monti, Carla Parra Escartín, Lonneke van der Plas, Behrang QasemiZadeh, Carlos Ramisch, Federico Sangati, Ivelina Stoyanova, Veronika Vincze

PARSEME multilingual corpus of verbal multiword expressions 

Chapter in Markantonatou, Stella, Carlos Ramisch, Agata Savary & Veronika Vincze (eds.). 2018. Multiword expressions at length and in depth: Extended papers from the MWE 2017 workshop Phraseology and Multiword Expressions 2). Berlin: Language Science Press.


Albert Gatt, Marc Tanti, Adrian Muscat, Patrizia Paggio, Reuben A. Farrugia, Claudia Borg, Kenneth P. Camilleri, Mike Rosner, Lonneke van der Plas

Face2Text: Collecting an Annotated Image Description Corpus for the Generation of Rich Face Descriptions

Proceedings of LREC


2017

M. Constant, G. Eryiğit, J. Monti, L. van der Plas, C. Ramisch, M. Rosner, A. Todariscu

Multiword Expression Processing: A Survey

Journal of Computational Linguistics December 2017, Vol. 43, No. 4, pp. 837–892


Glorianna Jagfeld, Patrick Ziering, Lonneke van der Plas

Evaluation of Compound Splitting Extrinsically with Textual Entailment

Proceedings of ACL


Hoa Trong Vu, Thuong-Hai Pham, Xiaoyu Bai Marc Tanti, Lonneke van der Plas, Albert Gatt

LCT-MALTA’s Submission to RepEval 2017 Shared Task 

Proceedings of the 2nd Workshop on Evaluating Vector-Space Representations for NLP, pages 56–60


2016

Jörg Tiedemann, Lonneke van der Plas

Bootstrapping a Dependency Parser for Maltese - A Real-World Test Case

Chapter in 'From Semantics to Dialectometry : Festschrift in honor of John Nerbonne', Ed. Martijn Weiling, Martin Kroon, Gertjan Van Noord


Gianina Iordachioaia, Lonneke van der Plas and Glorianna Jagfeld

The Grammar of Deverbal Compounds and their Meaning

GramLex @ Coling


Patrick Ziering and Lonneke van der Plas

Towards Unsupervised and Language-independent Compound Splitting using Inflectional Morphological Transformations

NAACL


Patrick Ziering, Stefan Müller and Lonneke van der Plas

Top a Splitter: Using Distributional Semantics for Improving Compound Splitting

Proceedings of the 12th ACL-Workshop on Multiword Expressions (MWE 2016)


2015


Glorianna Jagfeld and Lonneke van der Plas

Towards a better Semantic Role Labelling of Complex Predicates

NAACL Student Research Workshop


Patrick Ziering and Lonneke van der Plas

One tree is not enough:  Cross-lingual Accumulative Structure Transfer for Semantic Indeterminacy

Pages 739–746 of the Proceedings of RANLP


Patrick Ziering and Lonneke van der Plas

From a Distance: Using Cross-lingual Word Alignments for Noun Compound Bracketing 

IWCS


Ngoc Quan Pham and Lonneke van der Plas

Predicting cross-lingual pronouns with continuous word spaces

DiscoMT (@ EMNLP)


2014

Lonneke van der Plas, Marianna Apidianaki, and Chenhua Chen

Global methods for cross-lingual semantic role and predicate labelling

Coling


Patrick Ziering and Lonneke van der Plas

What good are ‘Nominalkomposita’ for ‘noun compounds’: Multilingual Extraction and Structure Analysis of Nominal Compositions using Linguistic Restrictors

Coling


Lonneke van der Plas and Marianna Apidianaki

Crosslingual word sense disambiguation for  predicate labelling of French

TALN


2013

Patrick Ziering, Lonneke van der Plas, Hinrich Schütze

Multilingual Lexicon Bootstrapping - Improving a Lexicon Induction System Using a Parallel Corpus

IJCNLP


Patrick Ziering, Lonneke van der Plas, Hinrich Schütze

Bootstrapping Semantic Lexicons for Technical Domains

IJCNLP


Joerg Tiedemann, Lonneke van der Plas, Begona Villada Morón.

Bitexts as Semantic Mirrors.

Workshop on Twenty Years of Bitext in connection with EMNLP 2013


2012

Sarah Cruchet, Celia Boyer, Lonneke van der Plas.

Trustworthiness and relevance in web-based clinical question answering.

in Health Informatics: Building a Healthcare Future Through Trusted Information. Stud Health Technol Inform.:180:863-7.


2011

Lonneke van der Plas, Paola Merlo and James Henderson

Scaling up Cross-Lingual Semantic Annotation Transfer

In Proceedings of ACL/HLT, Portland, US, pp 299-304.


Lonneke van der Plas, Jörg Tiedemann, and Jean-Luc Manguin 

Synonym acquisition across domains and languages

Chapter in V. Pallotta, A. Soro, and E. Vargiu, ed., Advances in Distributed Agent-based Retrieval Tools, Springer-Verlag, Berlin, pp 41-58.


Lonneke van der Plas, Jörg Tiedemann, and Ismail Fahmi

Automatic extraction of medical term variants from multilingual parallel translations

Chapter in A. van den Bosch, and G. Bouma, ed., Interactive Multi-modal Question Answering. Theory and Applications of Natural

Language Processing. Springer Verlag, Berlin, ISBN 978-3-642-17524-4, pp 149-170.


2010

Tanja Samardzic, Lonneke van der Plas,  Goljihan Kashaeva, and Paola Merlo

Variation in verbal predicates in English and French [pdf]

In Generative Grammar in Geneva (GG@G), Volume 6, pp 109 - 135.


Tanja Samardzic, Lonneke van der Plas,  Goljihan Kashaeva, and Paola Merlo

The Scope and the Sources of Variation in Verbal Predicates in English and French 

In Proceedings of the 9th International Workshop on Treebanks and Linguistic Theories, Tartu, Estonia.


Lonneke van der Plas, Jörg Tiedemann

Finding Medical Term Variations using Parallel Corpora and Distributional Similarity

In Proceedings of the Coling workshop on ontologies and lexical resources, Beijing, China.


Lonneke van der Plas, Tanja Samardzic, and Paola Merlo 

Cross-lingual Validity of PropBank in the Manual Annotation of French 

In Proceedings of the 4th Linguistic Annotation Workshop (The LAW IV), Uppsala, Sweden.


Lonneke van der Plas, Gosse Bouma, Jori Mur 

Automatic Acquisition of Lexico-semantic Knowledge for QA

Chapter in Chu-Ren Huang, ed.,  Ontology and the Lexicon, Studies in Natural Language Processing, 

Cambridge University Press, Cambridge, UK. pp 271--287

ISBN 978-0-521-88659-8.


Lonneke van der Plas, Jörg Tiedemann and Jean-Luc Manguin 

Automatic acquisition of synonyms for French using parallel corpora

In Proceedings of the 4th International Workshop on Distributed Agent-based Retrieval Tools, Geneva, Switzerland.


2009

Paola Merlo and Lonneke van der Plas

Abstraction and Generalisation in Semantic Role Labels: PropBank, VerbNet or both?

In Proceedings of ACL-IJCNLP, Singapore.


Cedric Boidin, Verena Rieser, Lonneke van der Plas, Oliver Lemon, Jonathan Chevelu

Predicting how it sounds: Re-ranking dialogue prompts based on TTS quality for adaptive Spoken Dialogue Systems

In the Interspeech special session on Machine Learning for Adaptivity in Spoken Dialogue Systems.


Lonneke van der Plas

Combining Syntactic Co-occurrences and Nearest Neighbours in Distributional Methods to Remedy Data Sparseness

In Proceedings of the NAACL workshop on Unsupervised and Minimally Supervised Learning of Lexical Semantics, Boulder, US.


Lonneke van der Plas, James Henderson, and Paola Merlo

Domain Adaptation with Artificial Data for Semantic Parsing of Speech

In Proceedings of NAACL, Boulder, US.

2008

Lonneke van der Plas

Automatic lexico-semantic acquisition for question answering (PhD thesis)

In the GRODIL series.

ISBN 978-90-367-3564-3.

Lonneke van der Plas and Jörg Tiedemann

Using Lexico-Semantic Information for Query Expansion in Passage Retrieval for Question Answering

In Coling 2008 Workshop: Information Retrieval for Question Answering, Manchester, UK.


Lonneke van der Plas, Jean-Luc Manguin and Jörg Tiedeman

Extraction de synonymes à partir d'un corpus multilingue aligné

In Actes des journées de linguistique de corpus, Lorient, France.


Jean-Luc Manguin, Lonneke van der Plas, Jörg Tiedemann

Le traitement automatique: un moteur pour l'évolution des dictionnaire de synonymes

In Actes du colloque "Lexicographie et informatique: bilan et perspectives, Nancy, France.


Lonneke van der Plas and Jörg Tiedeman

Finding Synonyms Automatically in Multilingual Parallel Corpora In Proceedings of the HERA Conference, Tallinn, Estonia. 

Bouma G., Kloosterman G., Mur J., van Noord G., van der Plas L., Tiedemann J. (2008) Question Answering with Joost at CLEF 2007. In: Peters C. et al. (eds) Advances in Multilingual and Multimodal Information Retrieval. CLEF 2007. Lecture Notes in Computer Science, vol 5152. Springer, Berlin, Heidelberg


2007

Ismail Fahmi, Gosse Bouma and Lonneke van der Plas

Using Multilingual Terms for Biomedical Term Extraction

In Proceedings of the RANLP Workshop on Acquisition and Management of Multilingual Lexicons , Borovetz, Bulgaria.


Bouma G., Fahmi I., Mur J., van Noord G., van der Plas L., Tiedemann J. (2007) Using Syntactic Knowledge for QA. In: Peters C. et al. (eds) Evaluation of Multilingual and Multi-modal Information Retrieval. CLEF 2006. Lecture Notes in Computer Science, vol 4730. Springer, Berlin, Heidelberg


2006

Lonneke van der Plas and Jörg Tiedemann

Finding Synonyms Using Automatic Word Alignment and Measures of Distributional Similarity

In Proceedings of ACL/Coling.


Jori Mur and Lonneke van der Plas

Anaphora Resolution for Off-line Answer Extraction using Instances

In Proceedings of the Workshop for Anaphora Resolution (WAR).


Gosse Bouma, Ismail Fahmi, Jori Mur, Gertjan van Noord, Lonneke van der Plas, Jörg Tiedemann

Linguistic Knowledge and Question Answering

In Traitement Automatique des Langues, vol 46(3), pp 15-39.


Bouma G., Mur J., van Noord G., van der Plas L., Tiedemann J. (2006) Question Answering for Dutch Using Dependency Relations. In: Peters C. et al. (eds) Accessing Multilingual Information Repositories. CLEF 2005. Lecture Notes in Computer Science, vol 4022. Springer, Berlin, Heidelberg


2005

Lonneke van der Plas and Gosse Bouma

Automatic Acquisition of Lexico-Semantic Knowledge for QA

In Proceedings of the IJCNLP workshop on Ontologies and Lexical Resources, Jeju Island, South Korea.


Lonneke van der Plas and Gosse Bouma

Syntactic Contexts for finding Semantically Similar Words

In Proceedings of CLIN 04.


2004

Lonneke van der Plas, Vincenzo Palotta, Martin Rajman and Hatem Ghorbel

Keyword Extraction from Spoken Text. A Comparison of Two Lexical Resources: EDR and WordNet

In Proceedings of LREC, volume VI, Lisbon, Portugal.


Fabio Rinaldi, James Dowdall, Michael Hess, Kaarel Kaljurand, Andreas Persidis, Babis Theodoulidis, Bill Black, John McNaught, Haralampos Karanikas, Argyris Vasilakopoulos, Kelly Zervanou, Luc Bernard, Gian Piero Zarri, Hilbert Bruins Slot, Chris van der Touw, Margaret Daniel-King, Nancy Underwood, Agnes Lisowska, Lonneke van der Plas, Veronique Sauron, Myra Spiliopoulou, Marko Brunzel, Jeremy Ellman, Giorgos Orphanos, Thomas Mavroudakis, Spiros Taraviras

Parmenides: an opportunity for ISO TC37 SC4? In the ACL workshop Workshop on Linguistic Annotation: Getting the Model Right Sapporo, Japan.