Publications

(Under permanent reconstruction...)

Profile on Google Scholar

2024

Benjamin Minixhofer, Edoardo Maria Ponti, and Ivan Vulić,. Zero-Shot Tokenizer Transfer. CoRR, abs/2405.07883, May 2024. [Project Website]

Yaoyiran Li, Anna Korhonen, and Ivan Vulić. Self-Augmented In-Context Learning for Unsupervised Word Translation. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), to appear, July 2024

Han Zhou,* Xingchen Wan,* Ivan Vulić, and Anna Korhonen. AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning. Transactions of the Association for Computational Linguistics (TACL), 2024. *equal contribution

Evgeniia Razumovskaia, Goran Glavaš, Anna Korhonen, and Ivan Vulić. SQATIN: Supervised Instruction Tuning Meets Question Answering for Improved Dialogue NLU. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2024), June 2024.

Chen Cecilia Liu, Jonas Pfeiffer, Ivan Vulić, and Iryna Gurevych. FUN with Fisher: Improving Generalization of Adapter-Based Cross-lingual Transfer with Scheduled Unfreezing. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2024), June 2024

Songbo Hu, Xiaobin Wang, Zhangdie Yuan, Anna Korhonen, and Ivan Vulić. DiaLight: Lightweight Multilingual Development and Evaluation of Task-Oriented Dialogue Systems with Large Language Models. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - System Demonstrations (NAACL-HLT 2024), June 2024. [Project Website]

Songbo Hu, Ivan Vulić, Fangyu Liu, and Anna Korhonen. Reranking Overgenerated Responses for End-to-End Task-Oriented Dialogue Systems. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024.

Marinela Parović, Ivan Vulić, and Anna Korhonen. Investigating the Potential of Task Arithmetic for Cross-Lingual Transfer. In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), Mar 2024. 

Evgeniia Razumovskaia, Ivan Vulić, and Anna Korhonen. Analyzing and Adapting Large Language Models for Few-Shot Multilingual NLU: Are We There Yet CoRR, abs/2403.01929, Mar 2024.

Alan Ansell,  Ivan Vulić, Hannah Sterz, Anna Korhonen, and Edoardo Maria Ponti. Scaling Sparse Fine-Tuning to Large Language Models. CoRR, abs/2401.16405, Jan 2024.

Paweł Budzianowski, Taras Sereda, Tomasz Cichy, and Ivan Vulić. Pheme: Efficient and Conversational Speech Generation. CoRR, abs/2401.02839, Jan 2024. [Project Website]


2023

Jonas Pfeiffer, Sebastian Ruder, Ivan Vulić, and Edoardo Maria Ponti. Modular Deep Learning. Transactions on Machine Learning Research (TMLR), Dec 2023. [Project Website]

Songbo Hu, Han Zhou, Zhangdie Yuan, Milan Gritta, Guchun Zhang, Ignacio Iacobacci, Anna Korhonen, and Ivan Vulić. A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), Dec 2023.

Yaoyiran Li, Anna Korhonen, and Ivan Vulić. On Bilingual Lexicon Induction with Large Language Models. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), Dec 2023.

Benjamin Minixhofer, Jonas Pfeiffer, and Ivan Vulić. CompoundPiece: Evaluating and Improving Decompounding Performance of Language Models. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), Dec 2023.

Alan Ansell, Marinela Parović, Ivan Vulić, Anna Korhonen, and Edoardo Maria Ponti. Unifying Cross-Lingual Transfer across Scenarios of Resource Scarcity. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), Dec 2023.

Evgeniia Razumovskaia, Ivan Vulić, and Anna Korhonen. Transfer-Free Data-Efficient Multilingual Slot Labeling. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), Dec 2023.

Han Zhou, Xingchen Wan, Ivan Vulić, and Anna Korhonen. Survival of the Most Influential Prompts: Efficient Black-Box Prompt Search via Clustering and Pruning. In Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), Dec 2023.

Anjali Kantharuban, Ivan Vulić, and Anna Korhonen. Quantifying the Dialect Gap in Large Language Models and its Correlates Across Languages. In Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), Dec 2023.

Fabian David Schmidt, Ivan Vulić, and Goran Glavaš. One For All & All For One: Bypassing Hyperparameter Tuning with Model Averaging for Cross-Lingual Transfer. In Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), Dec 2023.

Sukannya Purkayastha, Sebastian Ruder, Jonas Pfeiffer, Iryna Gurevych, and Ivan Vulić. Romanization-based Large-scale Adaptation of Multilingual Language Models.  In Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), Dec 2023.

Clifton Poth, Hannah Sterz, Indraneil Paul, Sukannya Purkayastha, Leon Engländer, Timo Imhof, Ivan Vulić, Sebastian Ruder, Iryna Gurevych, and Jonas Pfeiffer. Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: System Demonstrations (EMNLP 2023), Dec 2023.

Songbo Hu,* Han Zhou,* Mete Hergul, Milan Gritta, Guchun Zhang, Ignacio Iacobacci, Ivan Vulić,** and Anna Korhonen.** Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems. Transactions of the Association for Computational Linguistics (TACL). *equal contribution **equal senior contribution

Benjamin Minixhofer, Jonas Pfeiffer, and Ivan Vulić. Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), July 2023

Fabian David Schmidt, Ivan Vulić, and Goran Glavaš. Free Lunch: Robust Cross-Lingual Transfer via Model Checkpoint Averaging. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), July 2023

Yaoyiran Li, Ching-Yun Chang, Stephen Rawls, Ivan Vulić, and Anna Korhonen. Translation-Enhanced Multilingual Text-to-Image Generation. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), July 2023. 

Alan Ansell, Edoardo Maria Ponti, Anna Korhonen, and Ivan Vulić. Distilling Efficient Language-Specific Models for Cross-Lingual Transfer. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), July 2023. 

Marinela Parović, Alan Ansell, Ivan Vulić, and Anna Korhonen. Cross-Lingual Transfer with Target Language-Ready Task Adapters. In Findings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), July 2023. 

Nikita Moghe,* Evgeniia Razumovskaia,* Liane K. Guillou, Ivan Vulić, Anna Korhonen, and Alexandra Birch. Multi3NLU++: A Multilingual, Multi-Intent, Multi-Domain Dataset for Natural Language Understanding in Task-Oriented Dialogue. In Findings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), July 2023. *equal contribution

Guangzhi Sun, Chao Zhang, Ivan Vulić, Paweł Budzianowski, and Philip C. Woodland. Knowledge-Aware Audio-Grounded Generative Slot Filling for Limited Annotated Data. CoRR, abs/2307.01764, July 2023.

Ivan Vulić, Goran Glavaš, Fangyu Liu, Nigel Collier, Edoardo Maria Ponti, and Anna Korhonen. Probing Cross-Lingual Lexical Knowledge from Multilingual Sentence Encoders. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023), May 2023.

Zhangdie Yuan,* Songbo Hu,* Ivan Vulić, Anna Korhonen and Zaiqiao Meng. Can Pretrained Language Models (Yet) Reason Deductively? In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023), May 2023. equal contribution

Chen Cecilia Liu, Jonas Pfeiffer, Anna Korhonen, Ivan Vulić, and Iryna Gurevych. Delving Deeper into Cross-lingual Visual Question Answering. In Findings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023), May 2023.

Vésteinn Snæbjarnarson, Annika Simonsen, Goran Glavaš, and Ivan Vulić. Transfer to a Low-Resource Language via Close Relatives: The Case Study on Faroese. In Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa 2023), May 2023.

Olga Majewska,* Evgeniia Razumovskaia,* Edoardo Maria Ponti, Ivan Vulić, and Anna Korhonen. Cross-Lingual Dialogue Dataset Creation via Outline-Based Generation. Transactions of the Association for Computational Linguistics (TACL), volume 11, Jan 2023. [Code and Data] *equal contribution


2022

Olga Majewska,* Ivan Vulić,* and Anna Korhonen.* Linguistically Guided Multilingual NLP: Current Approaches, Challenges, and Future Perspectives. Book chapter in Algebraic Structures in Natural Language, Dec 2022. *equal contribution

Ivan Vulić, Iñigo Casanueva, Georgios Spithourakis, Avishek Mondal, Tsung-Hsien Wen, and Paweł Budzianowski. Multi-Label Intent Detection via Contrastive Task Specialization of Sentence Encoders. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022), Dec 2022.

Fabian David Schmidt, Ivan Vulić, and Goran Glavaš. Don't Stop Fine-Tuning: On Training Regimes for Few-Shot Cross-Lingual Transfer with Multilingual Language Models. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022), Dec 2022.

Fabian David Schmidt, Ivan Vulić, and Goran Glavaš. SLICER: Sliced Fine-Tuning for Low-Resource Cross-Lingual Transfer for Named Entity Recognition. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022), Dec 2022.

Yaoyiran Li, Fangyu Liu, Ivan Vulić, and Anna Korhonen. Improving Bilingual Lexicon Induction with Cross-Encoder Reranking. In Findings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022), Dec 2022. 

Robert Litschko, Ivan Vulić, and Goran Glavaš. Parameter-Efficient Neural Reranking for Cross-Lingual and Multilingual Retrieval. In Proceedings of the 29th International Conference on Computational Linguistics (COLING 2022), Oct 2022.

Emanuele Bugliarello, Fangyu Liu, Jonas Pfeiffer, Siva Reddy, Desmond Elliott, Edoardo Maria Ponti, and Ivan Vulić. IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages . In Proceedings of the 39th International Conference on Machine Learning (ICML 2022) , July 2022 [Code and Data]

Marinela Parović, Ivan Vulić, Goran Glavaš, and Anna Korhonen. BAD-X: Bilingual Adapters Improve Zero-Shot Cross-Lingual Transfer. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2022), July 2022. 

Chia-Chien Hung, Anne Lauscher, Ivan Vulić, Simone Paolo Ponzetto, and Goran Glavaš. Multi2WOZ: A Robust Multilingual Dataset and Conversational Pretraining for Task-Oriented Dialog. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2022), July 2022. 

Iñigo Casanueva,* Ivan Vulić,* Georgios Spithourakis, and Paweł Budzianowski. NLU++: A Multi-Label, Slot-Rich, Generalisable Dataset for Natural Language Understanding in Task-Oriented Dialogue. In Findings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2022), July 2022. *equal contribution

Georgios Spithourakis, Ivan Vulić, Michał Lis, Iñigo Casanueva, and Paweł Budzianowski. EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification. In Findings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2022), July 2022. 

Alan Ansell, Edoardo Maria Ponti, Anna Korhonen, and Ivan Vulić. Composable Sparse Fine-Tuning for Cross-Lingual Transfer. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), May 2022.

Yaoyiran Li, Fangyu Liu, Nigel Collier, Anna Korhonen, and Ivan Vulić. Improving Word Translation via Two-Stage Contrastive Learning. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), May 2022. 

Wenxuan Zhou,* Fangyu Liu,* Ivan Vulić, Nigel Collier, and Muhao Chen. Prix-LM: Pretraining for Multilingual Knowledge Base Construction. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), May 2022. *equal contribution

Sebastian Ruder,* Ivan Vulić,* and Anders Søgaard* . Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research Manifold. In Findings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), May 2022. *equal contribution from all co-authors

Jonas Pfeiffer, Gregor Geigle, Aishwarya Kamath, Jan-Martin O. Steitz, Stefan Roth, Ivan Vulić, and Iryna Gurevych. xGQA: Cross-Lingual Visual Question Answering. In Findings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), May 2022.

Evgeniia Razumovskaia, Ivan Vulić, and Anna Korhonen. Data Augmentation and Learned Layer Aggregation for Improved Multilingual Language Understanding in Dialogue. In Findings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), May 2022.

Evgeniia Razumovskaia, Goran Glavaš, Olga Majewska, Anna Korhonen, and Ivan Vulić. Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems. Journal of Artificial Intelligence Research, 2022. [Github page]

Gabor Fuisz, Ivan Vulić, Samuel Gibbons, Iñigo Casanueva, and Paweł Budzianowski. Improved and Efficient Conversational Slot Labeling through Question Answering. CoRR, abs/2204.02123, Apr 2022.

Gregor Geigle,* Jonas Pfeiffer,* Nils Reimers, Ivan Vulić, and Iryna Gurevych. Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for Improved Cross-Modal Retrieval. Transactions of the Association for Computational Linguistics (TACL), volume 10. [Code] *equal contribution

Robert Litschko, Ivan Vulić, Simone Paolo Ponzetto, and Goran Glavaš. On Cross-Lingual Retrieval with Multilingual Text Encoders. Information Retrieval Journal, Mar 2022.


2021

Ivan Vulić, Pei-Hao Su, Sam Coope, Daniela Gerz, Paweł Budzianowski, Iñigo Casanueva, Nikola Mrkšić, and Tsung-Hsien Wen. ConvFiT: Conversational Fine-Tuning of Pretrained Language Models. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), Nov 2021.

Jonas Pfeiffer, Ivan Vulić, Iryna Gurevych, and Sebastian Ruder. UNKs Everywhere: Adapting Multilingual Language Models to New Scripts. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), Nov 2021.

Fangyu Liu, Ivan Vulić, Anna Korhonen, and Nigel Collier. Fast, Effective, and Self-Supervised: Transforming Masked Language Models into Universal Lexical and Sentence Encoders. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), Nov 2021. [Code]

Qianchu Liu, Edoardo Maria Ponti, Diana McCarthy, Ivan Vulić, and Anna Korhonen. AM2iCo: Evaluating Word Meaning in Context across Low-Resource Languages with Adversarial Examples. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), Nov 2021. [Code and Data]

Daniela Gerz, Pei-Hao Su, Razvan Kusztos, Avishek Mondal, Michał Lis, Eshan Singhal, Nikola Mrkšić, Tsung-Hsien Wen, and Ivan Vulić. Multilingual and Cross-Lingual Intent Detection from Spoken Data. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), Nov 2021.

Alan Ansell, Edoardo Maria Ponti, Jonas Pfeiffer, Sebastian Ruder, Goran Glavaš, Ivan Vulić, and Anna Korhonen. MAD-G: Multilingual Adapter Generation for Efficient Cross-Lingual Transfer. In Findings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), Nov 2021.

Qianchu Liu,* Fangyu Liu,* Nigel Collier, Anna Korhonen, and  Ivan Vulić. Mirror-WiC: On Eliciting Word-in-Context Representations from Pretrained Language Models. In Proceedings of the 25th SIGNLL Conference on Computational Language Learning (CoNLL 2021), Nov 2021. *equal contribution

Edoardo Maria Ponti, Julia Kreutzer, Ivan Vulić, and Siva Reddy. Modelling Latent Translations for Cross-Lingual Transfer. CoRR, abs/2107.11353, July 2021.

Ivan Vulić, Edoardo Maria Ponti, Anna Korhonen, and Goran Glavaš. LexFit: Lexical Fine-Tuning of Pretrained Language Models. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021), Aug 2021.

Olga Majewska, Ivan Vulić, Goran Glavaš, Edoardo Maria Ponti, and Anna Korhonen. Verb Knowledge Injection for Multilingual Event Processing. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021), Aug 2021.

Mengjie Zhao,* Yi Zhu,* Ehsan Shareghi, Ivan Vulić, Roi Reichart, Anna Korhonen, and Hinrich Schütze. A Closer Look at Few-Shot Cross-Lingual Transfer: The Choice of Shots Matters. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021), Aug 2021. *equal contribution

Soumya Barikeri, Anne Lauscher, Ivan Vulić, and Goran Glavaš. RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models.  In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021), Aug 2021.

Phillip Rust,* Jonas Pfeiffer,* Ivan Vulić, Sebastian Ruder, and Iryna Gurevych. How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021), Aug 2021. *equal contribution

Fangyu Liu, Ivan Vulić, Anna Korhonen, and Nigel Collier. Learning Domain-Specialised Representations for Cross-Lingual Biomedical Entity Linking. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021), Aug 2021.

Goran Glavaš and Ivan Vulić. Climbing the Tower of Treebanks: Improving Low-Resource Dependency Parsing via Hierarchical Source Selection. In Findings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021), Aug 2021. [Pretrained parsers and code]

Matthew Henderson and Ivan Vulić. ConVEx: Data-Efficient and Few-Shot Slot Labeling. In Proceedings of the 18th Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2021), Jun 2021.

Edoardo Maria Ponti, Ivan Vulić, Ryan Cotterell, Marinela Parović, Roi Reichart, and Anna Korhonen. Parameter Space Factorization for Zero-Shot Learning across Tasks and Languages. Transactions of the Association for Computational Linguistics (TACL), volume 9, Apr 2021. [Code]

Olga Majewska, Diana McCarthy, Jasper van den Bosch, Nikolaus Kriegeskorte, Ivan Vulić, and Anna Korhonen. Semantic Data Set Construction from Human Clustering and Spatial Arrangement. Computational Linguistics, volume 47 (1), Apr 2021. [Dataset]

Goran Glavaš and Ivan Vulić. Is Supervised Syntactic Parsing Beneficial for Language Understanding Tasks? An Empirical Investigation. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2021), Apr 2021. Best Long Paper Award. [Code]

Robert Litschko, Ivan Vulić, Simone Paolo Ponzetto, and Goran Glavaš. Evaluating Multilingual Text Encoders for Unsupervised Cross-Lingual Retrieval. In Proceedings of the 43rd European Conference on Information Retrieval (ECIR 2021), Apr 2021.

Nicolas Garneau,* Mareike Hartmann,* Anders Sandholm,* Sebastian Ruder,* Ivan Vulić,* and Anders Søgaard*. Analogy Training Multilingual Encoders. In Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI 2021), Feb 2021. *equal contribution from all co-authors


2020

Anne Lauscher, Ivan Vulić, Edoardo Maria Ponti, Anna Korhonen, and Goran Glavaš. Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity. In Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), Dec 2020. 

Goran Glavaš,* Mladen Karan,* and Ivan Vulić*. XHate-999: Analyzing and Detecting Abusive Language Across Domains and Languages. In Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), Dec 2020. [Data] *equal contribution from all co-authors

Yaoyiran Li, Edoardo Maria Ponti, Ivan Vulić, and Anna Korhonen. Emergent Communication Pretraining for Few-Shot Machine Translation. In Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), Dec 2020. 

Robert Litschko, Ivan Vulić, Željko Agić, and Goran Glavaš. Towards Instance-Level Parser Selection for Cross-Lingual Transfer of Dependency Parsers. In Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), Dec 2020. 

Olga Majewska, Ivan Vulić, Diana McCarthy, and Anna Korhonen. Manual Clustering and Spatial Arrangement of Verbs for Multilingual Evaluation and Typology Analysis. In Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), Dec 2020. 

Marko Vidoni, Ivan Vulić, and Goran Glavaš. Orthogonal Language and Task Adapters in Zero-Shot Cross-Lingual Transfer. CoRR, abs/2012.06460, Dec 2020.

Ivan Vulić, Edoardo Maria Ponti, Robert Litschko, Goran Glavaš, and Anna Korhonen. Probing Pretrained Language Models for Lexical Semantics. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), Nov 2020.

Jonas Pfeiffer, Ivan Vulić, Iryna Gurevych, and Sebastian Ruder. MAD-X: An Adapter-Based Framework for Multi-Task Cross-Lingual Transfer. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), Nov 2020. [Code and Models]

Anne Lauscher, Vinit Ravishankar, Ivan Vulić, and Goran Glavaš. From Zero to Hero: On the Limitations of Zero-Shot Cross-Lingual Transfer with Multilingual Transformers.  In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), Nov 2020.

Edoardo Maria Ponti,* Goran Glavaš,* Olga Majewska, Qianchu Liu, Ivan Vulić, and Anna Korhonen. XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning.  In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), Nov 2020. [Dataset] *equal contribution

Haim Dubossarsky, Ivan Vulić, Roi Reichart, and Anna Korhonen. The Secret is in the Spectra: Predicting Cross-Lingual Task Performance with Spectral Similarity Measures. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), Nov 2020.

Ivan Vulić,* Sebastian Ruder,* and Anders Søgaard*. Are All Good Word Vector Spaces Isomorphic? In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), Nov 2020. [Code] *equal contribution from all co-authors

Matthew Henderson, Iñigo Casanueva, Nikola Mrkšić, Pei-Hao Su, Tsung-Hsien Wen, and Ivan Vulić. ConveRT: Efficient and Accurate Conversational Representations from Transformers.  In Findings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), Nov 2020.

Jonas Pfeiffer,* Andreas Rücklé,* Clifton Poth,* Aishwarya Kamath, Ivan Vulić, Sebastian Ruder, Kyunghyun Cho, and Iryna Gurevych. AdapterHub: A Framework for Adapting Transformers. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations (EMNLP 2020), Nov 2020. [Code and Demos] *equal contribution

Ivan Vulić,* Simon Baker,* Edoardo Maria Ponti,* Ulla Petti, Ira Leviant, Kelly Wing, Olga Majewska, Eden Bar, Matt Malone, Thierry Poibeau, Roi Reichart, Anna Korhonen. Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity. Computational Linguistics, volume 46 (4), Oct 2020. [Project page and datasets] *equal contribution

Goran Glavaš, Ivan Vulić, Anna Korhonen, and Simone Paolo Ponzetto. SemEval-2020 Task 2: Predicting Multilingual and Cross-Lingual (Graded) Lexical Entailment. In Proceedings of the 14th International Workshop on Semantic Evaluation (SemEval 2020), Dec 2020. [Data]

Carlos Santos Armendariz, Matthew Purver, Senja Pollak, Nikola Ljubešić, Matej Ulčar, Ivan Vulić, and Mohammad Taher Pilehvar. SemEval-2020 Task 3: Graded Word Similarity in Context. n Proceedings of the 14th International Workshop on Semantic Evaluation (SemEval 2020), Dec 2020. [Data]

Ivan Vulić,* Mladen Karan,* Anna Korhonen, and Goran Glavaš. Classification-Based Self-Learning for Weakly Supervised Bilingual Lexicon Induction. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), Jul 2020. [Code] *equal contribution

Daniela Gerz, Ivan Vulić, Marek Rei, Roi Reichart, and Anna Korhonen. Multidirectional Associative Optimization of Function-Specific Word Representations. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), Jul 2020. [Code]

Sam Coope, Tyler Farghly, Daniela Gerz, Ivan Vulić, and Matthew Henderson. Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), Jul 2020. [Data]

Goran Glavaš and Ivan Vulić. Non-Linear Instance-Based Cross-Lingual Mapping for Non-Isomorphic Embedding Spaces. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), Jul 2020. [Code]

Ivan Vulić, Anna Korhonen, and Goran Glavaš. Improving Bilingual Lexicon Induction with Unsupervised Post-Processing of Monolingual Word Vector Spaces. In Proceedings of the 5th Workshop on Representation Learning for NLP (RepL4NLP, collocated with ACL 2020), Jul 2020. Best Short Paper Award.

Iñigo Casanueva, Tadas Temčinas, Daniela Gerz, Matthew Henderson, and Ivan Vulić. Efficient Intent Detection with Dual Sentence Encoders. Proceedings of the 2nd Workshop on NLP for Conversational AI (NLP4ConvAI 2020, collocated with ACL 2020), Jul 2020.

Olga Majewska, Diana McCarthy, Jasper van den Bosch, Nikolaus Kriegeskorte, Ivan Vulić, and Anna Korhonen. Spatial Multi-Arrangement for Clustering and Multi-way Similarity Dataset Construction. In Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), Apr 2020.

Anne Lauscher, Goran Glavaš, Simone Paolo Ponzetto, and Ivan Vulić. A General Framework for Implicit and Explicit Debiasing of Distributional Word Vector Spaces. In Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI 2020), Feb 2020.


2019

Edoardo Maria Ponti, Ivan Vulić, Ryan Cotterell, Roi Reichart, and Anna Korhonen. Towards Zero-Shot Language Modeling. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP 2019), Nov 2019. [Code]

Ivan Vulić, Goran Glavaš, Roi Reichart, and Anna Korhonen. Do We Really Need Fully Unsupervised Cross-Lingual Embeddings? In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP 2019), Nov 2019. [Data]

Edoardo Maria Ponti, Ivan Vulić, Goran Glavaš, Roi Reichart, and Anna Korhonen. Cross-Lingual Semantic Specialization via Lexical Relation Induction.  In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP 2019), Nov 2019. [Code and Data]

Matthew Henderson, Ivan Vulić, Daniela Gerz, Iñigo Casanueva, Paweł Budzianowski, Sam Coope, Georgios Spithourakis, Tsung-Hsien Wen, Nikola Mrkšić, and Pei-Hao Su. PolyResponse: A Rank-Based Approach to Task-Oriented Dialogue with Application in Restaurant Search and Booking. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing: System Demonstrations (EMNLP 2019), Nov 2019.

Yi Zhu, Benjamin Heinzerling, Ivan Vulić, Michael Strube, Roi Reichart, and Anna Korhonen. On the Importance of Subword Information for Morphological Tasks in Truly Low-Resource Languages. In Proceedings of the 23rd SIGNLL Conference on Computational Language Learning (CoNLL 2019), Nov 2019.

Qianchu Liu, Diana McCarthy, Ivan Vulić, and Anna Korhonen. Investigating Cross-Lingual Alignment Methods for Contextualized Embeddings with Token-Level Evaluation. In Proceedings of the 23rd SIGNLL Conference on Computational Language Learning (CoNLL 2019), Nov 2019.

Anders Søgaard, Ivan Vulić, Sebastian Ruder, and Manaal Faruqui. Cross-Lingual Word Embeddings. Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers, Jun 2019. Book available online

Paweł Budzianowski and Ivan Vulić. Hello, it's GPT-2 - How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems. In Proceedings of the 3rd Workshop on Neural Generation and Translation (WNGT, collocated with EMNLP 2019), Nov 2019.

Matthew Henderson, Ivan Vulić, Daniela Gerz, Iñigo Casanueva, Paweł Budzianowski, Sam Coope, Georgios Spithourakis, Tsung-Hsien Wen, Nikola Mrkšić, and Pei-Hao Su. Training Neural Response Selection for Task-Oriented Dialogue Systems. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019), Jul 2019.

Ivan Vulić, Simone Paolo Ponzetto, and Goran Glavaš. Multilingual and Cross-Lingual Graded Lexical Entailment.  In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019), Jul 2019.

Goran Glavaš, Robert Litschko, Sebastian Ruder, and Ivan Vulić. How to (Properly) Evaluate Cross-Lingual Word Embeddings: On Strong Baselines, Comparative Analyses, and Some Misconceptions. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019), Jul 2019. [Code and Data]

Goran Glavaš and Ivan Vulić. Generalized Tuning of Distributional Word Vectors for Monolingual and Cross-Lingual Lexical Entailment. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019), Jul 2019.

Željko Agić and Ivan Vulić. JW300: A Wide-Coverage Parallel Corpus for Low-Resource Languages. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019), Jul 2019. [Dataset]

Edoardo Maria Ponti, Helen O'Horan, Yevgeni Berzak, Ivan Vulić, Roi Reichart, Thierry Poibeau, Ekaterina Shutova, and Anna Korhonen. Modeling Language Variation and Universals: A Survey on Typological Linguistics for Natural Language Processing. Computational Linguistics, volume 45 (3), Sept 2019.

Matthew Henderson, Pawel Budzianowski, Iñigo Casanueva, Sam Coope, Daniela Gerz, Girish Kumar, Nikola Mrkšić, Georgios Spithourakis, Pei-Hao Su, Ivan Vulić, and Tsung-Hsien Wen. A Repository of Conversational Datasets. In Proceedings of the 1st Workshop on NLP for Conversational AI (NLP4ConvAI 2019, collocated with ACL 2019), Aug 2019. [Code and Data]

Aishwarya Kamath,* Jonas Pfeiffer,* Edoardo Maria Ponti, Goran Glavaš, and Ivan Vulić. Specialising Distributional Vectors of All Words for Lexical Entailment. In Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP, collocated with ACL 2019), Aug 2019. Best Long Paper Award. *equal contribution

Sebastian Ruder, Ivan Vulić, and Anders Søgaard. A Survey of Cross-Lingual Word Embedding Models. Journal of Artificial Intelligence Research, volume 64, Aug 2019.

Robert Litschko, Goran Glavaš, Ivan Vulić, and Laura Dietz. Evaluating Resource-Lean Cross-Lingual Embedding Models in Unsupervised Retrieval. In Proceedings of the 42nd Annual International Conference on Research and Development in Information Retrieval (SIGIR 2019), Jul 2019. 

Yi Zhu, Ivan Vulić, and Anna Korhonen. A Systematic Study of Leveraging Subword Information for Learning Word Representations. In Proceedings of the 17th Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2019), Jun 2019. [Code]

Ehsan Shareghi, Daniela Gerz, Ivan Vulić, and Anna Korhonen. Show Some Love to Your n-Grams: A Bit of Progress and Stronger n-Gram Language Modeling Baselines. In Proceedings of the 17th Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2019), Jun 2019.

Geert Heyman, Ivan Vulić, Bregt Verreet, and Marie-Francine Moens. Learning Unsupervised Multilingual Word Embeddings with Incremental Multilingual Hubs. In Proceedings of the 17th Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2019), Jun 2019.

Goran Glavaš and Ivan Vulić. Zero-Shot Language Transfer for Cross-Lingual Sentence Retrieval with the Bidirectional Attention Model. In Proceedings of the 41st European Conference on Information Retrieval (ECIR 2019), Apr 2019. 


2018

Daniela Gerz,* Ivan Vulić,* Edoardo Maria Ponti, Roi Reichart, and Anna Korhonen. On the Relation between Linguistic Typology and (Limitations of) Multilingual Language Modeling. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP 2018), Nov 2018. *equal contribution

Edoardo Maria Ponti,* Ivan Vulić,* Goran Glavaš, Nikola Mrkšić, and Anna Korhonen. Adversarial Propagation and Zero-Shot Cross-Lingual Transfer of Word Vector Specialization. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP 2018), Nov 2018. *equal contribution [Code]

Daniela Gerz, Ivan Vulić, Edoardo Maria Ponti, Jason Naradowsky, Roi Reichart, and Anna Korhonen. Language Modeling for Morphologically Rich Languages: Character-Aware Modeling for Word-Level Prediction. Transactions of the Association for Computational Linguistics (TACL), volume 6, Jul 2018.

Geert Heyman, Ivan Vulić, Yannick Laevaert, and Marie-Francine Moens. Automatic Detection and Correction of Context-Dependent DT-Mistakes in Dutch using Neural Networks. Computational Linguistics in the Netherlands (CLIN) Journal, volume 8, Dec 2018.

Geert Heyman, Ivan Vulić, and Marie-Francine Moens. A Deep Learning Approach to Bilingual Lexicon Induction in the Biomedical Domain. BMC Bioinformatics, volume 19, article 259, Jul 2018.

Edoardo Maria Ponti, Roi Reichart, Anna Korhonen, and Ivan Vulić. Isomorphic Transfer of Syntactic Structures in Cross-Lingual Natural Language Processing. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018), Jul 2018.

Anders Søgaard, Sebastian Ruder, and Ivan Vulić. On the Limitations of Unsupervised Bilingual Dictionary Induction. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018), Jul 2018.

Goran Glavaš and Ivan Vulić. Explicit Retrofitting of Distributional Word Vectors. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018), Jul 2018. [Code]

Nikola Mrkšić and Ivan Vulić. Fully Statistical Neural Belief Tracking. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018), Jul 2018. [Code]

Marek Rei, Daniela Gerz, and Ivan Vulić. Scoring Lexical Entailment with a Supervised Directional Similarity Network.  In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018), Jul 2018.

Guy Rotman, Ivan Vulić, and Roi Reichart. Bridging Languages through Images with Deep Partial Canonical Correlation Analysis. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018), Jul 2018.

Ivan Vulić and Anna Korhonen. Injecting Lexical Contrast into Word Vectors by Guiding Vector Space Specialisation. In Proceedings of the 3rd Workshop on Representation Learning for NLP (RepL4NLP, collocated with ACL 2018), Jul 2018.

Robert Litschko, Goran Glavaš, Simone Paolo Ponzetto, and Ivan Vulić. Unsupervised Cross-Lingual Information Retrieval using Monolingual Data Only. In Proceedings of the 41st Annual International Conference on Research and Development in Information Retrieval (SIGIR 2018), Jul 2018. [Code]

Ivan Vulić and Nikola Mrkšić. Specialising Word Vectors for Lexical Entailment. In Proceedings of the 16th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2018), Jun 2018. [Code]

Ivan Vulić, Goran Glavaš, Nikola Mrkšić, and Anna Korhonen. Post-Specialisation: Retrofitting Vectors of Words Unseen in Lexical Resources. In Proceedings of the 16th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2018), Jun 2018. [Code]

Goran Glavaš and Ivan Vulić. Discriminating between Lexico-Semantic Relations with the Specialization Tensor Model. In Proceedings of the 16th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2018), Jun 2018. [Code]

Olga Majewska, Diana McCarthy, Ivan Vulić, and Anna Korhonen. Acquiring Verb Classes through Bottom-Up Semantic Verb Clustering. In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018).

Billy Chiu, Sampo Pyysalo, Ivan Vulić, and Anna Korhonen. Bio-SimVerb and BioSimLex: Wide-Coverage Evaluation Sets of Word Similarity in Biomedicine. BMC Bioinformatics, volume 19, article 33, Feb 2018. [Data]


2017

Ivan Vulić, Daniela Gerz, Douwe Kiela, Felix Hill, and Anna Korhonen. HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment. Computational Linguistics, volume 43 (4), Dec 2017. [Data]

Olga Majewska, Ivan Vulić, Diana McCarthy, Yan Huang, Akira Murakami, Veronika Laippala, and Anna Korhonen. Investigating the Cross-Lingual Translatability of VerbNet-Style Classification. Language Resources and Evaluation, volume 52 (3), Oct 2017.

Ivan Vulić, Nikola Mrkšić, and Anna Korhonen. Cross-Lingual Induction and Transfer of Verb Classes Based on Word Vector Space Specialisation. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP 2017), Sept 2017.

Nikola Mrkšić, Ivan Vulić, Diarmuid Ó Séaghdha, Roi Reichart, Ira Leviant, Milica Gašić, Anna Korhonen, and Steve Young. Semantic Specialisation of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints. Transactions of the Association for Computational Linguistics (TACL), volume 5, Sept 2017. [Code]

Ivan Vulić, Roy Schwartz, Ari Rappoport, Roi Reichart, and Anna Korhonen. Automatic Selection of Context Configurations for Improved Class-Specific Word Representations. In Proceedings of the 21st SIGNLL Conference on Computational Language Learning (CoNLL 2017), Jul 2017.

Ivan Vulić, Nikola Mrkšić, Roi Reichart, Diarmuid Ó Séaghdha, Steve Young, and Anna Korhonen. Morph-Fitting: Fine-Tuning Word Vector Spaces with Simple Language-Specific Rules. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL 2017), Jul 2017.

Edoardo Maria Ponti, Ivan Vulić, and Anna Korhonen. Decoding Sentiment from Distributed Representations of Sentences. In Proceedings of the 6th Joint Conference on Lexical and Computational Semantics (*SEM 2017), Jul 2017.

Goran Glavaš, Ivan Vulić, and Simone Paolo Ponzetto. If Sentences Could See: Investigating Visual Information for Semantic Textual Similarity. In Proceedings of the 12th International Conference on Computational Semantics (IWCS 2017), Sept 2017.

Ivan Vulić. Cross-Lingual Syntactically Informed Distributed Word Representations. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017), Apr 2017.

Ivan Vulić, Douwe Kiela, and Anna Korhonen. Evaluation by Association: A Systematic Study of Quantitative Word Association Evaluation. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017), Apr 2017.

Geert Heyman, Ivan Vulić, and Marie-Francine Moens. Bilingual Lexicon Induction by Learning to Combine Word-Level and Character-Level Representations. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017), Apr 2017.


2016

Helen O'Horan, Yevgeni Berzak, Ivan Vulić, Roi Reichart, and Anna Korhonen. Survey on the Use of Typological Information in Natural Language Processing. In Proceedings of the 26th International Conference on Computational Linguistics (COLING 2016), Dec 2016.

Daniela Gerz, Ivan Vulić, Felix Hill, Roi Reichart, and Anna Korhonen. SimVerb-3500: A Large-Scale Evaluation Set of Verb Similarity. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing (EMNLP 2016), Nov 2016. [Dataset]

Susana Zoghbi, Ivan Vulić and Marie-Francine Moens. Latent Dirichlet Allocation for Linking User-Generated Content and E-Commerce Data. Information Sciences, volume 367-368, Jun 2016.

Ivan Vulić and Anna Korhonen. On the Role of Seed Lexicons in Learning Bilingual Word Embeddings. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016), Aug 2016.

Ivan Vulić, Douwe Kiela, Stephen Clark and Marie-Francine Moens. Multi-Modal Representations for Improved Bilingual Lexicon Learning. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016), Aug 2016.

Ivan Vulić and Anna Korhonen. Is "Universal Syntax" Universally Useful for Learning Distributed Word Representations? In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016), Aug 2016.

Ivan Vulić and Marie-Francine Moens. Bilingual Distributed Word Representations from Document-Aligned Comparable Data. Journal of Artificial Intelligence Research, volume 55, Apr 2016.

Geert Heyman, Ivan Vulić and Marie-Francine Moens. C-BiLDA: Extracting Cross-Lingual Topics from Non-Parallel Texts by Distinguishing Shared from Unshared Content. Data Mining and Knowledge Discovery, volume 30 (5), Apr 2016.


2015

Douwe Kiela, Ivan Vulić and Stephen Clark. Visual Bilingual Lexicon Induction with Transferred ConvNet Features. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP 2015), Sept 2015.

Ivan Vulić and Marie-Francine Moens. Monolingual and Cross-Lingual Information Retrieval Models Based on (Bilingual) Word Embeddings. In Proceedings of the 38th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2015), Aug 2015.

Ivan Vulić and Marie-Francine Moens. Bilingual Word Embeddings from Non-Parallel Document-Aligned Data Applied to Bilingual Lexicon Induction. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics (ACL 2015), Jul 2015.

Douwe Kiela, Laura Rimell, Ivan Vulić and Stephen Clark. Exploiting Image Generality for Lexical Entailment Detection. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics (ACL 2015), Jul 2015.

Niraj Shrestha, Ivan Vulić, and Marie-Francine Moens. Semantic Role Labeling of Speech Transcripts. In Proceedings of the 16th International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 2015), Apr 2015.

Mladen Karan, Goran Glavaš, Jan Šnajder, Bojana Dalbelo Bašić, Ivan Vulić and Marie-Francine Moens. TKLBLIIR: Detecting Twitter Paraphrases with TweetingJay. In Proceedings of the 9th International Workshop on Semantic Evaluations (SemEval 2015), Jun 2015.

Ivan Vulić, Wim De Smet, Jie Tang and Marie-Francine Moens. Probabilistic Topic Modeling in Multilingal Settings: An Overview of Its Methodology and Applications. Information Processing & Management, volume 51 (1), Jan 2015.


2014

Ivan Vulić and Marie-Francine Moens. Probabilistic Models of Cross-Lingual Semantic Similarity in Context Based on Latent Cross-Lingual Concepts Induced from Comparable Data. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP 2014), Oct 2014.

Ivan Vulić, Susana Zoghbi and Marie-Francine Moens. Learning to Bridge Colloquial and Formal Language Applied to Linking and Search of E-Commerce Data. In Proceedings of the 37th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2014), Jul 2014.

Kris Heylen, Stephen Bond, Dirk De Hertog, Ivan Vulić, and Hendrik Kockaert. TermWise: A CAT-Tool with Context-Sensitive Terminological Support. In Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), May 2014.

Kris Heylen, Stephen Bond, Dirk De Hertog, Hendrik Kockaert, Frieda Steurs, and Ivan Vulić. TermWise: Leveraging Big Data for Terminological Support in Legal Translation. In Proceedings of the 11th International Conference on Terminology and Knowledge Engineering (TKE 2014), Jun 2014.

Ivan Vulić. Unsupervised Algorithms for Cross-Lingual Text Analysis, Translation Mining, and Information Retrieval. PhD Thesis. Supervisor: Marie-Francine Moens, xxxvi+278 pages, KU Leuven, Department of Computer Science, Jun 2014.


2013

Ivan Vulić and Marie-Francine Moens. A Study on Bootstrapping Bilingual Vector Spaces from Non-Parallel Data (and Nothing Else). In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2013), Oct 2013.

Niraj Shrestha, Ivan Vulić and Marie-Francine Moens. An IR-Inspired Approach to Recovering Named Entity Tags in Broadcast News. In Proceedings of the 6th Information Retrieval Facility Conference (IRFC 2013), Oct 2013.

Ivan Vulić and Marie-Francine Moens. Cross-Lingual Semantic Similarity of Words as the Similarity of Their Semantic Word Responses. In Proceedings of the 13th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2013), Jun 2013.

Susana Zoghbi, Ivan Vulić and Marie-Francine Moens. Are Words Enough? A Study on Text-Based Representations and Retrieval Models for Linking Pins to Online Shops. In Proceedings of the CIKM 2013 Workshop on Mining Unstructured Big Data Using Natural Language Processing (UnstructureNLP@CIKM 2013), Oct 2013.

Susana Zoghbi, Ivan Vulić and Marie-Francine Moens. I Pinned It. Where Can I Buy One like It? Automatically Linking Pinterest Pins to Online Webshops. In Proceedings of the CIKM 2013 Workshop on Data-Driven User Behavioral Modelling and Mining from Social Media (DUBMOD@CIKM 2013), Oct 2013.

Ivan Vulić, Wim De Smet and Marie-Francine Moens. Cross-Language Information Retrieval Models Based on Latent Topic Models Trained with Document-Aligned Comparable Corpora. Information Retrieval, volume 16 (3), Apr 2013.

Ivan Vulić and Marie-Francine Moens. A Unified Framework for Monolingual and Cross-Lingual Relevance Modeling Based on Probabilistic Topic Models. In Proceedings of the 35th European Conference on Information Retrieval (ECIR 2013), Mar 2013.

Marie-Francine Moens and Ivan Vulić. Monolingual and Cross-Lingual Probabilistic Topic Models and Their Application in Information Retrieval. In Proceedings of the 35th European Conference on Information Retrieval (ECIR 2013), Mar 2013.


2012

Ivan Vulić and Marie-Francine Moens. Sub-Corpora Sampling with Application to Bilingual Lexicon Extraction. In Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012), Dec 2012.

Ivan Vulić and Marie-Francine Moens. Detecting Highly Confident Word Translations from Comparable Corpora without Any Prior Knowledge. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2012), Apr 2012.

Bram Jans, Steven Bethard, Ivan Vulić and Marie-Francine Moens. Skip N-grams and Ranking Functions for Predicting Script Events. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2012), Apr 2012.


2011

Ivan Vulić, Wim De Smet and Marie-Francine Moens. Cross-Language Information Retrieval with Latent Topic Models Trained on a Comparable Corpus. In Proceedings of the 7th Asia Information Retrieval Societies Conference - Information Retrieval Technology (AIRS 2011), Dec 2011.

Ivan Vulić, Wim De Smet and Marie-Francine Moens. Identifying Word Translations from Comparable Corpora Using Latent Topic Models. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL 2011), Jun 2011.