Publications
2025
Idiart, M., A. Lenci, T. Poibeau, A. Villavicencio (To appear 2024) Special Issue on Computational Models of Language and Cognition. Journal of Natural Language Engineering.
Diab M., A. Villavicencio, M. Apidianaki, V. Kordoni, A. Korhonen, P. Nakov, M. Stevenson (To appear in 2024). Computational lexical semantics and lexicography essays: In honor of Adam Kilgarriff. Springer.
2024
W. He, M. Idiart, C. Scarton, A. Villavicencio (2024) Learning Multilingual Idiomatic Representations by an Adaptive Contrastive Triplet Loss. Accepted for Findings of ACL-2024.
R Wilkens, L Zilio, A Villavicencio (2024) Assessing linguistic generalisation in language models: a dataset for Brazilian Portuguese. Language Resources and Evaluation 58 (1), 175-201
G Soroka, M Idiart, A Villavicencio (2024) Mechanistic role of alpha oscillations in a computational model of working memory. Plos One 19 (2), e0296217
B Peng, W He, B Chen, A Villavicencio, C Wu (2024) Multi-perspective thought navigation for source-free entity linking. Pattern Recognition Letters 178, 84-90
W. He, K. Farrahi, B. Chen, B. Peng, A. Villavicencio (2023) Representation transfer and data cleaning in multi-views for text simplification. Pattern Recognition Letters. https://doi.org/10.1016/j.patrec.2023.11.011 pdf
D. Phelps, T. Pickard, M. Mi, E. Gow-Smith, A. Villavicencio (2024) Sign of the Times: Evaluating the use of Large Language Models for Idiomaticity Detection. Proceedings of the Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD) @ LREC-COLING 2024. Best Paper Award. pdf
E Gow-Smith, D Phelps, HT Madabushi, C Scarton, A Villavicencio (2024) Word Boundary Information Isn't Useful for Encoder Language Models. arXiv preprint arXiv:2401.07923
A Yamaguchi, A Villavicencio, N Aletras (2024) An Empirical Study on Cross-lingual Vocabulary Adaptation for Efficient Generative LLM Inference. arXiv preprint arXiv:2402.10712
2023
R Wilkens, L Zilio, A Villavicencio (2023) Assessing linguistic generalisation in language models: a dataset for Brazilian Portuguese. Language Resources and Evaluation, 1-27, 2023. pdf
Zhao K., Yang B., Lin C., Rong W., Villavicencio A. and Cui X. Evaluating Open-Domain Dialogues in Latent Space with Next Sentence Prediction and Mutual Information, The 61st Annual Meeting of the Association for Computational Linguistics (ACL), 2023.
2022
Salle, A., A. Villavicencio (2022) Understanding the Effects of Negative (and Positive) Pointwise Mutual Information on Word Vectors. Journal of Experimental & Theoretical Artificial Intelligence.
Gow-Smith, E., H.T. Madabushi, C. Scarton, A. Villavicencio (2022). Improving Tokenisation by Alternative Treatment of Spaces. In Proceedings of EMNLP 2022. pdf
Phelps, D., X.-R. Fan, E. Gow-Smith, H.T. Madabushi, C. Scarton, A. Villavicencio (2022) Sample Efficient Approaches for Idiomaticity Detection. In Proceedings of the 18th Workshop on Multiword Expressions (MWE 2022). pdf
Madabushi, H.T., E. Gow-Smith, M. Garcia, C. Scarton, M. Idiart, A. Villavicencio (2022) SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding. Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022). pdf
Boito, M.Z., B. Yusuf, L. Ondel, A. Villavicencio, L. Besacier (2022) Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings. In Proceedings of the 1st Annual Meeting of the ELRA/ISCA Special Interest Group on Under-Resourced Languages (SIGUL 2022). pdf
Ramisch, C., A. Villavicencio (2022) Computational treatment of multiword expressions. In R. Mitkov. Oxford Handbook on Computational Linguistics (2nd edition).
Bigoulaeva, I., R. S. Sachdeva, H. T. Madabushi, A. Villavicencio, I. Gurevych (2022) Effective Cross-Task Transfer Learning for Explainable Natural Language Inference with T5. In Proceedings of the 3rd Workshop on Figurative Language Processing. EMNLP 2022.
2021
Madabushi, H.T., E. Gow-Smith, C. Scarton, A. Villavicencio (2021) AStitchInLanguageModels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language Models. In Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021. pdf
Garcia, M., T.K. Vieira, C. Scarton, M. Idiart, A. Villavicencio (2021) Assessing Idiomaticity Representations in Vector Models with a Noun Compound Dataset Labeled at Type and Token Levels. Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021). pdf
Boito, M.Z., A. Villavicencio and L. Besacier (2021). Investigating Alignment Interpretability for Low-resource NMT. Journal of Machine Translation. pdf
Garcia, M., T.K. Vieira, C. Scarton, M. Idiart, A. Villavicencio (2021) Probing for idiomaticity in vector space models. Proceedings of the 16th conference of the European Chapter of the Association for Computational Linguistics (EACL 2021). pdf dataset software
Vickers, P., R. Wainwright, H. T. Madabushi, A. Villavicencio (2021) CogNLP-Sheffield at CMCL 2021 Shared Task: Blending Cognitively Inspired Features with Transformer-based Language Models for Predicting Eye Tracking Patterns. Proceedings of the Cognitive Modeling and Computational Linguistics (CMCL 2021). pdf
Hürriyetoğlu, A., H. Tanev, V. Zavarella, J. Piskorski, R. Yeniterzi, O. Mutlu, D. Yuret, A. Villavicencio (2021) Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021): Workshop and Shared Task Report. Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021). pdf
2020
Boito, M.Z., A. Villavicencio and L. Besacier (2020) Investigating Language Impact in Bilingual Approaches for Computational Language Documentation. In Proceedings of 1st Joint Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages Workshop (SLTU-CCURL 2020). pdf
Hashempour, R., A. Villavicencio (2020) Leveraging Contextual Embeddings and Idiom Principle for Detecting Idiomaticity in Potentially Idiomatic Expressions. In Proceedings of the Workshop on the Cognitive Aspects of the Lexicon. pdf.
2019
Idiart, M., A. Villavicencio, B. Katz, C. Rennó-Costa, J. Lisman (2019) How the brain represents language and answers questions? Using an AI system to understand the underlying neurobiological mechanisms. In Frontiers In Computational Neuroscience. 12 March 2019, Volume 13, DOI 10.3389/fncom.2019.00012. pdf.
Cordeiro, S., A. Villavicencio, M. Idiart, C. Ramisch (2019) Unsupervised Compositionality Prediction of Nominal Compounds. In Computational Linguistics. Volume 45, Issue 1, March 2019, p.1-57.
Villavicencio, A., M. Idiart (2019) Discovering Multiword Expressions. Journal of Natural Language Engineering.
Boito, M. Z., A. Villavicencio, L. Besacier (2019) Empirical Evaluation of Sequence-to-Sequence Models for Word Discovery in Low-resource Settings. In Proceedings of the 20th Annual Conference of the International Speech Communication Association (Interspeech 2019). pdf
Boito, M. Z., A. Villavicencio, L. Besacier (2019) How Does Language Influence Documentation Workflow? Unsupervised Word Discovery Using Translations in Multiple Languages. arXiv:1910.05154. pdf
Salle A., A. Villavicencio (2019) Why So Down? The Role of Negative (and Positive) Pointwise Mutual Information in Distributional Semantics. arXiv:1908.06941 pdf
2018
Salle A., A. Villavicencio (2018) Restricted Recurrent Neural Tensor Networks: Exploiting Word Frequency and Compositionality. In Proceedings of The 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018). pdf
Paula F., R. Wilkens, M. Idiart, A. Villavicencio (2018) Similarity Measures for the Detection of Clinical Conditions with Verbal Fluency Tasks. In Proceedings of The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT 2018). pdf
Salle A., A. Villavicencio (2018) Incorporating subword information into matrix factorization word embeddings. arXiv:1805.03710 pdf code
Godard, P., M. Z. Boito, L. Ondel, A. Berard, F. Yvon, A. Villavicencio, L. Besacier (2018) Unsupervised Word Segmentation from Speech with Attention. In Proceedings of Interspeech 2018. pdf
Boito, M. Z., A. Anastasopoulos, M. Lekakou, A. Villavicencio, L. Besacier (2018) A small Griko-Italian speech translation corpus. arXiv:1807.10740. pdf
Poibeau T., A. Villavicencio. (eds.) (2018). Language, Cognition and Computational Models. Cambridge University Press.
Idiart, M., A. Lenci, T. Poibeau, A. Villavicencio (eds.) (2018) Proceedings of the Eight Workshop on Cognitive Aspects of Computational Language Learning and Processing. Association for Computational Linguistics. pdf
Villavicencio, A., V. Moreira, A. Abad, H. Caseli, P. Gamallo, C. Ramisch, H. Gonçalo Oliveira, G. Paetzold (2018) Computational Processing of the Portuguese Language. Proceedings of the 13th International Conference (PROPOR 2018). Springer
Ramisch, C., R. Ramisch, L. Zilio, A. Villavicencio, S. Cordeiro (2018) A Corpus Study of Verbal Multiword Expressions in Brazilian Portuguese. In Proceedings of The 13th edition of the International Conference on the Computational Processing of Portuguese (PROPOR 2018). Resource.
Wagner Filho, J., R. Wilkens, M. Idiart, A. Villavicencio (2018). The brWaC Corpus: A New Open Resource to Aid in the Processing of Brazilian Portuguese. In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC). pdf
2017
Wilkens, R., L. Zilio, S. Cordeiro, F. Paula, C. Ramisch, M. Idiart, A. Villavicencio. LexSubNC: A Dataset of Lexical Substitution for Nominal Compounds. In Proceedings of the 12th International Conference on Computational Semantics (IWCS). pdf
Boito, M.Z., A. Bérard, A. Villavicencio, L. Besacier. Unwritten Languages Demand Attention Too! Word Discovery With Encoder-Decoder Models. In Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2017). pdf
2016
Cordeiro, S., Ramisch, C., Idiart, M., Villavicencio, A. Predicting the Compositionality of Nominal Compounds: Giving Word Embeddings a Hard Time. In Proceedings of The 54th Annual Meeting of the Association for Computational Linguistics (ACL-2016), 2016. pdf, dataset
Salle, A., Idiart, M., Villavicencio, A. Matrix Factorization using Window Sampling for Improved Word Representations. In Proceedings of The 54th Annual Meeting of the Association for Computational Linguistics (ACL-2016), 2016. pdf, code
Ramisch, C., Cordeiro, S., Idiart, M., Villavicencio, A. How Naked is the Naked Truth? A Multilingual Lexicon of Nominal Compound Compositionality. In Proceedings of The 54th Annual Meeting of the Association for Computational Linguistics (ACL-2016), 2016. pdf
Zilio, L., Wilkens, R., Mollmann, L., Idiart, M., Wehrli, E., Cordeiro, S., Villavicencio, A. Joining Forces for Multiword Expression Identification. In Proceedings of 12th International Conference on the Computational Processing of Portuguese (PROPOR), 2016. pdf
Wilkens, R., Zilio, L., Ferreira, E., Villavicencio, A. The Portuguese B2SG: a semantic test for distributional thesaurus. In Proceedings of 12th International Conference on the Computational Processing of Portuguese (PROPOR), 2016. pdf
Wagner Filho, J., Wilkens, R., Zilio, L., Idiart, M., Villavicencio, A. Crawling by Readability Level. In Proceedings of 12th International Conference on the Computational Processing of Portuguese (PROPOR), 2016. pdf
Cordeiro, S., Ramisch, C., Villavicencio, A. mwetoolkit+sem: Integrating Word Embeddings in the mwetoolkit for Semantic MWE Processing. In Proceedings of 10th edition of the Language Resources and Evaluation Conference (LREC), 2016. pdf
Wilkens, R., Zilio, L., Ferreira, E., Villavicencio, A. B2SG: a TOEFL-like task for Portuguese. In Proceedings of 10th edition of the Language Resources and Evaluation Conference (LREC), 2016. pdf
Wilkens, R., Idiart, M., Villavicencio, A. Multiword Expressions in Child Language. In Proceedings of 10th edition of the Language Resources and Evaluation Conference (LREC), 2016. pdf
Cordeiro, S., Ramisch, C., Villavicencio, A. UFRGS&LIF: Rule-Based MWE Identification and Predominant-Supersense Tagging. In Proceedings of SemEval 2016 Task 10: Detecting Minimal Semantic Units and their Meanings (DiMSUM), 2016. pdf
Wilkens, R., Zilio, L., Idiart, M., Wagner Filho, J., Ferreira, E., Santos, L., Pasqualini, B., Villavicencio, A. Resources for Monolingual Translation: a case study of Text Simplification for Portuguese. In Proceedings of PROPOR 2016 Workshop on Corpora and Tools for Processing Corpora, 2016.
Cordeiro, S., Ramisch, C., Idiart, M., Villavicencio, A. MWE-aware corpus processing with the mwetoolkit and word embeddings. In Proceedings of PROPOR 2016 Workshop on Corpora and Tools for Processing Corpora, 2016. pdf
Cordeiro, S., Ramisch, C., Villavicencio, A. Nominal Compound Compositionality: A Multilingual Lexicon and Predictive Model. In Proceedings of the 7th PARSEME General Meeting, Dubrovnik, Croatia, September 2016. pdf
2015
Wilkens, R., Zilio, L., Ferreira, E., Goncalves, G., Villavicencio, A. Tesauros Distribucionais para o Português. In Proceedings of the 10th Brazilian Symposium in Information and Human Language Technology, 2015. pdf
Zilio, L., Finatto, M.J.B., Villavicencio, A. VerbLexPor: um recurso léxico com anotação de papéis semânticos para o português. In Proceedings of the 10th Brazilian Symposium in Information and Human Language Technology, 2015. pdf
Ferreira, E., Boito, F.Z., Villavicencio, A. Comparação dos algoritmos sequencial e paralelo para contagem de palavras e contexto. In Proceedings of IV Workshop de Iniciação Científica em Tecnologia da Informação e da Linguagem Humana (Tilic 2015), 2015, Natal. pdf
Ferreira, E., Boito, F.Z., Villavicencio, A. Comparison of sequential and parallel algorithms for word and context count. WSPPD 2015 - XIII Workshop de Processamento Paralelo e Distribuído pdf
Ferreira, E., Wilkens, R., Villavicencio, A. Exploração dos parâmetros para criação de Tesauros Distribucionais preditivos. In Proceedings of IV Workshop de Iniciação Científica em Tecnologia da Informação e da Linguagem Humana (Tilic 2015), 2015, Natal. pdf
Wehrli, E., Villavicencio, A. Extraction of Multilingual MWEs from Aligned Corpora. In Proceedings of PARSEME 5 Meeting, 2015, Iasi. pdf
Cordeiro, S., Ramisch, C., Villavicencio, A. Token-based MWE Identification Strategies in the mwetoolkit. In Proceedings of PARSEME 4th general meeting, 2015, Valletta. pdf
2014
M. Padró, M. Idiart, C. Ramisch and A. Villavicencio. Nothing like Good Old Frequency: Studying Context Filters for Distributional Thesauri. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2014. pdf
B. Laranjeira, V. Moreira, A. Villavicencio, C. Ramisch and M. J. Finatto. Comparing the Quality of Focused Crawlers and of the Translation Resources Obtained from them. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), 2014. pdf
M. Padró, M. Idiart, A. Villavicencio and C. Ramisch. Comparing Similarity Measures for Distributional Thesauri. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), 2014. pdf
R. A. S. Boos, K. V. Prestes, A. Villavicencio. Identification of Multiword Expressions in the brWaC. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), 2014. pdf
R. Wilkens, A. D. Vecchia, M. Z. Boito, M. Padró and A. Villavicencio. Size does not matter. Frequency does. A study of features for measuring lexical complexity. In Proceedings of the 14th edition of the Ibero-American Conference on Artificial Intelligence, 2014. Springer Lecture Notes in Computer Science/Lecture Notes in Artificial Intelligence LNCS/LNAI. pdf
R. A. S. Boos, K. V. Prestes, A. Villavicencio and M. Padró. brWaC: a WaCky Corpus for Brazilian Portuguese. In Proceedings of the 11th International Conference on Computational Processing of Portuguese (PROPOR 2014), 2014. Springer Lecture Notes in Computer Science LNCS, Vol. 8775. pdf
M. Z. Boito, L. Hagemann, R. Wilkens, A. Villavicencio. Uma análise do perfil de entropia das estruturas sintáticas do português. In Proceedings of the PROPOR-2014 Workshop on Tools and Resources for Automatically Processing Portuguese and Spanish, 2014. pdf
L. A. Alemany, M. Padró, A. Rademaker, A. Villavicencio. Preface. In Proceedings of the PROPOR 2014 Workshop on Tools and Resources for Automatically Processing Portuguese and Spanish (ToRPorEsp), 2014. pdf
R. Wilkens, G. Schwade, A. Villavicencio. A web interface for language acquisition studies. In Proceedings of the 11th International Conference on Computational Processing of Portuguese Language (PROPOR 2014) Software Demonstration, 2014. pdf
M. Zortea, B. Menegola, A. Villavicencio, J. F. de Salles. Estratégias de evocação lexical com critério semântico em adultos após acidente vascular cerebral no hemisfério direito. In Psicologia: Reflexão e Crítica, v. 27, n.2, 2014. pdf
N. Becker, J. L. Müller, J. C. Rodrigues, A. Villavicencio, J. F. de Salles. Análise de Grafos de Associação Semântica de Palavras entre Crianças, Adultos e Idosos. In Letrônica, v. 7, n.1, 2014. pdf
2013
A. Villavicencio, T. Poibeau, A. Korhonen, A. Alishahi (eds). Cognitive Aspects of Computational Language Acquisition. Springer, 2013. pdf
A. Villavicencio, M. Idiart, R. Berwick, and I. Malioutov. Language Acquisition and Probabilistic Models: keeping it simple. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics. pdf
C. Ramisch, A. Villavicencio, V. Kordoni. Introduction to the special issue on multiword expressions: From theory to practice and use. In ACM Transactions on Speech and Language Processing (TSLP) - Special issue on multiword expressions: From theory to practice and use, part 1, v. 10 n.2, June 2013. pdf
M. Parente, A. Villavicencio, M. Siqueira, C. Ping, L. Tonietto. The Lexical Bootstrapping Hypothesis and conventionality: a crosslinguistic study on verb acquisition by Chinese Mandarin- and Brazilian Portuguese-speaking children. In D. Bittner and N. Ruhlig (Eds). Lexical Bootstrapping. The Role of Lexis and Semantics in Child Language Development. Berlin, Boston: De Gruyter Mouton, 2013. pdf
2012
L. Almeida, M. Idiart, A. Villavicencio, J. Lisman. Alternating predictive and short-term memory modes of entorhinal grid cells. In Hippocampus, May 2, 2012. [pdf]
A. Villavicencio, M. Idiart, C. Ramisch, V. Araujo, B. Yankama, R.C. Berwick. Get out but don’t fall down: verb-particle constructions in child language. In Proceedings of the EACL-2012 Workshop on Computational Models of Language Acquisition and Loss. pdf
A. Villavicencio, B. Yankama, R. Wilkens, M. Idiart, R.C. Berwick. An annotated English child language database. In Proceedings of the EACL 2012 Workshop on Computational Models of Language Acquisition and Loss. pdf
A. Villavicencio, B. Yankama, M. Idiart and R.C. Berwick. A large scale annotated child language construction database. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC’12). pdf
H. Caseli, A. Villavicencio, A. Teixeira, F. Perdigão. Proceedings of the 10th International Conference Computational Processing of the Portuguese Language. Springer. pdf
A. Villavicencio. Book Review: Syntax-Based Collocation Extraction (by Violeta Seretan). In Natural Language Engineering, v. 18, n. 4 (October 2012), pp. 575-579. pdf
R. Wilkens, A. Villavicencio. I say have you say tem: profiling verbs in children data in English and Portuguese. In Proceedings of the EACL 2012 Workshop on Computational Models of Language Acquisition and Loss. pdf
C. Ramisch, V. Araujo, A. Villavicencio. A Broad Evaluation of Techniques for Automatic Acquisition of Multiword Expressions. In Proceedings of the ACL 2012 Student Research Workshop. pdf
R. Granada, L. Lopes, C. Ramisch, C. Trojahn, R. Vieira, A. Villavicencio. A Comparable Corpus Based on Aligned Multilingual Ontologies. In Proceedings of the ACL 2012 Workshop on Multilingual Modeling. pdf
2011
Villavicencio, A. Language Acquisition with a Unification-Based Grammar. In R. Borsley and K. Börjars (eds.) Non-transformational Syntax. Blackwell, 2011. [pdf]
Duran, M., C. Ramisch, S. Aluísio, A. Villavicencio. Identifying and Analyzing Brazilian Portuguese Complex Predicates. In Proceedings of the ACL-2011 Workshop on Multiword Expressions: from Parsing and Generation to the Real World, 2011. [pdf]
Acosta, O., A. Villavicencio, V. Moreira. Identification and Treatment of Multiword Expressions Applied to Information Retrieval. In Proceedings of the ACL-2011 Workshop on Multiword Expressions: from Parsing and Generation to the Real World, 2011. [pdf]
Araujo, V., C. Ramisch, A. Villavicencio. Fast and Flexible MWE Candidate Generation with the mwetoolkit. In Proceedings of the ACL-2011 Workshop on Multiword Expressions: from Parsing and Generation to the Real World, 2011. [pdf]
Schreiner, P., A. Villavicencio, L. Zilio, H. M. Caseli. Improving Lexical Alignment Using Hybrid Discriminative and Post-Processing Techniques. In Proceedings of the 8th Brazilian Symposium in Information and Human Language Technology. [pdf]
Prestes, K., R. Wilkens, L. Zilio, A. Villavicencio. Extração e Validação de Ontologias a partir de Recursos Digitais. In Proceedings of the 6th International Workshop on Metamodels, Ontologies and Semantic Technologies. [pdf]
Gonçalves, G., R. Wilkens, A. Villavicencio. Sistema de Aquisição semi-automática de Ontologias. In Proceedings of the 6th International Workshop on Metamodels, Ontologies and Semantic Technologies. [pdf]
Bocorny, A., A. Villavicencio, R. Wilkens, C. Killian. Flexible Media Environments for Collaborative Lexicography. In Proceedings of Electronic lexicography in the 21st century.
2010
Caseli, H.M., Nunes, M.G.V., Ramisch, C., and Villavicencio, A. Alignment-based extraction of multiword expressions. Language Resources and Evaluation, Special Issue on Multiword Expressions. Volume 44, Number 1-2, p. 59-77, 2010. [pdf]
Ramisch, C., Caseli, H. M., Villavicencio, A., Finatto, M. J. B., Machado, A. A Hybrid Approach for Multiword Expression Identification. In T. Pardo, A. Branco, A. Klautau, R. Vieira, V. Lima (eds.) Computational Processing of the Portuguese Language, 9th International Conference. Lecture Notes in Computer Science 6001 Springer 2010 (ISBN 978-3-642-12319-1). [pdf]
Villavicencio, A., Ramisch, C., Machado, A., Caseli, H.M., Finatto, M. J. B. Identificação de Expressões Multipalavra em Domínios Específicos. Linguamática. Volume 2, Number 1, p.15-33, 2010. [pdf]
Ramisch, C., A. Villavicencio, C. Boitet. Web-based and combined language models: a case study on noun compound identification. In Proceedings of Coling 2010, Beijing, p. 1041-1049, 2010. [pdf]
Ramisch, C., Villavicencio, A., Boitet, C. Multiword Expressions in the wild? The mwetoolkit comes in handy. In Proceedings of Coling 2010 Demonstrations, Beijing, p. 57-60, 2010. [pdf]
Wilkens, R., A. Villavicencio, D. Muller, L. Wives, F. Silva, S. Loh. COMUNICA - A Question Answering System for Brazilian Portuguese. In Proceedings of Coling 2010 Demonstrations, Beijing, p. 21-24, 2010. [pdf]
Santos, A. S., A. Villavicencio, J. Salles. Investigating characteristics of semantic networks of verbs in patients with Alzheimer’s disease. In Proceedings of Interdisciplinary Workshop on Verbs. The Identification and Representation of Verb Features, Pisa, 2010. [pdf]
Germann, D., A. Villavicencio, M.S.G Siqueira. Modeling the Lexical Organization of Verbs. In Proceedings of NAACL-HLT 2010 Workshop on Computational Neurolinguistics, Los Angeles, 2010. [pdf]
Germann, D., A. Villavicencio, M.S.G Siqueira. An Investigation on the Influence of Frequency on the Lexical Organization of Verbs. In Proceedings of TextGraphs-5: Graph-based Methods for Natural Language Processing, Uppsala, 2010. [pdf]
Wilkens, R., A. Villavicencio. Question Answering for Portuguese: how much is needed? In Proceedings of Brazilian Symposium on Artificial Intelligence (SBIA), 2010. [pdf]
Linardaki, E., C. Ramisch, A. Villavicencio, A. Fotopoulou. Towards the Construction of Language Resources for Greek Multiword Expressions: Extraction and Evaluation. In Proceedings of the LREC Workshop on Exploitation of multilingual resources and tools for Central and (South) Eastern European Languages, Malta, 2010. [pdf]
2009
Baldwin, T., Kordoni, V. and Villavicencio, A. Prepositions in Applications: A Survey and Introduction to the Special Issue. Computational Linguistics, 35(2), pp. 119-149, 2009. [pdf]
Villavicencio, A., H.M. Caseli, A. Machado. Identification of Multiword Expressions in Technical Domains: Investigating Statistical and Alignment-based Approaches. In Proceedings of the 7th Brazilian Symposium in Information and Human Language Technology, São Carlos, 2009. [pdf]
Caseli, H. M., A. Villavicencio, A. Machado, M. J. B. Finatto. Statistically-Driven Alignment-Based Multiword Expression Identification for Technical Domains. In Proceedings of the ACL/IJCNLP Workshop on Multiword Expressions: Identification, Interpretation, Disambiguation and Applications, Singapore, 2009. [pdf]
2008
Tonietto, L., Villavicencio, A., Siqueira, M. S. G., Parente, M. A. M. P., Sperb, T. M. A especificidade semântica como fator determinante na aquisição de verbos. Revista Psico, v. 29, p. 343-351, 2008. [pdf]
Ramisch, C., A. Villavicencio, L. Moura and M. Idiart. Picking them up and Figuring them out: Verb-Particle Constructions, Noise and Idiomaticity. In Proceedings of the Twelfth Conference on Computational Natural Language Learning, Manchester, 2008. [pdf]
Ramisch, C., P. Schreiner, M. Idiart and A. Villavicencio. An Evaluation of Methods for the Extraction of Multiword Expressions. In Proceedings of the LREC 2008 Workshop on Multiword Expressions, Marrakech, 2008. [pdf]
Villavicencio, A., B. Menegola, J. Rodrigues, M. Siqueira, M.A. Parente. Lexical Organization and its Dissolution in Ageing. In Proceedings of the 2008 Mid-Year Meeting of the International Neuropsychological Society, Buenos Aires, 2008.
2007
Villavicencio, A., V. Kordoni, Y. Zhang, M. Idiart, and C. Ramisch. Validation and evaluation of automatically acquired multiword expressions for grammar engineering. In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL 2007), Prague, 2007. [pdf]
Machado, M., A. Villavicencio. Examining Syntactic Constructions for Verb Meaning Acquisition. In Proceedings of EPIA-2007 Workshop on General Artificial Intelligence, Lisbon, 2007. [pdf]
2006
Zhang, Y., V. Kordoni, A. Villavicencio and M. Idiart. Automated Multiword Expression Prediction for Grammar Engineering. In Proceedings of COLING/ACL Workshop on Multiword Expressions: Identifying and Exploiting Underlying Properties, Sydney, Australia, 2006. [pdf]
Sadler, L., D. Arnold, and A. Villavicencio. Portuguese: Corpora, Coordination and Agreement. In Proceedings of Linguistic Evidence: Empirical, Theoretical, and Computational Perspectives, Tübingen, 2006. [pdf]
2005
Villavicencio, A. The Availability of Verb-Particle Constructions: How much is Enough? Computer Speech & Language, v. 19, n. 4, p. 415-432, 2005. [pdf]
Villavicencio, A., Bond, F., Korhonen, A., McCarthy, D. Introduction to the Special Issue on Multiword Expressions: having a crack at a hard nut. Computer Speech & Language, v. 19, n. 4, p. 365-377, 2005. [pdf]
Villavicencio, A., Sadler, L. and Arnold, D. An HPSG Account of Closest Conjunct Agreement in NP Coordination in Portuguese. In Proceedings of the 12th International Conference on Head-Driven Phrase Structure Grammar, 2005. [pdf]
Villavicencio, A., Sadler, L. Agreement Patterns in Corpora. In Proceedings of Workshop on Exploring Syntactically Annotated Corpora. Corpus Linguistics 2005, Birmingham, 2005. [pdf]
Villavicencio, A., M.J. Finatto, V. Possamai. Padrões da Preposição “DE” entre Sintagmas Nominais em Linguagem Cotidiana e Linguagens Técnico-Científicas. In Proceedings of the V Encontro de Corpora, São Carlos, 2005. [pdf]
2004
Villavicencio, A., T. Baldwin, B. Waldron. A Multilingual Database of Idioms. In Proceedings of the 4th International Conference On Language Resources and Evaluation, LREC-2004, Lisboa, 2004. [pdf]
Villavicencio, A., A. Copestake, B. Waldron, F. Lambeau (2004). The Lexical Encoding of MWEs. In T.Tanaka, A. Villavicencio, F. Bond, A. Korhonen eds. Proceedings of the ACL 2004 Workshop on Multiword Expressions: Integrating Processing. Barcelona, 2004. [pdf]
2003
Villavicencio, A. Verb-Particle Constructions and Lexical Resources. In Francis Bond, Anna Korhonen, Diana McCarthy and Aline Villavicencio, eds. Proceedings of the ACL 2003 Workshop on Multiword Expressions: Analysis, Acquisition and Treatment, p. 57-64. Sapporo, 2003. [pdf]
Terkourafi, M. and Villavicencio, A. Toward a formalisation of speech-act functions of questions in conversation. In Proceedings of the 2nd CoLogNET-ElsNET Symposium: Questions and Answers: Theoretical and Applied Perspectives. Amsterdam, 2003. [pdf]
Buttery, P. and Villavicencio, A. Language Acquisition and the Universal Grammar. Proceedings of AMLaP'2003: Architectures and Mechanisms for Language Processing. Glasgow, 2003. [pdf]
Villavicencio, A. Verb-Particle Constructions in the World Wide Web. Proceedings of the ACL-SIGSEM Workshop on the Linguistic Dimensions of Prepositions and their use in Computational Linguistics Formalisms and Applications. Toulouse, France, 2003. [pdf]
2002
Villavicencio, A. and A. Copestake. Verb-particle constructions in a computational grammar of English. In Jongbok Kim and Stephen Wechsler, eds., Proceedings of the Ninth International Conference on Head-Driven Phrase Structure Grammar. Kyung-Hee University, Seoul. Stanford: CSLI Publications. Available at: http://cslipublications.stanford.edu/HPSG/3/hpsg02.htm. [pdf]
Copestake, A., F. Lambeau, A. Villavicencio, F. Bond, T. Baldwin, I. A. Sag and D. Flickinger. Multiword expressions: linguistic precision and reusability. Proceedings of the Third conference on Language Resources and Evaluation (LREC-2002), p. 1941-1947. Las Palmas, Canary Islands, 2002. [pdf]
Baldwin, T. and A. Villavicencio. Extracting the unextractable: A case study on verb-particles. Proceedings of the 6th Conference on Natural Language Learning (CoNLL-2002), Taipei, Taiwan, 2002. [pdf]
Villavicencio, A. Learning to distinguish PP arguments from adjuncts. Proceedings of the 6th Conference on Natural Language Learning (CoNLL-2002), Taipei, Taiwan, 2002. [pdf]
2001
Villavicencio, A. The Acquisition of a Unification-Based Generalised Categorial Grammar. Thesis published as Technical Report UCAM-CL-TR-533, Computer Laboratory, University of Cambridge, 2001. [pdf]
2000 and before
Villavicencio, A. The acquisition of word order by a computational learning system. Proceedings of the 2nd Learning Language in Logic Workshop, Lisbon, 2000. [pdf]
Villavicencio, A. The Use of Default Unification in a System of Lexical Types. Proceedings of the Workshop on Linguistic Theory and Grammar Implementation, Birmingham, 2000. [pdf]
Villavicencio, A. Grammatical Learning Using Unification-Based Generalised Categorial Grammars. Proceedings of AMLaP'2000: Architectures and Mechanisms for Language Processing, Leiden, 2000. [pdf]
Villavicencio, A. The Acquisition of a Unification-Based Generalised Categorial Grammar. Proceedings of Cluk, Brighton, 2000. [pdf]
Villavicencio, A. Representing a System of Lexical Types Using Default Unification. Proceedings of EACL Student Session, Bergen, 1999. [pdf]
Villavicencio, A. Building a Wide-Coverage Combinatory Categorial Grammar. MPhil Thesis, University of Cambridge, 1997. [pdf]
Villavicencio, A., Viccari, R.M. Evaluating Stochastic Part-of-Speech Taggers for the Portuguese Language. Second Meeting of the Computational Processing of Spoken and Written Portuguese Language, 1996. [pdf]
Villavicencio, A., Marques, N.M., Lopes, J.G.P., Villavicencio, F. Part-of-Speech tagging for Portuguese Texts. In Jacques Wainer and Ariadne Carvalho, eds. Advances in Artificial Intelligence: Proceedings of the XII Brazilian Symposium on Artificial Intelligence, 1995. Lecture Notes in Artificial Intelligence 991, pp. 323-332. Springer Verlag. [pdf]