Research & funding

My research interests all broadly fall within the remit of variationist linguistics and variation studies, including their interfaces with typology, geolinguistics, and psycholinguistics. I view linguistic variation as a window into the hidden structure of human language and the nature of linguistic knowledge, and I am ultimately interested in what fuels linguistic variation in synchrony and diachrony.

Research interests

  • variation studies (synchronic & diachronic)
  • probabilistic grammar
  • language complexity
  • geolinguistics, dialectology, and dialect typology
  • varieties of English world-wide
  • methods: probabilistic modeling, aggregate analysis techniques, corpus-based dialectometry

Current funded projects

  • Exploring probabilistic grammar(s) in varieties of English around the world
    Funded by a Type II Odysseus grant awarded by the Research Foundation Flanders (FWO) (grant # G.0C59.13N, budget: €856,260)
    2013-2018
The project is situated at the crossroads of research on English as a World Language, usage-based theoretical linguistics, variationist linguistics, and cognitive sociolinguistics. It specifically marries the spirit of the Probabilistic Grammar framework (which posits that grammatical knowledge is experience-based and partially probabilistic) to research along the lines of the "English World-Wide" paradigm (which is concerned with the dialectology and sociolinguistics of post-colonial English-speaking communities around the world). The overarching objective is to understand the lectal plasticity of probabilistic knowledge of English grammar, on the part of language users with diverse regional and cultural backgrounds.
  • Nephological Semantics: Using token clouds for meaning detection in variationist linguistics
    Co-PI with Dirk Geeraerts, Stefania Marzo & Dirk Speelman
    Funded by a C1 grant awarded by the KU Leuven Research Council (grant # 3H150305, budget: €1,271,200)
    2015-2021
The increasing importance of corpus data in linguistics creates a need for appropriate methods for retrieving semantic information from corpora. In the project proposed here, existing computational methods of distributional corpus semantics are further developed in the form of a meaning detection approach based on token clouds, i.e. clusters of distributionally similar attestations of words or expressions in a multidimensional vector space. The first phase of the project has a methodological orientation, focusing on the finetuning of such a 'nephological' method for detecting linguistic meanings in corpus data. In the second phase of the project, the method is put to use in two descriptive research lines: lectometrical research into the relationship between language varieties, and variationist grammar research.
project website

Networking grants

  • Frontiers of language variety
    NWO internationalization grant
    Main applicant: John Nerbonne, collaborators: Peter Auer, Gosse Bouma, Dirk Geeraerts, Charlotte Gooskens, Kris Heylen, Nanna Hilton, Bernd Kortmann, Christian Mair, Stefan Pfänder, Dirk Speelman, Benedikt Szmrecsanyi
    2014-2016
    summary