
to appear & in press

  • Ehret, Katharina & Benedikt Szmrecsanyi (in press). "Compressing learner language: an information-theoretic measure of complexity in SLA production data". Second Language Research.
    DOI: 10.1177/0267658316669559 (Sage OnlineFirst) | manuscript 
  • Grafmiller, Jason, Benedikt Szmrecsanyi & Lars Hinrichs (in press). "Restricting the restrictive relativizer: Constraints on subject and non-subject English relative clauses". Corpus Linguistics and Linguistic Theory.
    ahead-of-print version | pdf manuscript
  • Röthlisberger, Melanie, Jason Grafmiller & Benedikt Szmrecsanyi (to appear). "Cognitive indigenization effects in the English dative alternation". Accepted for publication in Cognitive Linguistics.
    pdf manuscript
  • Szmrecsanyi, Benedikt (in press). "Variationist sociolinguistics and corpus-based variationist linguistics: overlap and cross-pollination potential". Canadian Journal of Linguistics/Revue canadienne de linguistique.
    uncorrected page proofs
  • Szmrecsanyi, Benedikt & Lieselotte Anderwald (to appear). "Corpus-based approaches to dialect study". In: Charles Boberg, Dominic Watt & John Nerbonne (eds.), Handbook of Dialectology. Malden, MA: Wiley-Blackwell.
    uncorrected page proofs
  • Szmrecsanyi, Benedikt & Daniela Kolbe-Hanna (to appear). "New ways of analyzing dialect grammars: complementizer omission in traditional British English dialects". In: Stefan Grondelaers & Roeland van Hout (eds.), New Ways of Analyzing Syntactic Variation. Berlin, Boston: de Gruyter Mouton.
    pdf manuscript
  • Szmrecsanyi, Benedikt, Jason Grafmiller, Joan Bresnan, Anette Rosenbach, Sali Tagliamonte & Simon Todd (to appear). "Spoken syntax in a comparative perspective: the dative and genitive alternation in varieties of English". Accepted for publication in Glossa: a journal of general linguistics.
    pdf manuscript


  • Grafmiller, Jason, Benedikt Szmrecsanyi, Melanie Röthlisberger & Benedikt Heller (eds) (2017). Special collection on "Probabilistic grammars: Syntactic variation in a comparative perspective". Glossa: a journal of general linguistics.
  • Heller, Benedikt, Benedikt Szmrecsanyi & Jason Grafmiller (2017). "Stability and fluidity in syntactic variation world-wide: the genitive alternation across varieties of English". Journal of English Linguistics 45(1): 3–27.
    DOI: 10.1177/0075424216685405
  • Szmrecsanyi, Benedikt (2017). "Featurometry". In: Martijn Wieling, Martin Kroon, Gertjan van Noord & Gosse Bouma (eds.), From Semantics to Dialectometry. Festschrift in honor of John Nerbonne (Tributes 32). College Publications, 345–353.
    pdf version


  • Biber, Douglas, Jesse Egbert, Bethany Gray, Rahel Oppliger & Benedikt Szmrecsanyi (2016). "Variationist versus text-linguistic approaches to grammatical change in English: Nominal modifiers of head nouns". In: Merja Kytö & Päivi Pahta (eds.), Handbook of English historical linguistics. Cambridge: Cambridge University Press, 351-375.
    pdf manuscript
  • Ehret, Katharina & Benedikt Szmrecsanyi (2016). "An information-theoretic approach to assess linguistic complexity". In: Raffaela Baechler & Guido Seiler (eds.), Complexity and Isolation. Berlin: de Gruyter, 71-94.
    DOI: 10.1515/9783110348965-004 | uncorrected page proofs
  • Ruette, Tom, Katharina Ehret & Benedikt Szmrecsanyi (2016). "A lectometric analysis of aggregated lexical variation in written Standard English with Semantic Vector Space models". International Journal of Corpus Linguistics 21(1): 48-79.
    DOI: 10.1075/ijcl.21.1.03rue | uncorrected page proofs
  • Ruette, Tom, Katharina Ehret & Benedikt Szmrecsanyi (2016). "Frequency effects in lexical sociolectometry are insubstantial". In: Heike Behrens & Stefan Pfänder (eds.), Experience Counts: Frequency Effects in Language. Berlin, Boston: De Gruyter, 111-132.
    DOI: 10.1515/9783110346916-006 | pdf manuscript
  • Szmrecsanyi, Benedikt (2016). "About text frequencies in historical linguistics: disentangling environmental and grammatical change". Corpus Linguistics and Linguistic Theory 12(1): 153-171 (special issue, ed. by Martin Hilpert & Hubert Cuyckens).
    DOI: 10.1515/cllt-2015-0068 | uncorrected page proofs
  • Szmrecsanyi, Benedikt (2016). "An analytic-synthetic spiral in the history of English". In: Elly van Gelderen (ed.), Cyclical Change Continued. Amsterdam: Benjamins, 93-112.
    DOI: 10.1075/la.227.04szm | uncorrected page proofs
  • Szmrecsanyi, Benedikt, Douglas Biber, Jesse Egbert & Karlien Franco (2016). "Toward more accountability: Modeling ternary genitive variation in Late Modern English". Language Variation and Change 28(1): 1-29.
    DOI: 10.1017/S0954394515000198 | uncorrected page proofs | download the dataset here

  • Szmrecsanyi, Benedikt, Jason Grafmiller, Benedikt Heller & Melanie Röthlisberger (2016). "Around the world in three alternations: Modeling syntactic variation in varieties of English". English World-Wide 37(2): 109-137.
    DOI: 10.1075/eww.37.2.01szm | uncorrected page proofs
  • Wolk, Christoph & Benedikt Szmrecsanyi (2016). "Top-down and bottom-up advances in corpus-based dialectometry". In: Marie-Hélène Côté, Remco Knooihuizen & John Nerbonne (eds.), The future of dialects: Selected papers from Methods in Dialectology XV. (Series: Language Variation 1). Berlin: Language Science Press.
    DOI: 10.17169/langsci.b81.152 (open access) | pdf copy


    • Frazee, Joseph, Lars Hinrichs, Benedikt Szmrecsanyi & Axel Bohmann (2015). "Which-hunting and the Standard English relative clause: Online Supplement: Automatic Zero-Relative Detection". Language 91(4): s1-s3.
      DOI: 10.1353/lan.2015.0070 (Open Access)
    • Kolbe-Hanna, Daniela & Benedikt Szmrecsanyi (2015). "Grammatical variation". In: Douglas Biber & Randi Reppen (eds.), The Cambridge handbook of English corpus linguistics. Cambridge: Cambridge University Press, 161-179.
      pdf manuscript
    • Szmrecsanyi, Benedikt (2015). "Recontextualizing language complexity". In: Jocelyne Daems, Eline Zenner, Kris Heylen, Dirk Speelman & Hubert Cuyckens (eds.), Change of Paradigms – New Paradoxes: Recontextualizing Language and Linguistics. Berlin, Boston: de Gruyter Mouton, 347-360.
      DOI: 10.1515/9783110435597-020 | preprint pdf


    Szmrecsanyi, Benedikt & Bernhard Wälchli (eds) (2014). Aggregating dialectology, typology, and register analysis: Linguistic variation in text and speech (Series: linguae & litterae). Berlin: de Gruyter.
    DOI: 10.1515/9783110317558 | ISBN: 9783110317398
    book website 

    • Cuyckens, Hubert, Frauke D’hoedt & Benedikt Szmrecsanyi (2014). "Variability in verb complementation in Late Modern English: finite vs. non-finite patterns". In: Marianne Hundt (ed.), Late Modern English Syntax. Cambridge, New York: Cambridge University Press, 182-203.
      pdf manuscript
    • Ehret, Katharina, Christoph Wolk & Benedikt Szmrecsanyi (2014). "Quirky quadratures: on rhythm and weight as constraints on genitive variation in an unconventional dataset". English Language and Linguistics 18(2): 263–303 (special issue, ed. by Eva Berlage & John Payne).
      DOI: 10.1017/S1360674314000033 | uncorrected page proofs
    • Szmrecsanyi, Benedikt (2014). "Forests, trees, corpora, and dialect grammars". In: Benedikt Szmrecsanyi & Bernhard Wälchli (eds). Aggregating dialectology, typology, and register analysis: Linguistic variation in text and speech. Berlin: de Gruyter, 89-112.
      DOI: 10.1515/9783110317558.89 | pdf manuscript
    • Szmrecsanyi, Benedikt (2014). "Methods and objectives in contemporary dialectology". In: Ilja A. Seržant & Björn Wiemer (eds.), Contemporary approaches to dialectology: The area of North, Northwest Russian and Belarusian vernaculars // Современные методы в диалектологии. Ареал северных, северо-западных русских и белорусских говоров. Bergen: Department of Foreign Languages, University of Bergen, 81-92.
      pdf manuscript
    • Szmrecsanyi, Benedikt, Anette Rosenbach, Joan Bresnan & Christoph Wolk (2014). "Culturally conditioned language change? A multi-variate analysis of genitive constructions in ARCHER". In: Marianne Hundt (ed.), Late Modern English Syntax. Cambridge, New York: Cambridge University Press, 133-152.
      pdf manuscript
    • Wälchli, Bernhard & Benedikt Szmrecsanyi (2014). "Introduction: The text-feature-aggregation pipeline in variation studies". In: Benedikt Szmrecsanyi & Bernhard Wälchli (eds). Aggregating dialectology, typology, and register analysis: Linguistic variation in text and speech. Berlin: de Gruyter, 1-25.
      DOI: 10.1515/9783110317558.1 | uncorrected page proofs


    Auer, Peter, Martin Hilpert, Anja Stukenbrock & Benedikt Szmrecsanyi (eds) (2013). Space in language and linguistics: geographical, interactional, and cognitive perspectives (Series: linguae & litterae). Berlin: de Gruyter.
    DOI: 10.1515/9783110312027 | ISBN: 9783110312027 | book website
    {reviewed by Asya Pereltsvaig in LINGUIST List 25.4160, 2014}

    Szmrecsanyi, Benedikt (2013). Grammatical Variation in British English Dialects: A Study in Corpus-Based Dialectometry. (Series: Studies in English Language). Cambridge: Cambridge University Press.
    ISBN: 9781107003453 | book website | download the dataset here
    {reviewed by James A. Walker in ICAME Journal 38, 2014, 216-220}
    {reviewed by Warren Maguire in Language 90(4), 2014, 983-986}
    {reviewed by Sali A. Tagliamonte in Diachronica 31(4), 2014, 579-583}
    {reviewed by Luke A Rudge in LINGUIST List 26.3841, 2015}

    • Auer, Peter, Martin Hilpert, Anja Stukenbrock & Benedikt Szmrecsanyi (2013). "Integrating the perspectives on language and space". In: Auer, Peter, Martin Hilpert, Anja Stukenbrock & Benedikt Szmrecsanyi (eds). Space in language and linguistics: geographical, interactional, and cognitive perspectives. Berlin: de Gruyter, 1-18.
      DOI: 10.1515/9783110312027.1 | preprint pdf
    • Szmrecsanyi, Benedikt (2013). "Analyzing aggregated linguistic data". In: Krug, Manfred & Julia Schlüter (eds.), Research Methods in Language Variation and Change. Cambridge: Cambridge University Press, 433-455.
      preprint pdf
    • Szmrecsanyi, Benedikt (2013). "Commentary: Lost in Space? The many geographies and methodologies in research on variation within languages". In: Auer, Peter, Martin Hilpert, Anja Stukenbrock & Benedikt Szmrecsanyi (eds). Space in language and linguistics: geographical, interactional, and cognitive perspectives. Berlin: de Gruyter, 240-244.
      DOI: 10.1515/9783110312027.240 | preprint pdf
    • Szmrecsanyi, Benedikt (2013). "Diachronic Probabilistic Grammar". English Language and Linguistics (Journal of the English Linguistics Society of Korea) 19(3): 41-68.
      pdf manuscript
    • Szmrecsanyi, Benedikt (2013). "The great regression: genitive variability in Late Modern English news texts". In: Börjars, Kersti, David Denison & Alan Scott (eds.), Morphosyntactic categories and the expression of possession. Amsterdam: Benjamins, 59–88.
      preprint pdf
    • Szmrecsanyi, Benedikt (2013). "Kapitel 7: Variation und Wandel" ["Chapter 7: Variation and Change"]. Textbook chapter in: Auer, Peter (ed). Sprachwissenschaft: Grammatik - Interaktion - Kognition. Stuttgart: Metzler, 261-284.
      preprint pdf
    • Wolk, Christoph, Joan Bresnan, Anette Rosenbach & Benedikt Szmrecsanyi (2013). "Dative and genitive variability in Late Modern English: Exploring cross-constructional variation and change". Diachronica 30(3): 382–419.
      DOI: 10.1075/dia.30.3.04wol | preprint pdf | download the dataset here


    Kortmann, Bernd & Benedikt Szmrecsanyi (eds) (2012). Linguistic Complexity: Second Language Acquisition, Indigenization, Contact. (Series: linguae & litterae). Berlin: de Gruyter.
    DOI: 10.1515/9783110229226 | ISBN: 9783110229226 | book website
    {reviewed by Natalie Operstein and Amber Clontz in LINGUIST List 23.5022, 2012}

    • Szmrecsanyi, Benedikt (2012). "Analyticity and syntheticity in the history of English". In: Nevalainen, Terttu & Elizabeth Closs Traugott (eds.), The Oxford Handbook of the History of English. Oxford: Oxford University Press, 654-665.
      preprint pdf
    • Szmrecsanyi, Benedikt (2012). "Geography is overrated". In: Hansen, Sandra, Christian Schwarz, Philipp Stoeckle & Tobias Streck (eds.), Dialectological and folk dialectological concepts of space. Berlin: de Gruyter, 215-231.
      DOI: 10.1515/9783110229127.215 | preprint pdf
    • Szmrecsanyi, Benedikt (2012). "Typological profile: L1 varieties". In: Bernd Kortmann & Kerstin Lunkenheimer (eds). The Mouton World Atlas of Variation in English. Berlin, Boston: de Gruyter, 826-843.
      DOI: 10.1515/9783110280128.826 | pdf manuscript
    • Szmrecsanyi, Benedikt & Bernd Kortmann (2012). "Introduction: Linguistic complexity – Second Language Acquisition, indigenization, contact". In: Kortmann, Bernd & Benedikt Szmrecsanyi (eds.), Linguistic Complexity: Second Language Acquisition, Indigenization, Contact. Berlin: de Gruyter, 6-34.
      DOI: 10.1515/9783110229226.6 | preprint pdf


    • Kortmann, Bernd & Benedikt Szmrecsanyi (2011). "Parameters of morphosyntactic variation in World Englishes: prospects and limitations of searching for universals". In: Siemund, Peter (ed), Linguistic Universals and Language Variation. Berlin/New York: De Gruyter Mouton, 264-290.
      DOI: 10.1515/9783110238068.264 | preprint pdf
    reprinted in: Biber, Douglas & Randi Reppen (eds.) (2011), Corpus Linguistics (SAGE Benchmarks in Language and Linguistics). Thousand Oaks: Sage Publications, 259-289.
    • Szmrecsanyi, Benedikt & Bernd Kortmann (2011). "Typological profiling: learner Englishes versus indigenized L2 varieties of English". In: Mukherjee, Joybrato & Marianne Hundt (eds), Exploring Second-Language Varieties of English and Learner Englishes: Bridging a Paradigm Gap. Amsterdam: Benjamins, 167-187.
      preprint pdf


    • Szmrecsanyi, Benedikt (2010). "The English genitive alternation in a cognitive sociolinguistics perspective". In: Geeraerts, Dirk, Gitte Kristiansen & Yves Peirsman (eds), Advances in Cognitive Sociolinguistics. Berlin/New York: De Gruyter Mouton, 141-166.
      DOI: 10.1515/9783110226461.139 | preprint pdf
    • Szmrecsanyi, Benedikt (2010). The morphosyntax of BrE dialects in a corpus-based dialectometrical perspective: feature extraction, coding protocols, projections to geography, summary statistics. URN: urn:nbn:de:bsz:25-opus-73209, URL: Freiburg. (64pp.)


    • Anderwald, Lieselotte & Benedikt Szmrecsanyi (2009). "Corpus linguistics and dialectology". In: Lüdeling, Anke & Merja Kytö (eds), Corpus Linguistics. An International Handbook. (Series: Handbücher zur Sprach- und Kommunikationswissenschaft/Handbooks of Linguistics and Communication Science). Berlin, New York: Mouton de Gruyter, 1126-1139.
      DOI: 10.1515/9783110213881.2.1126 | preprint pdf
    • Anderwald, Lieselotte & Benedikt Szmrecsanyi (2009). "Why grammar is real: a usage-based perspective on patterns". In: Eckstein, Lars & Christoph Reinfandt (eds), Anglistentag 2008 - Tübingen - Proceedings. Trier: Wissenschaftlicher Verlag Trier, 403-414.
      preprint pdf
    • Kortmann, Bernd & Benedikt Szmrecsanyi (2009). "World Englishes between simplification and complexification". In: Siebers, Lucia & Thomas Hoffmann (eds), World Englishes – Problems, Properties and Prospects: selected papers from the 13th IAWE conference. Amsterdam: Benjamins, 265-285.
      preprint pdf
    • Szmrecsanyi, Benedikt (2009). "Typological parameters of intralingual variability: grammatical analyticity versus syntheticity in varieties of English". Language Variation and Change 21(3): 319–353.
      DOI: 10.1017/S0954394509990123 | preprint pdf
    • Szmrecsanyi, Benedikt & Bernd Kortmann (2009). "Between simplification and complexification: non-standard varieties of English around the world". In: Sampson, Geoffrey, David Gil & Peter Trudgill (eds.), Language Complexity as an Evolving Variable. Oxford: Oxford University Press, 64-79.
      preprint pdf
    • Szmrecsanyi, Benedikt & Bernd Kortmann (2009). "The morphosyntax of varieties of English worldwide: a quantitative perspective". Lingua 119(11): 1643-1663 (special issue "The Forests behind the Trees", ed. by John Nerbonne & Franz Manni).
      DOI: 10.1016/j.lingua.2007.09.016 | preprint pdf | download the dataset here
    • Szmrecsanyi, Benedikt & Bernd Kortmann (2009). "Vernacular universals and angloversals in a typological perspective". In: Filppula, Markku, Juhani Klemola & Heli Paulasto (eds), Vernacular Universals and Language Contacts: Evidence from Varieties of English and Beyond. London/New York: Routledge, 33-53.
      preprint pdf | download the dataset here


    • Szmrecsanyi, Benedikt (2008). "Corpus-based dialectometry: aggregate morphosyntactic variability in British English dialects". International Journal of Humanities and Arts Computing 2(1-2): 279-296 (special issue on "Language Variation", ed. by John Nerbonne, Charlotte Gooskens, Sebastian Kürschner & Renée van Bezooijen).
      DOI: 10.3366/E1753854809000433 | preprint pdf
    • Szmrecsanyi, Benedikt & Nils Goldschmidt (2008). "The Cameelious Hump and 'Just So Stories' in economic literature – a linguistic analysis". In: Clift, Edward (ed.), How Language is Used to Do Business: Essays on the Rhetoric of Economics. New York City: Edwin Mellen Press.
      preprint pdf
    • Szmrecsanyi, Benedikt & Lars Hinrichs (2008). "Probabilistic determinants of genitive variation in spoken and written English: a multivariate comparison across time, space, and genres". In: Nevalainen, Terttu, Irma Taavitsainen, Päivi Pahta & Minna Korhonen (eds.), The Dynamics of Linguistic Variation: Corpus Evidence on English Past and Present. Amsterdam: Benjamins, 291-309.
      preprint pdf


    • Goldschmidt, Nils & Benedikt Szmrecsanyi (2007). "What do economists talk about? A linguistic analysis of published writing in economic journals". American Journal of Economics and Sociology 66(2): 335–378.
      DOI: 10.1111/j.1536-7150.2007.00514.x | preprint pdf
    reprinted as "Dimension reduction of word-frequency data as a substitute for intersubjective content analysis" in: Vogt, W. Paul (ed.) (2008), Selecting Research Methods, Vol. 4: Methods for analysing and reporting results (SAGE Benchmarks in Social Research Methods series). Thousand Oaks: Sage Publications.
    • Hinrichs, Lars & Benedikt Szmrecsanyi (2007). "Recent changes in the function and frequency of standard English genitive constructions: a multivariate analysis of tagged corpora". English Language and Linguistics 11(3): 437–474.
      DOI: 10.1017/S1360674307002341 | preprint pdf | download the dataset here
    reprinted in: Biber, Douglas & Randi Reppen (eds.) (2011), Corpus Linguistics (SAGE Benchmarks in Language and Linguistics). Thousand Oaks: Sage Publications, 139-178.
    • Szmrecsanyi, Benedikt & Nuria Hernández (2007). Manual of Information to accompany the Freiburg Corpus of English Dialects Sampler ("FRED-S"). URN: urn:nbn:de:bsz:25-opus-28598, URL: Freiburg. (32pp.)


    Szmrecsanyi, Benedikt (2006). Morphosyntactic persistence in spoken English. A corpus study at the intersection of variationist sociolinguistics, psycholinguistics, and discourse analysis. (Series: Trends in Linguistics. Studies and Monographs). Berlin/New York: Mouton de Gruyter.
    DOI: 10.1515/9783110197808 | ISBN: 9783110190120 | book website
    {reviewed by Ingo Plag in Zeitschrift für Sprachwissenschaft 27, 2008, 293-296}


    • Szmrecsanyi, Benedikt (2005). "Language users as creatures of habit: a corpus-linguistic analysis of persistence in spoken English". Corpus Linguistics and Linguistic Theory 1(1): 113-150.
      DOI: 10.1515/cllt.2005.1.1.113 | preprint pdf
    reprinted in: Biber, Douglas & Randi Reppen (eds.) (2011), Corpus Linguistics (SAGE Benchmarks in Language and Linguistics). Thousand Oaks: Sage Publications, 297-330.
    • Szmrecsanyi, Benedikt (2005). "Never change a winning chunk". Recherches Anglaises et Nord-Américaines 38: 21-34.
      preprint pdf


    • Kortmann, Bernd & Benedikt Szmrecsanyi (2004). "Global synopsis – morphological and syntactic variation in English". In: Kortmann, Bernd, Kate Burridge, Rajend Mesthrie & Edgar Schneider (eds.), A Handbook of Varieties of English, Vol. 2: Morphology and Syntax. Berlin/New York: Mouton de Gruyter, 1142-1202.
      download the dataset here
    • Szmrecsanyi, Benedikt (2004). "On operationalizing syntactic complexity". In: Purnelle, Gérard, Cédrick Fairon & Anne Dister (eds.), Le poids des mots. Proceedings of the 7th International Conference on Textual Data Statistical Analysis. Louvain-la-Neuve, March 10-12, 2004, Vol. 2. Louvain-la-Neuve, Presses universitaires de Louvain. 1032-1039.


    Book reviews

    • Szmrecsanyi, Benedikt (2013). Review of Warren Maguire and April McMahon (eds.), Analysing variation in English. Cambridge. English Language and Linguistics 17(1): 222-226.
      DOI: 10.1017/S136067431200041X 
    • Szmrecsanyi, Benedikt (2011). Review of Bauer, Roland. 2009. Dialektometrische Einsichten. Sprachklassifikatorische Oberflächenmuster und Tiefenstrukturen im lombardo-venedischen Dialektraum und in der Rätoromania. San Martin de Tor. Romanistisches Jahrbuch 62: 242-245.
    • Szmrecsanyi, Benedikt (2011). Review of McWhorter, John H. 2011. Linguistic simplicity and complexity: why do languages undress? Boston: De Gruyter Mouton. Language Dynamics and Change 1(2): 279-283.
      DOI: 10.1163/221058212X643987
    • Szmrecsanyi, Benedikt (2007). Review of Joybrato Mukherjee. 2005. English Ditransitive Verbs: Aspects of Theory, Description and a Usage-based Model. Amsterdam/New York: Rodopi. English Language and Linguistics 11(1): 240-246. 
      DOI: 10.1017/S1360674306232222
    • Szmrecsanyi, Benedikt (2006). Review of Rens Bod, Jennifer Hay, and Stefanie Jannedy (Eds.). 2003. Probabilistic Linguistics. Cambridge, MA: MIT Press. Corpus Linguistics and Linguistic Theory 2(1): 79-94. 
      DOI: 10.1515/CLLT.2006.004