Publications

Journal articles

Ehret, Katharina. 2024. "Through the compression glass: language complexity and the linguistic structure of compressed strings". Linguistics Vanguard. DOI: https://doi.org/10.1515/lingvan-2022-0140

Ehret, Katharina, Aleksandrs Berdicevskis, Christian Bentz, and Alice Blumenthal-Dramé. 2023. "Measuring language complexity: challenges and opportunities". In: Ehret, Katharina, Aleksandrs Berdicevskis, Christian Bentz, and Alice Blumenthal-Dramé (eds.), Special Issue Measuring Language Complexity, Linguistics Vanguard 9 (s1): 1-8. DOI: https://doi.org/10.1515/lingvan-2022-0133

Ehret, Katharina, Alice Blumenthal-Dramé, Christian Bentz, and Aleksandrs Berdicevskis. 2021. "Meaning and measures: Interpreting and evaluating the meaning of complexity metrics". Frontiers in Communication 6:640510. DOI: https://doi.org/10.3389/fcomm.2021.640510

Ehret, Katharina and Maite Taboada. 2021a. "Characterising online news comments: A multi-dimensional cruise through online registers". Frontiers in Artificial Intelligence - Language and Communication 4 (79): 4:643770. DOI: https://doi.org/10.3389/frai.2021.643770

Ehret, Katharina and Maite Taboada. 2021b. "The interplay of complexity and subjectivity in opinionated discourse". Discourse Studies 23 (2): 141-165. DOI: https://doi.org/10.1177/1461445620966923

Ehret, Katharina. 2021. "An Information-Theoretic View on Language Complexity and Register Variation: Compressing Naturalistic Corpus Data". Corpus Linguistics and Linguistic Theory 17 (2): 383-410. DOI: https://doi.org/10.1515/cllt-2018-0033

Ehret, Katharina and Maite Taboada. 2020. "Are online news comments like face-to-face conversation? A multi-dimensional analysis of an emerging register". Register Studies 2 (1): 1-36. DOI: https://doi.org/10.1075/rs.19012.ehr

Ehret, Katharina and Benedikt Szmrecsanyi. 2019. "Compressing learner language: an information-theoretic measure of complexity in SLA". Second Language Research 35 (1): 23-45. DOI: https://doi.org/10.1177/0267658316669559

Ruette, Tom, Katharina Ehret and Benedikt Szmrecsanyi. 2016a. "A lectometrical analysis of aggregated lexical variation in written Standard English with Semantic Vector Space models". International Journal of Corpus Linguistics 21 (1): 48-79. DOI: https://doi.org/10.1075/ijcl.21.1.03rue

Ehret, Katharina. 2014. "Kolmogorov complexity of morphs and constructions in English". Linguistic Issues in Language Technology 11 (2): 43-71.

Ehret, Katharina, Christoph Wolk and Benedikt Szmrecsanyi. 2014. "Quirky quadratures: on rhythm and weight as constraints on genitive variation in an unconventional dataset". English Language and Linguistics 18 (2): 263-303. DOI: https://doi.org/10.1017/S1360674314000033

Book chapters

Ehret, Katharina and Benedikt Szmrecsanyi. 2016. "An information-theoretic approach to assess linguistic complexity". In: Raffaela Baechler and Guido Seiler (eds.), Complexity, Isolation, and Variation, 71-94. Berlin: de Gruyter.

Ruette, Tom, Katharina Ehret and Benedikt Szmrecsanyi. 2016b. "Frequency effects in lexical sociolectometry are insubstantial". In: Heike Behrens and Stefan Pfänder (eds.), Experience Counts: Frequency Effects in Language, 111-132. Berlin/Boston: de Gruyter.

Conference proceedings (with double-blind peer review)

Babayode, Aminat, Laurens Bosman, Nicole Chan, Katharina Ehret, Ivan Fong, Noelle Harris, Alissa Hewton, Danica Reid, Maite Taboada, and Rebecca Wong. 2023. "Structural linguistic characteristics of podcasts as an emerging register of computer-mediated communication". In: Proceedings of the 10th International Conference on CMC and Social Media Corpora for the Humanities (CMC Corpora 2023). Mannheim, Germany.

Ehret, Katharina. 2018. "Kolmogorov complexity as a universal measure of language complexity". In: Aleksandrs Berdicevskis and Christian Bentz (eds.), Proceedings of the First Shared Task on Measuring Language Complexity, 8-14. Workshop on "Measuring Language Complexity", EvoLang XII, Torun, Poland.

Berdicevskis, Aleksandrs, Çağrı Çöltekin, Katharina Ehret, Kilu von Prince, Daniel Ross, Bill Thompson, Chunxiao Yan, Vera Demberg, Gary Lupyan, Taraka Rama, and Christian Bentz. 2018. "Using Universal Dependencies in cross-linguistic complexity research". In: Proceedings of the Second Workshop on Universal Dependencies (UDW 2018), 8-17. Association for Computational Linguistics.

Theses

Ehret, Katharina. 2017. "An information-theoretic approach to language complexity: variation in naturalistic corpora". FreiDok plus, Universität Freiburg. DOI: https://doi.org/10.6094/UNIFR/12243

Ehret, Katharina Luisa. 2008. "Analyticity and syntheticity in East African English and British English: a register comparison". Freidok, Universität Freiburg. URL: http://freidok.uni-freiburg.de/volltexte/5804/