Publications
Journal articles
Ehret, Katharina. 2024. "Through the compression glass: language complexity and the linguistic structure of compressed strings". Linguistics Vanguard. DOI: https://doi.org/10.1515/lingvan-2022-0140
Ehret, Katharina, Aleksandrs Berdicevskis, Christian Bentz, and Alice Blumenthal-Dramé. 2023. "Measuring language complexity: challenges and opportunities". In: Ehret, Katharina, Aleksandrs Berdicevskis, Christian Bentz, and Alice Blumenthal-Dramé (eds.), Special Issue Measuring Language Complexity, Linguistics Vanguard 9 (s1): 1-8. DOI: https://doi.org/10.1515/lingvan-2022-0133
Ehret, Katharina, Alice Blumenthal-Dramé, Christian Bentz, and Aleksandrs Berdicevskis. 2021. "Meaning and measures: Interpreting and evaluating the meaning of complexity metrics". Frontiers in Communication. 6:640510. DOI: https://doi.org/10.3389/fcomm.2021.640510
Ehret, Katharina and Maite Taboada. 2021a. "Characterising online news comments: A multi-dimensional cruise through online registers". Frontiers in Artificial Intelligence - Language and Communication 4 (79): 4:643770. DOI: https://doi.org/10.3389/frai.2021.643770
Ehret, Katharina and Maite Taboada. 2021b. "The interplay of complexity and subjectivity in opinionated discourse". Discourse Studies 23 (2): 141-165. DOI: https://doi.org/10.1177/1461445620966923
Ehret, Katharina. 2021. "An Information-Theoretic View on Language Complexity and Register Variation: Compressing Naturalistic Corpus Data". Corpus Linguistics and Linguistic Theory 17 (2): 383-410. DOI: https://doi.org/10.1515/cllt-2018-0033
Ehret, Katharina and Maite Taboada. 2020. "Are online news comments like face-to-face conversation? A multi-dimensional analysis of an emerging register". Register Studies 2 (1): 1-36. DOI: https://doi.org/10.1075/rs.19012.ehr
Ehret, Katharina and Benedikt Szmrecsanyi. 2019. "Compressing learner language: an information-theoretic measure of complexity in SLA". Second Language Research 35 (1): 23-45. DOI: https://doi.org/10.1177/0267658316669559
Ruette, Tom, Katharina Ehret and Benedikt Szmrecsanyi. 2016a. "A lectometrical analysis of aggregated lexical variation in written Standard English with Semantic Vector Space models". International Journal of Corpus Linguistics 21 (1): 48-79. DOI: https://doi.org/10.1075/ijcl.21.1.03rue
Ehret, Katharina. 2014. "Kolmogorov complexity of morphs and constructions in English". Linguistic Issues in Language Technology 11 (2): 43-71.
Ehret, Katharina, Christoph Wolk and Benedikt Szmrecsanyi. 2014. "Quirky quadratures: on rhythm and weight as constraints on genitive variation in an unconventional dataset". English Language and Linguistics 18 (2): 263-303. DOI: https://doi.org/10.1017/S1360674314000033
Book chapters
Ehret, Katharina and Benedikt Szmrecsanyi. 2016. "An information-theoretic approach to assess linguistic complexity". In: Raffaela Baechler and Guido Seiler (eds.), Complexity, Isolation, and Variation, 71-94. Berlin: de Gruyter.
Ruette, Tom, Katharina Ehret and Benedikt Szmrecsanyi. 2016b. "Frequency effects in lexical sociolectometry are insubstantial". In: Heike Behrens and Stefan Pfänder (eds.), Experience Counts: Frequency Effects in Language, 111-132. Berlin/Boston: de Gruyter.
Conference proceedings (with double-blind peer review)
Babayode, Aminat, Laurens Bosman, Nicole Chan, Katharina Ehret, Ivan Fong, Noelle Harris, Alissa Hewton, Danica Reid, Maite Taboada, and Rebecca Wong. 2023. "Structural linguistic characteristics of podcasts as an emerging register of computer-mediated communication". In: Proceedings of the 10th International Conference on CMC and Social Media Corpora for the Humanities (CMC Corpora 2023). Mannheim, Germany.
Ehret, Katharina. 2018. "Kolmogorov complexity as a universal measure of language complexity". In: Aleksandrs Berdicevskis and Christian Bentz (eds.), Proceedings of the First Shared Task on Measuring Language Complexity, 8-14. Workshop on "Measuring Language Complexity", EvoLang XII, Torun, Poland.
Berdicevskis, Aleksandrs, Çağrı Çöltekin, Katharina Ehret, Kilu von Prince, Daniel Ross, Bill Thompson, Chunxiao Yan, Vera Demberg, Gary Lupyan, Taraka Rama, and Christian Bentz. 2018. "Using Universal Dependencies in cross-linguistic complexity research". In: Proceedings of the Second Workshop on Universal Dependencies (UDW 2018), 8-17. Association for Computational Linguistics.
Theses
Ehret, Katharina. 2017. "An information-theoretic approach to language complexity: variation in naturalistic corpora". FreiDok plus, Universität Freiburg. DOI: https://doi.org/10.6094/UNIFR/12243
Ehret, Katharina Luisa. 2008. "Analyticity and syntheticity in East African English and British English: a register comparison". Freidok, Universität Freiburg. URL: http://freidok.uni-freiburg.de/volltexte/5804/