10-17-2025
E 388D | Text in Code: Digital Methods for Language and Culture
Graduate course, Fall 2026
This course is for students from linguistics, literature, and others who study textual culture. It introduces computational methods for the analysis of textual data, with a focus on scripting in the R programming language. Assuming no prior programming experience, the course emphasizes hands-on learning in corpus building, text mining, statistical analysis, and data visualization. Students will engage with sample studies from synchronic and diachronic linguistics as well as digital humanities, exploring how digital tools can illuminate patterns in language and culture. While instruction centers on R, students will also explore LLM chatbots to support their coding practice. Grading based on readings, short presentations, and an original research project.
Linguistic studies on our reading list will include corpus linguistic approaches to variation and alternation in Present-Day English as well as corpus studies of language change. On the cultural front, we will start by examining distant reading and its recent refinements, then look at emerging trends in DH text analysis using topic modeling, stylometry, sentiment analysis, corpus-based discourse analysis, and machine learning techniques for interpreting narrative and genre.