Corpus tool
We will use Sketch Engine throughout the summer course. Please make sure your Sketch Engine account is still active during this period.
Schedule
Good morning, everyone.
Welcome to DipSCORLING, which stands for Diponegoro Summer Course in Corpus Linguistics. DipSCORLING is an international summer course organized by the English Department of the Faculty of Humanities, Universitas Diponegoro. Since its first edition in 2023, the program has brought together participants, researchers, lecturers, students, and language enthusiasts from many countries to learn about corpus linguistics from leading scholars around the world. Corpus linguistics is the study of language through large collections of authentic texts, known as corpora. Instead of relying only on intuition, corpus linguistics allows us to investigate how language is actually used in real-life communication through systematic analysis of linguistic data. Over the next two days, we will explore the fundamental concepts and techniques of corpus linguistics, ranging from corpus analysis tools to practical methods for investigating language patterns.
Introduction to Corpus Linguistics – Gede Primahadi Wijaya Rajeg, Ph.D.
This session provides the foundation of corpus linguistics. Participants will learn what a corpus is, why corpus-based approaches are important in modern language research, and how corpora are used to investigate language patterns and linguistic phenomena.
Concordance – Karlina Denistia, Ph.D.
A concordance displays all occurrences of a word or phrase in its surrounding context. In this session, participants will learn how concordance lines help researchers examine meaning, grammar, discourse patterns, and language use in authentic contexts.
Frequency List – Karlina Denistia, Ph.D.
Frequency analysis shows which words, phrases, or linguistic features occur most often in a corpus. Participants will learn how frequency lists can reveal important patterns, themes, and characteristics of a text collection.
Collocation – Dr. Peter Crosthwaite
Collocation refers to words that frequently occur together, such as strong coffee or make a decision. This session explores how collocation analysis helps researchers understand vocabulary usage, phraseology, meaning, and language fluency.
Diponegoro Special Lecture – Dr. Peter Crosthwaite
Our special lecture presents advanced insights and current developments in corpus linguistics and related research areas. This session offers participants an opportunity to engage with cutting-edge scholarship from an internationally recognized expert in the field.
Before we conclude, I would also like to take this opportunity to briefly introduce the Master’s Program in Linguistics at Universitas Diponegoro.
For those of you who are considering further study, our program offers a number of attractive opportunities. Several competitive scholarship schemes are available for prospective Indonesian and international students to fund your studies. Depending on the scheme, the support may cover tuition fees, monthly stipends, and even international airfare.
We also offer international pathways through our joint degree collaborations with the University of Sheffield in the United Kingdom and the University of Otago in New Zealand. These partnerships allow students to gain international academic experience while pursuing their graduate studies.
For participants who are particularly interested in corpus linguistics, digital humanities, language technology, computational linguistics, or corpus-based language research, I would be happy to discuss potential supervision opportunities. If your research interests align with these areas, please feel free to reach out and have a conversation with me after the course.
Further information about funding opportunities and admission requirements can be found on my website at prihantoro.com/funding.
I would also like to extend an invitation to fellow lecturers and researchers joining us today, Indonesians or expatriates. Universitas Diponegoro actively welcomes international and national collaborations in research, postgraduate supervision, teaching, community engagement, and various academic activities.
There are several collaboration schemes available, including postdoctoral fellowships, visiting professorships, Undip Global Classroom initiatives, as well as both onsite and online collaborative programs. Depending on the scheme, support may include, APC payment, airfare, visa assistance, accommodation, and financial incentives.
If you are interested in exploring collaboration opportunities with Universitas Diponegoro or with our research groups, I would be delighted to discuss possibilities with you. More information about available schemes and requirements can be found at prihantoro.com/collaboration.
Thank you once again for joining DipSCORLING, and I hope this course will be the beginning of many future academic collaborations and friendships.
Is DDL dead yet? Not so fast! Revisiting earlier claims on the state of corpus linguistics, DDL, and generative AI
Abstract
This forum article revisits earlier claims about the relationship between corpus linguistics, data-driven learning (DDL), and generative AI three years after initial discussions of these issues. Drawing on recent developments in large language model (LLM) architectures, prompting strategies, and corpus-AI integration, the article reflects on how current research practices still need to evolve beyond traditional concordancing and basic AI-assisted analysis. It highlights emerging opportunities for combining generative AI with corpus resources in EAP and LSP contexts, while also emphasising the need for greater methodological clarity regarding prompting, data integration, and the interpretation of AI-generated language. The article concludes by arguing that the future of corpus-informed language research and pedagogy will likely lie in even closer integration between empirical corpus evidence and AI-mediated language technologies.