Linguistic Resources and Tools for the Study of Ancient Indo-European Languages
The Pavia repository
This page has been created by Chiara Zanchi (University of Pavia), in cooperation with Erica Biagetti (University of Pavia) and Guglielmo Inglese (KU Leuven). It aims to facilitate the work of students and researchers interested in the study of ancient Indo-European languages.
We have collected and made available here a number of links to the main linguistic resources and tools for the study of ancient Indo-European languages.
We are trying to keep the list up to date. All suggestions are most welcome!
We are grateful to Giovanni B. Boccardo, Martina Giarda, David M. Karaj, Elisa Roma and Matteo Tarsi for helping with specific IE groups and subgroups.
Grammars, dictionaries, electronic texts, annotated corpora, and treebanks
AGLDT (Ancient Greek and Latin Dependency Treebank); online queries available via Structural Search
Alpheios Application: dictionary and morphological analyzer linked to the Perseus Library (see below); downloadable as a browser extension; by clicking on the desired word, one can get its translation and morphological analysis
CLARIN (Common Language Resources and Technology Infrastructure); available non-contemporary IE languages: Medieval Gree, Dutch, English, American English, German, Middle High German, Middle Low German, Old Norse, Scottish Gaelic, Welsh, Swedish, Old French, Slovene
Danish portal for Slavonic, Balkan and East European Studies
Early Indo-European Online Lessons, Indo-European Lexicon, campione di testi digitalizzati e taggati per la morfologia (The University of Texas Austin)
Glottothèque (on-line courses with videos and slides of old IE languages)
Indogermanisches Etymologisches Wörterbuch (IEW) by Julius Pokorny
ISWOC Corpus (Information structure and word order change in Germanic and Romance languages)
Perseus Project (digitalized texts of Ancient Greek, Latin, Old Norse and Old English); new web interface: Perseus 5.0
PROIEL (Pragmatic Resources of Old Indo-European Languages): GitHub repository; web application; new web application: Syntacticus
TITUS (Thesaurus Indogermanischer Text- und Sprachmaterialien)
Universal Dependencies; old IE languages available: Old French, Gothic, Ancient Greek, Latin, Sanskrit, Old Church Slavic
Webpage with additional useful links
Resources and tools for the study of specific groups and subgroups
Latin
ALIM2 (Archivio della Latinità Italiana nel Medioevo)
CIL (Corpus Inscriptionum Latinarum)
CIL @ EAGLE
CLaSSES (Corpus containing non-literary Latin texts - e.g. letters, epigraphs, etc. - annotated with linguistic and extralinguistic metadata; e.g. it allows studying ortographic variants taking into account the sociolinguistic environment of the Roman world)
DigilibLT (Biblioteca digitale di testi latini tardoantichi)
DLL (Digital Latin Library)
Frankfurt Latin Lexicon (Computational Historical Semantics): lexicon of Medieval Latin
IT-TB and linked linguistic resources: IT-Valency Lexicon, LDT = Latin Dependency Treebank), VALLEX (Valency Lexicon for Latin)
LASLA - Laboratoire d’analyse statistique des langues anciennes (Université de Liège)
LemLat 3.0 and WFR (Word Formation Latin)
ΛΟΓΕΙΟΝ (online dictionary of Greek and Latin)
Musisque Deoque (poetic corpus of Latin and Italian until the Reinassance)
REGLA (Rección y complementación en griego y latín: database with verbs and valency frames)
The Digital Latin Library (it collects Latin texts available online to ease their query; texts in different formats)
TLL(Thesaurus Linguae Latinae)
Romance Languages
Old Italian
GDLI (Grande Dizionario della Lingua Italiana)
MIDIA (Morfologia Italiana in DIAcronia)
Musisque Deoque (poetic corpus of Latin and Italian until the Reinassance)
Nuovo De Mauro (focused on contemporary Italian; it nevertheless gives dates for all first attestations)
TLIO (Tesoro della Lingua Italiana delle Origini)
Old Portoguese
Old French
DEAF (Dictionnaire Étymologique de l'Ancien Français)
TLFi (Trésor de la langue française informatisé)
Ancient Greek
DĀMOS (Database of Mycenaean at Oslo University)
DFHG (Digital Fragmenta Historicorum Graecorum)
EAGLE (Electronic Archive of Greek and Latin Epigraphy)
Greek Ancient and Modern (A resource for teaching and study of the Greek language in all its phases)
HoDeL (Homeric Dependency Lexicon)
LSJ (Liddell-Scott-Jones Dictionary)
ΛΟΓΕΙΟΝ (online dictionary of Greek and Latin)
REGLA (Rección y complementación en griego y latín: database with verbs and valency frames)
TLG (Thesaurus Linguae Graecae)
Sanskrit
GRETIL (Göttingen Register of Electronic Texts in Indian Languages)
Lists of lexical resources, digitalized texts, and tools, developed at the School of Sanskrit and Indic Studies (New Dehli)
The Sanskrit Library (digital texts and various tools)
Avestan
Hittite and Anatolian
Hittite texts (University of Leiden)
SLUW (A Computer-Aided Study of the Luwian (Morpho)-Syntax)
Germanic
Annotated corpora of Middle English and Early Modern English (Penn University)
DWDS-Wörterbuch (Etymologisches Wörterbuch des Deutschen)
Historische Woordenboeken (Instituut voor de Nederlandse taal)
IcePaHC (Icelandic Parsed Historical Corpus)
ONP: Dictionary of Old Norse Prose (Ordbog over det norrøne prosasprog)
Baltoslavic
Collection of downloadable pdfs, including grammars and dictionaries (mostly in Russian)
Digitalized version of some Old Russian manuscripts from 15th century (mostly in Russian)
The World Wide Web portal for the study of Cyrillic and Glagolitic manuscripts and early printed books
TOROT (Tromsø Old Russian and OCS Treebank)
Tocharian
CEToM (A Comprehensive Edition of Tocharian Manuscripts)
Old Irish and Celtic
A Dictionary of the Old-Irish Glosses (Milan Glosses)
CODECS (Online database and e-resources for Celtic studies)
Corpus of Electronic Texts for Irish history, literature and politics
Corpus PalaeoHibernicum (soon available online)
DipSGG (UD treebank of the Old Irish Glosses of St. Gall)
eDIL (Electronic Dictionary of the Irish Language)
Geiriadur Prifysgol Cymru (A Dictionary of the Welsh Language)
Thesaurus Linguae Hibernicae (the link is currently not working)