The Emille Project (Central Institute of Indian Languages(CIIL) Collaboration with Lancester University)
Tamil Telugu General Text corpus by ILTPDC (Link )
Manually Annotated POS tagged corpus by AUKBC, Chennai
Word Corpora by Central Institute of Indian Languages(CIIL)
Tamil Speech Data by ILTPDC
English-Tamil Parallel Copora by Amrita Vishwa Vidyapeetham, Coimbatore (MTIL 2017)
Lemmatizer by Amrita Vishwa Vidyapeetham, AUKBC and Anna University
Morphological Analyzer for Tamil Anusaraka Project by IIT Kanpur
A Dataset for Troll Classification of Tamil Memes (Zenodo Link)
Multimodal Machine Translation Tamil Dataset (created some years ago)
Machine Translation in Dravidian languages (shared task dataset)
Some source codes