LIghtweight, Multi-speaker, Multi-lingual Indic TTS
LIMMITS'23
Challenge overview
The challenge is being organized as part of ICASSP 2023 which aims at the development of a Lightweight, Multi-speaker, Multi-lingual Indic Text-to-Speech (TTS) model using datasets in Marathi, Hindi, and Telugu. These studies are important for data selection in TTS, for various landscapes such as India, Europe, etc which have a large number of languages and their regional variations. TTS corpora in these languages are being built as a part of the SYSPIN project at SPIRE lab, Indian Institute of Science (IISc) Bangalore, India, in which large corpora comprising 40 hours of single speaker’s speech in each of nine Indian languages are being collected. This proposal is part of the initiatives taken to open source these corpora and continue to propose challenges in the next two years’ ICASSP as well.
The challenge aims towards helping and encouraging the advancement of TTS in Indian Languages. The basic challenge is to take the released speech data, build TTS voices, and share the voice in web API form for evaluation. The output from each synthesizer will be evaluated through extensive listening tests. The primary objective of this challenge is understanding and comparing the various approaches to build TTS and simultaneously identifying efficient speech groups across the world.
About SYSPIN
SYnthesizing SPeech in INdian languages (SYSPIN) is an initiative to develop large open-source text-to-speech (TTS) corpora and models for TTS systems in nine Indian languages in the area of agriculture and finance. Nine Indian languages considered for this project are Hindi, Bengali, Marathi, Telugu, Bhojpuri, Kannada, Magadhi, Chhattisgarhi, and Maithili.
A majority of the population in the country is still unable to use all the technological services due to language and literacy constraints. SYSPIN helps to reduce their barriers to voice-based technologies and creates a potential market for tech innovators and social entrepreneurs.
The output of this project will allow local innovation in emerging markets to develop products and services serving illiterate Indians and rural poor populations in their own medium of engagement with technology. The TTS corpus will be a unique resource for developing assistive technologies for people with speech and visual disabilities. The proposed 720 hours of open-source TTS data will open up opportunities for academic and industrial research.
More about SYSPIN: https://syspin.iisc.ac.in/
Important Dates
Nov 26, 2022: Registration for the challenge opens
Dec 04, 2022: Release training data
Dec 15, 2022: Release baseline recipe
Jan 15, 2023: API submission begins
Jan 20, 2023: Last date for submission of a working API
Jan 30, 2023 Jan 27, 2023: Final API submission deadline
Feb 10, 2023: Announcement of Winners in all tracks
Feb 20, 2023: Grand Challenge 2-page Papers Due (by invitation only - challenge organizers will invite the top 5 ranked teams )
Note: The dates given apply to any time zone across the world.