Using a pretrained multi-lingual, multi-speaker TTS built on the challenge dataset, perform few-shot voice cloning by fine-tuning on new speakers.
Using a pretrained multi-lingual, multi-speaker TTS built on the datasets of this challenge and any other publicly available corpora, such as VCTK and LibriTTS, perform few-shot voice cloning by fine-tuning on new speakers.
Using a pretrained multi-lingual, multi-speaker TTS built on the datasets of this challenge and any other publicly available corpora, such as VCTK and LibriTTS, perform zero-shot voice cloning by evaluating on utterances of unseen speakers without fine-tuning.
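
For the few-shot tracks above, one common way to fine-tune on a new speaker is to keep the pretrained weights frozen and learn only a new speaker embedding from the handful of available utterances. The toy sketch below illustrates that idea in pure Python. It is an assumption made for illustration, not the method the challenge prescribes: the linear "model", the function names (`synthesize`, `fine_tune_new_speaker`, `mse`), and all numbers are hypothetical stand-ins for a real TTS system and real audio.

```python
import random


def synthesize(backbone_w, speaker_emb, text_feats):
    """Toy stand-in for a TTS model: one output value per input frame,
    computed as w . (text features + speaker embedding)."""
    return [
        sum(w * (x + s) for w, x, s in zip(backbone_w, frame, speaker_emb))
        for frame in text_feats
    ]


def mse(preds, targets):
    """Mean squared error between predicted and reference frames."""
    return sum((p - t) ** 2 for p, t in zip(preds, targets)) / len(targets)


def fine_tune_new_speaker(backbone_w, text_feats, targets, steps=200, lr=0.05):
    """Learn an embedding for a new speaker from a few utterances.

    Assumption: only the speaker embedding is trained; backbone_w is
    read but never updated, i.e. the pretrained weights stay frozen.
    """
    speaker_emb = [0.0] * len(backbone_w)  # new speaker starts at zero
    for _ in range(steps):
        preds = synthesize(backbone_w, speaker_emb, text_feats)
        mean_err = sum(p - t for p, t in zip(preds, targets)) / len(targets)
        # Gradient of the MSE w.r.t. embedding component i is 2 * mean_err * w_i.
        speaker_emb = [
            s - lr * 2.0 * mean_err * w for s, w in zip(speaker_emb, backbone_w)
        ]
    return speaker_emb


random.seed(0)
backbone_w = [0.5, -1.0, 0.25]   # hypothetical pretrained (frozen) weights
true_emb = [0.3, -0.2, 0.8]      # hypothetical identity of the new speaker

# Five short "utterances" from the new speaker: the few-shot data.
few_shot_text = [[random.uniform(-1.0, 1.0) for _ in range(3)] for _ in range(5)]
few_shot_audio = synthesize(backbone_w, true_emb, few_shot_text)

loss_before = mse(synthesize(backbone_w, [0.0] * 3, few_shot_text), few_shot_audio)
new_emb = fine_tune_new_speaker(backbone_w, few_shot_text, few_shot_audio)
loss_after = mse(synthesize(backbone_w, new_emb, few_shot_text), few_shot_audio)

print(f"loss before fine-tuning: {loss_before:.6f}")
print(f"loss after fine-tuning:  {loss_after:.6f}")
```

In a real system the same pattern would apply to a neural TTS model: the shared text encoder and decoder stay fixed (or are updated with a small learning rate), and the few-shot data mostly shapes the speaker-specific parameters. In the zero-shot setting of the last track there is no such training step, and the speaker representation has to be inferred directly from the unseen speaker's reference utterances.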