Formosa Speech Recognition Challenge 2025 - Hakka ASR II
Formosa Speech Recognition Challenge 2025 (FSR-2025) is the fourth event of the Formosa Speech in the Wild (FSW) project, which is organized by National Yang Ming Chiao Tung University (NYCU).
Taiwanese Hakka is a language spoken natively by about 1.5% of the population of Taiwan. Although the number of Hakka speakers continues to drop, especially among youth, it's not yet too late to save this language. Therefore, we are now calling for and welcoming participants from both academic and industrial sectors to FSR-2025. Students are especially welcome to participate in the competition for the Student Awards.
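Build an automatic speech recognition (ASR) system for Hakka that outputs one or both of the following (choose at least one track):
Track 1: Hakka Chinese characters (客語漢字), following the Taiwanese Hakka Recommended Characters (臺灣客家語推薦用字) published by the Ministry of Education of Taiwan; Chinese characters take priority
Track 2: Taiwan Hakka Pinyin, following the Hakka Pinyin Scheme (客家語拼音方案) published by the Ministry of Education of Taiwan; base tones (本調) are used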
For example:
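Track1 - 今晡日係拜二 (everything except loanwords is written in Chinese characters; in addition, synonymous characters are processed beforehand)
Track2 - gim24 bu24 ngid2 he55 bai55 ngi55 (base tones are used)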
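A quick sanity check on Track 2 output can catch tokens that lost their tone digits. The sketch below assumes each token is lowercase Latin letters followed by one or two tone digits; that pattern matches the examples in this announcement but is not an official rule of the pinyin scheme, and the function name `malformed_tokens` is hypothetical.

```python
import re

# Assumed token shape: lowercase letters followed by one or two tone digits.
# This matches the examples in this announcement; it is NOT an official rule.
TOKEN_RE = re.compile(r"[a-z]+[0-9]{1,2}")


def malformed_tokens(transcript: str) -> list[str]:
    """Return the whitespace-separated tokens that do not look like syllable+tone."""
    return [tok for tok in transcript.split() if not TOKEN_RE.fullmatch(tok)]


if __name__ == "__main__":
    print(malformed_tokens("gim24 bu24 ngid2 he55 bai55 ngi55"))
    print(malformed_tokens("gim24 bu ngid2"))
```

An empty list means every token fits the assumed shape; anything returned is worth inspecting by hand before submission.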
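File name: please name the file "affiliation+team name+participant" to avoid misidentification (if you did not do this in an earlier submission, that is fine; the email address will be checked).
Answer format: ID answer (same as Kaldi: one column is the audio file ID, the other is the speech recognizer output)
Examples: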
Track1:
1 今晡日係拜二
2 老妹當好搞水
3 暗晡夜來吾屋下食夜
Track2:
1 gim24 bu24 ngid2 he55 bai55 ngi55
2 lo31 moi55 dong24 hau55 gau31 sui31
3 am55 bu24 ia55 loi11 nga24 vug2 ka24 siid5 ia55
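The answer format above can be produced and re-checked with a few lines of Python. This is a minimal sketch, not an official tool: the function names `format_submission` and `parse_submission` are hypothetical, and it assumes one utterance per line, a single space between ID and answer, and unique IDs.

```python
def format_submission(hypotheses: dict[str, str]) -> str:
    """Render {audio ID: recognizer output} as 'ID answer' lines."""
    lines = []
    for utt_id, text in hypotheses.items():
        utt_id = utt_id.strip()
        if not utt_id or any(ch.isspace() for ch in utt_id):
            raise ValueError(f"invalid audio ID: {utt_id!r}")
        # Collapse runs of whitespace so each line stays 'ID answer'.
        text = " ".join(text.split())
        lines.append(f"{utt_id} {text}")
    return "\n".join(lines) + "\n"


def parse_submission(content: str) -> dict[str, str]:
    """Read 'ID answer' lines back; reject duplicate IDs (an assumed constraint)."""
    result: dict[str, str] = {}
    for line_no, line in enumerate(content.splitlines(), start=1):
        parts = line.split(maxsplit=1)
        if not parts:
            continue  # skip blank lines
        utt_id = parts[0]
        text = parts[1].strip() if len(parts) > 1 else ""
        if utt_id in result:
            raise ValueError(f"line {line_no}: duplicate audio ID {utt_id!r}")
        result[utt_id] = text
    return result


if __name__ == "__main__":
    track1 = {"1": "今晡日係拜二", "2": "老妹當好搞水"}
    track2 = {"1": "gim24 bu24 ngid2 he55 bai55 ngi55"}
    print(format_submission(track1), end="")
    print(format_submission(track2), end="")
```

Parsing the generated text back before sending it is a cheap way to confirm that every audio ID appears exactly once.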
This challenge is based on the "HAT-Vol2" corpus.
"HAT-Vol2" contains recordings from about 100 speakers recruited across Taiwan, totaling about 80 hours (Training + Eval + Test sets).
This data is released here for FREE under a Non-Commercial Use Only license. Please read and accept the License.
Baseline Scripts: ESPnet-based baseline recipes will be provided on GitHub so that students can develop their own systems quickly and easily (link TBA).
2025/06/02 --- Registration Open & Training Data Release
2025/07/31 --- Registration Close
2025/08/04 --- Pilot-Test (dry-run only) Data Release
2025/08/11 --- Pilot-Test (dry-run only) Result Submission
2025/08/18 --- Pilot-Test (dry-run only) Performance Notification
2025/09/08 --- Final-Test Data Release
2025/09/19 --- Final-Test Result & Draft Paper Submission
2025/09/26 --- Final-Test Performance Notification (released)
2025/10/06 --- Paper Submission
2025/11/20-11/22 --- Award Ceremony and Workshop (T.B.D.)
PS: The Pilot-Test (dry run) is used only to make sure everything is ready for the final test; it is not scored.
Yuan-Fu Liao (廖元甫)
Full Professor, National Yang Ming Chiao Tung University