Home >IEEE CIS HSO Events >QCI Workshop @ Malaysia (8/9/2025) >Whisper-Taiwanese Model for QCI&GAI Experience > Introduction to Whisper-Taiwanese Model Tv0.5
Home >IEEE CIS HSO Events >QCI Workshop @ Malaysia (8/9/2025) >Whisper-Taiwanese Model for QCI&GAI Experience > Introduction to Whisper-Taiwanese Model Tv0.5
Introduction to Whisper-Taiwanese Tv0.5 Model
Whisper-Taiwanese model V0.5 (Tv0.5): This model is a fine-tuned version of OpenAI’s openai/whisper-large-v3-turbo. It was developed by the National University of Tainan (NUTN), Taiwan, as part of a National Science and Technology Council (NSTC)-funded industry-academia collaboration project. We carried out the Taiwanese-English Co-Learning Pilot Project from September 2024 to June 2025 in collaboration with JEN-PIN ENTERPRISE CO., LTD. The model is trained for Taiwanese language recognition tasks using JEN-PIN educational materials generated through Student–Machine Co-Learning during the Fall 2024 semester. Additionally, the NUTN is collaborating with the National Center for High-performance Computing (NCHC) of the National Applied Research Laboratories (NARLabs) in Taiwan to provide computational and storage resources and co-develop an AI learning model for elementary and high school students.
Model Details
Base Model: openai/whisper-large-v3-turbo
Fine-tuned for: Taiwanese Hokkien Automatic Speech Recognition (ASR)
Fine-tuning Framework: Hugging Face Transformers
Training Duration: Approximately 180 hours using two V100 GPUs
Dataset: Custom dataset, including the Dictionary of Frequently-Used Taiwanese Taigi released by the Ministry of Education, Taiwan, totaling approximately 90 hours of audio data.
Input Format: 16kHz mono WAV
License: CC BY-NC 4.0
Home >IEEE CIS HSO Events >QCI Workshop @ Malaysia (8/9/2025) >Whisper-Taiwanese Model for QCI&GAI Experience > Introduction to Whisper-Taiwanese Model Tv0.5