Go to kapwing.com.
Log in with a Google or email account.
Select “Create New Project” to open a blank workspace.
In the left sidebar, click “AI Tools.”
Select “AI Video Generator.”
Choose “Generate video from text.”
This opens a text box where you can write a prompt or paste a full script.
You can choose either method:
Short prompt example:
“Create a 45-second explainer video about how climate change affects animals, with simple vocabulary for B1 English learners.”
Full script example:
Paste a paragraph you want turned into a video.
Kapwing will use AI to:
divide the text into scenes,
select appropriate stock footage,
add subtitles,
organize the video timeline.
Before generating, you can adjust:
Aspect ratio: 16:9 (YouTube), 9:16 (TikTok/Reels), 1:1 (Instagram).
Tone/Style: educational, narrative, upbeat, formal, etc.
Subtitle preferences: on/off, auto-captions.
Press “Generate Video” to let Kapwing create the first draft.
After generation, you can modify every element:
Replace or rearrange clips
Edit or rewrite text overlays
Adjust timing
Upload your own images, audio, or video
Change fonts, colors, or layouts
This step supports creative decision-making and aligns with task-based, student-centered learning.
Students may add:
AI-generated voiceover (built into Kapwing)
A recorded voiceover using the microphone
Imported audio files
This is especially valuable for language learners who want to practice speaking.
Click “Export” in the top-right corner.
Select “Video.”
Choose resolution (usually 720p or 1080p).
Wait for rendering and download the result.
Students can now publish, share, or submit the video as part of their task.
Kapwing offers several advantages for educational contexts, particularly in language learning. According to Tech & Learning (2018), Kapwing’s intuitive, web-based interface works well on devices like Chromebooks, which lowers the technical barrier for both students and teachers. Its multimodal editing capabilities also enhance student engagement by allowing the integration of text, visuals, audio, and captions into a single task. According to Common Sense Education (2021), Kapwing supports multimodal literacy because students can communicate meaning not only through language but also through images, sound, and layout. The platform’s automatic subtitling and transcription tools boost accessibility; according to Kapwing’s own education resources (2022), these features make videos more inclusive for English language learners and students with hearing difficulties. Collaboration is another strength: according to Kapwing’s 2022 announcement for educators, its cloud-based workspace enables real-time cooperative editing, which is valuable for hybrid or remote learning. Finally, Kapwing’s AI-powered features—such as auto-captioning, text-to-speech, and text-to-video—streamline production and scaffold creativity, allowing students to focus more on content and communication rather than technical editing.
Bates, T. (2015). Teaching in a digital age: Guidelines for designing teaching and learning. https://opentextbc.ca/teachinginadigitalage/
Hobbs, R. (2010). Digital and media literacy: A plan of action. Aspen Institute.
Kapwing. (2025). Kapwing Help Center. https://www.kapwing.com/help
Kapwing. (2025). Getting started with Kapwing. https://www.kapwing.com/resources/
Mayer, R. E. (2009). Multimedia learning (2nd ed.). Cambridge University Press.
OpenAI. (2025). ChatGPT [Large language model]. https://chat.openai.com/
Selwyn, N. (2016). Education and technology: Key issues and debates (2nd ed.). Bloomsbury Publishing.
UNESCO. (2023). AI competency framework for teachers. https://unesdoc.unesco.org/