Video text needs timing and visual context
When the source is video, the text often has to line up with scenes, edits and speaking turns. This page keeps that video context separate from the audio-file and general recognition pages.
Download Voice2Sub