Supported formats

Open almost any video or audio file and export SRT/VTT

Voice2Sub is built for real-world media files from phones, cameras, screen recorders, editors, podcasts and lectures. Start with the file you already have, generate AI subtitles or text locally, then export SRT, VTT, TXT, LRC or CSV.

Broad media support with a clear container/codec caveat for unusual files.

Supported Formats

Best for format-heavy workflows

  • Files from phones and cameras
  • Screen recordings and course videos
  • Podcast and interview audio
  • Video editor exports
  • Subtitle and transcript handoff

Start with the media file you already have

Voice2Sub is designed to open a broad range of common and many less common video or audio files directly in the desktop app. That includes real-world files from phones, cameras, screen recorders, editing apps, podcasts, lectures and voice recordings.

Download free app

Broad format support, explained clearly

  • Video containers such as MP4, MOV, MKV, AVI and WebM are common starting points for subtitle generation.
  • Audio files such as MP3, WAV, M4A, AAC and FLAC can become transcripts or timed subtitle files.
  • Containers can hold different codecs, so very unusual files should be opened in the app first before a long processing job.
  • Export formats are intentionally practical: SRT/VTT for captions, TXT for text, LRC for line timing and CSV for structured review.

Workflow

Import media, generate subtitles, export the right format

Use this page as the compatibility guide before format-heavy projects such as MP4 to SRT, MP3 to transcript, VTT for web players or CSV review handoffs.

  1. 01

    Open a local media file

    Start from MP4, MOV, MKV, AVI, WebM, MP3, WAV, M4A, FLAC or another supported video/audio file.

  2. 02

    Generate AI text

    Voice2Sub recognizes speech and creates subtitle or transcript text you can review.

  3. 03

    Choose the output format

    Export SRT, VTT, TXT, LRC or CSV based on where the result will be used.

Supported formats

Supported input and export formats

Voice2Sub is designed for broad media support: common containers and many less common real-world video or audio files from creators, cameras, phones, screen recorders, podcasts, lectures and editing apps. If a familiar container uses an unusual codec, open it in Voice2Sub to confirm compatibility before a long workflow.

Video containers

Import videos from phones, cameras, screen recorders, editing apps and archive folders.

  • MP4
  • MOV
  • MKV
  • AVI
  • WebM
  • WMV
  • FLV
  • M4V
  • MPEG/MPG
  • TS
  • 3GP
  • VOB
  • MXF
  • DIVX
  • and more

Audio formats

Import recordings, podcasts, interviews, lectures, meetings and voice notes.

  • MP3
  • WAV
  • M4A
  • AAC
  • FLAC
  • OGG
  • WMA
  • OPUS
  • AIFF
  • AMR
  • AC3
  • APE
  • and more

Subtitle and transcript exports

Export subtitles, WebVTT, transcript text, lyric-style timing or CSV review data.

  • SRT
  • VTT / WebVTT
  • TXT
  • LRC
  • CSV

File extensions are examples. Containers such as MP4, MOV or MKV can hold different codecs; unusual or proprietary codecs should be tested in the app before long processing jobs.

Export guide

Which export format should you choose?

Choose the output based on where the subtitles or transcript will be used after review.

YouTube and video platforms

Use SRT or VTT when you need a standard caption file for publishing workflows.

  • SRT
  • VTT

Web players

Use VTT/WebVTT when the destination is a web video player or HTML5 caption workflow.

  • VTT

Editing and review

Use SRT for broad compatibility or CSV when structured timing/text review is useful.

  • SRT
  • CSV

Transcript archive

Use TXT when you need readable text for notes, search, documentation or handoff.

  • TXT

Lyric-style timing

Use LRC when line-by-line timing is helpful for lyric-style or timestamped text workflows.

  • LRC

Container vs codec

Why one MP4 may work differently from another

MP4, MOV and MKV are containers. They can hold different audio/video codecs inside the same extension. Voice2Sub is built for broad real-world compatibility, but very unusual or proprietary codecs should be opened in the app first before a long processing job.

  • Container: MP4, MOV, MKV
  • Codec: the media stream inside
  • Test unusual files first

Input coverage

A practical format page for files users actually have

The page is meant to answer format questions from creators, educators and editors: camera videos, screen recordings, downloaded meeting files, podcast audio, lectures, interviews and exports from editing software.

  • Phone and camera videos
  • Screen recordings
  • Podcast and lecture audio

Output coverage

Subtitle and transcript formats serve different jobs

SRT and VTT are caption files. TXT is readable transcript text. LRC adds line-by-line timing, and CSV is useful for structured review or handoff. The right output depends on where the result goes after review.

  • SRT/VTT for captions
  • TXT for transcript archives
  • CSV for structured review

Conversion rule

Do not convert first unless you need to

For common video and audio files, start with the original media. Convert only when a file cannot be opened, uses an unsupported codec, or comes from an unusual tool that created a nonstandard stream.

  • Start with the original file
  • Test unusual codecs
  • Convert only when needed

AI Transcription

AI Transcription

Voice2Sub also covers speech to text, audio to text, video transcription and Whisper AI transcription, so subtitle generation can stay connected to transcript and export needs.

  • Speech to Text
  • AI Transcription
  • Whisper AI Transcription

Format examples

Common source-to-output workflows

These examples match practical long-tail questions such as MP4 to SRT, MOV to captions, MP3 to transcript, WAV to text and VTT for web players.

  • MP4 to SRT or VTT captions
  • MOV or MKV videos to subtitle files
  • MP3 or WAV to transcript text
  • M4A recordings to TXT or SRT
  • SRT/VTT export for editors and platforms

Supported formats FAQ

Does Voice2Sub support MP4?

Yes. MP4 is a common video container supported by Voice2Sub. Codec compatibility can still vary for unusual files, so test unfamiliar media before a long workflow.

Can I create SRT from MP3 or WAV?

Yes. Import an MP3 or WAV audio file, generate text with AI recognition, review the result and export SRT when timed subtitles are useful.

What is the difference between SRT and VTT?

SRT is widely used for subtitle exchange and review. VTT, also called WebVTT, is common for web video captions and some publishing workflows.

What if a video uses an unusual codec?

A familiar extension can contain an uncommon codec. Open the file in Voice2Sub first to confirm compatibility before committing to a long processing job.

Do I need to convert files before importing?

Usually no. Start with the original video or audio file when it is a common format. Convert only if the file cannot be opened or uses an unsupported codec.

Are export formats available in the free app?

Yes. The free desktop app includes the core media import and subtitle/transcript export workflow, including SRT, VTT, TXT, LRC and CSV.

Open your media file and export the format you need

Download the desktop app, import video/audio on your computer, and export SRT, VTT, TXT, LRC or CSV after review.