Subtitle Formats

SRT vs VTT: What's Actually Different?

SRT vs VTT is one of those questions that comes up on every video project. Here's a straight answer on what separates them and when each one makes sense.

Gary Sztajnman

Author

March 24, 20268 min read

TL;DR

SRT works everywhere. If you need one format that every player can read, it's SRT.
VTT is made for the web. It's what browsers expect when you use the HTML5 track element.
The timecode syntax is slightly different: SRT uses a comma (00:01:23,456), VTT uses a dot (00:01:23.456).
VTT lets you style and position subtitles with CSS. SRT doesn't — it's plain text, full stop.
Hello8 exports both (plus ASS/SSA, TTML, and others), so you don't have to pick just one.

SRT vs VTT: Quick Decision Guide

Use SRT if you need:

YouTube, Vimeo, or social platform uploads
Desktop playback (VLC, MPV, Windows Media Player)
NLE import (Premiere Pro, DaVinci Resolve, Final Cut)
Maximum compatibility with any system

Use VTT if you need:

Web video with the HTML5 track element
Styled subtitles (colors, positioning, fonts)
Chapter markers or metadata
Accessibility compliance (WCAG)

Not sure? Export both from Hello8 — it takes one click.

What is an SRT file?

SRT stands for SubRip Text. It originally came from a tool called SubRip that was used to rip subtitles off DVDs — hence the name. Over the years it became the go-to format for sharing subtitles. Pretty much every video player, editing tool, and platform knows how to read it.

The file itself is dead simple. It's plain text. Each subtitle block has a number, a timecode range (start → end), and the text to display. That's it.

example.srt

1
00:00:01,000 --> 00:00:04,000
Welcome to our documentary
about marine life.

2
00:00:05,500 --> 00:00:09,000
The ocean covers over 70%
of the Earth's surface.

3
00:00:10,200 --> 00:00:14,800
Yet we have explored less than
5% of this vast underwater world.

That simplicity is why SRT is so popular — and also why it's limited. You can open one in Notepad, edit it by hand, and any player will read it without complaining. But you get zero control over appearance. No colors, no positioning, no font changes. What you type is what shows up, plain white text on a black bar.

At a glance

Plain text — no markup, no styling
Numbered cues (1, 2, 3...)
Timecodes use a comma: HH:MM:SS,mmm
Works on basically everything
File extension: .srt

What is a VTT file?

VTT stands for WebVTT (Web Video Text Tracks). It's a W3C standard that grew out of SRT. The idea was to take what SRT does well and add the things web developers actually need — CSS styling, cue positioning, metadata, chapter markers.

Every VTT file starts with a WEBVTT header (this is required — leave it out and the file won't parse). After that, you get cue blocks similar to SRT, but each cue can carry positioning instructions and CSS-compatible styling that SRT has no concept of.

example.vtt

WEBVTT
Kind: captions
Language: en

00:00:01.000 --> 00:00:04.000
Welcome to our documentary
about marine life.

00:00:05.500 --> 00:00:09.000 position:10% align:start
<i>The ocean covers over 70%
of the Earth's surface.</i>

00:00:10.200 --> 00:00:14.800
Yet we have explored less than
<b>5%</b> of this vast underwater world.

If you've ever used the track element inside an HTML5 video player, VTT is what it expects. That makes it the obvious pick for anything web-native: your own site, an LMS, a custom player, or any streaming setup where you want to control how subtitles look and where they sit on screen.

At a glance

Must start with WEBVTT header
CSS styling works (bold, italic, colors, fonts)
You can position and align cues on screen
Supports chapter markers and metadata
Timecodes use a dot: HH:MM:SS.mmm
Native to HTML5 track element
File extension: .vtt

Side-by-Side Comparison

The practical SRT vs VTT differences, all in one place.

Feature

SRT

VTT (WebVTT)

File extension

.srt

.vtt

Timecode separator

Comma (,)

Dot (.)

File header

None

WEBVTT (required)

Text styling

None

CSS / HTML tags

Cue positioning

Nope

Yes — vertical, line, position, align

Chapter markers

Nope

Yes

Metadata

Nope

Header metadata + NOTE blocks

HTML5 track element

Needs conversion first

Works natively

Browser playback

Needs a JS polyfill

Built in

Desktop players (VLC, etc.)

Universal

Hit or miss

Broadcast / TV

Standard

Rare

Hand-editing

Easy — any text editor

Same — any text editor

When SRT is the right call

When in doubt, SRT. It just works.

Desktop players

VLC, MPV, Windows Media Player — they all read SRT without blinking. Drop the file next to your video and you're done.

Post-production

Premiere Pro, DaVinci Resolve, Final Cut — SRT is the format your NLE expects. No import headaches.

Platform uploads

YouTube, Vimeo, most social platforms — they all take SRT for closed captions. It's the path of least resistance.

Older systems

DVD authoring, legacy media servers, enterprise video platforms — if it's been around a while, it probably only speaks SRT.

When VTT is the right call

Building for the web? VTT is what browsers actually want.

Web video

The HTML5 track element was designed for VTT. If you're embedding video on a website, this is the native choice.

Styled subtitles

Want colored speaker names, italic stage directions, or bold emphasis? You need VTT — SRT can't do any of that.

E-learning & courses

Chapter markers let learners jump between sections. Metadata lets you build interactive features on top of the video timeline.

Accessibility

VTT supports description tracks and regions, which helps when you're working toward WCAG compliance.

Best subtitle format by use case

Not every project needs the same format. Here's a cheat sheet.

YouTube / Vimeo— Platform default, widest acceptance

SRT

Your website / HTML5 player— Native browser support, styling

VTT

Netflix / broadcast / TV— Industry standard for delivery

TTML

Fansubs / advanced styling— Full typographic control

ASS/SSA

Universal safe bet— If it plays video, it reads SRT

SRT

VTT features you probably didn't know about

Most people treat VTT like "SRT with a header." It can do a lot more than that.

CSS styling

The ::cue pseudo-element lets you style individual cues with colors, fonts, backgrounds, even opacity. You can make speaker names look different from dialogue, or highlight specific words.

Cue positioning

Put subtitles anywhere on screen — not just bottom-center. Handy when there's text or a lower third already on screen that you don't want to cover.

Regions

You can define named screen areas and assign cues to them. Two speakers? Put each one in a different corner. It's surprisingly useful for panel discussions or multi-character scenes.

Chapter markers

Add kind='chapters' to a track and you get table-of-contents navigation in supported players. Great for long-form content like webinars or lectures.

Metadata tracks

You can embed hidden data — JSON, speaker IDs, whatever — synced to the video timeline. This is how people build searchable transcripts, interactive overlays, and analytics hooks.

How to convert SRT to VTT (and back)

Because SRT vs VTT is mostly a structural difference, conversion is doable. But there are gotchas that catch people off guard — especially going from VTT back to SRT.

SRT → VTT

1Add WEBVTT as the first line
2Swap commas for dots in timecodes (00:01:23,456 → 00:01:23.456)
3Cue numbers become optional — you can drop them or keep them as IDs
4Now you can add styling and positioning if you want

VTT → SRT

1Remove the WEBVTT header and any metadata/NOTE blocks
2Swap dots for commas in timecodes
3Add sequential numbering (1, 2, 3...) to each cue
4Strip all styling, positioning, and cue settings — this is the lossy part

Going from VTT to SRT is a one-way trip for styling. All CSS, positioning, regions, and metadata get thrown away. Keep your VTT source file around — you can't get that stuff back from the SRT.

Common conversion mistakes

Forgetting the WEBVTT header — File won't parse — browsers silently ignore it

Wrong decimal separator (comma vs dot) — Timecodes shift or break entirely

Keeping CSS styling in an SRT export — Raw tags show up as visible text in players

UTF-8 BOM encoding issues — First cue gets corrupted or skipped on some platforms

The real problem isn't SRT vs VTT

The format itself is rarely the issue. The workflow around it is.

Converting between formats by hand — and hoping nothing breaks

Losing styling every time you export to SRT

Files that silently fail on upload because of a wrong header or separator

Tools that only support one format, so you're stuck re-doing work

How Hello8 removes the format problem

You don't pick formats upfront. You create subtitles once, then export wherever you need.

Create once, export anywhere

SRT, VTT, ASS/SSA, TTML, EBU-STL, FCPXML — all from one project. No manual conversion, no duplicate work.

Nothing breaks on export

Format-specific QC catches timecode issues, encoding problems, and spec violations before they reach your client or platform.

Styling survives

Speaker labels, italics, and positioning carry over to formats that support them. Export to SRT and VTT from the same source — each gets what it can handle.

Human + AI, format-agnostic

AI transcribes and translates. Human editors refine. The output is accurate no matter which format you export to.

FAQ

Can I convert SRT to VTT?

Yes — add the WEBVTT header, swap commas for dots in the timecodes, and you're basically done. Hello8 does this automatically on export.

Which is better for YouTube?

YouTube takes both. Most people upload SRT because that's what YouTube's own editor spits out. Either works fine.

Can SRT files have colors or bold text?

Officially, no. SRT is plain text. Some players will render basic HTML bold or italic tags, but it's non-standard and spotty — don't count on it.

Does VTT work in VLC?

Sort of. VLC can read VTT, but it ignores most of the advanced stuff (positioning, styling). For desktop playback, SRT is still the safer bet.

What formats does Hello8 support?

SRT, VTT, ASS/SSA, TTML, EBU-STL, FCPXML, and more. Import in any of them, export to any combination.

Not sure which format to pick?

If it's for the web, go VTT. If it's for everything else, go SRT. Or just export both from Hello8 — it takes one click.

Related resources

Still thinking about formats?

You shouldn't have to. Create your subtitles once. Export everywhere.

Talk to our team