VTT Editor Pro v2.1 ‐ Features

RDTvlokip edited this page Nov 6, 2025 · 2 revisions

A complete, detailed list of the features in VTT Editor Pro v2.1.


🎵 Audio Management

Audio Loading

  • Supported formats: MP3, WAV, OGG, M4A
  • Drag & Drop: Drop your file directly
  • Upload button: Click "📁 Import MP3"
  • Waveform: Complete visualization with Wavesurfer.js v7

Audio Playback

  • Play/Pause: ▶️ Play button or Space key
  • Timeline: Real-time current time display
  • Seeking: Click on waveform to navigate
  • Auto-scroll: Waveform follows playback

📝 Cue Management (Subtitles)

Creating Cues

  • Manual add: "➕ Add Region" button or Enter key
  • Waveform add: Double-click on waveform
  • Auto-timing: New cues are added at the current playback position

Editing Cues

  • Text editing: Double-click a cue or "✏️ Edit" button
  • Timing modification: Start/End fields in VTT format
  • Color: Color picker for each cue
  • Quick save: Ctrl+Enter to save quickly

Deleting Cues

  • Individual delete: "🗑️ Delete" button
  • Delete all: "🗑️ Clear All" button removes every cue
  • Confirmation: A dialog asks for confirmation to prevent accidental deletion

Moving Cues

  • Drag on waveform: Drag colored regions
  • Resize handles: Drag a region's edges to adjust its timing
  • Snap-to-grid: Automatic alignment (v2.1)

🎨 Interface and Visualization

Interactive Waveform

  • Zoom: Mouse wheel or pinch-to-zoom
  • Scroll: Scrollbar or trackpad
  • Colored regions: Each cue has its own color
  • Resize handles: Visible handles at both ends of each region

List Display

  • Scrollable list: All cues displayed
  • Numbering: Cues are numbered automatically
  • Timestamps: Precise VTT format (HH:MM:SS.mmm)
  • Active highlighting: Currently edited cue highlighted
  • Overlap detection: Visual warning in orange

Lyrics Display (Karaoke)

  • Dedicated panel: Right column
  • Karaoke style: Active line highlighting
  • Auto-scroll: Follows audio playback
  • Click to navigate: Click a line to jump to it
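
The active-line highlighting described above comes down to finding which cue contains the current playback time. A minimal sketch, assuming cues are sorted by start time and do not overlap; the function name and the millisecond lists are hypothetical and not taken from the editor's source:

```python
from bisect import bisect_right

def active_cue_index(starts_ms, ends_ms, position_ms):
    """Return the index of the cue playing at position_ms, or None.

    Assumes starts_ms is sorted ascending and cues do not overlap.
    """
    # Last cue whose start is at or before the playback position.
    i = bisect_right(starts_ms, position_ms) - 1
    if i >= 0 and position_ms < ends_ms[i]:
        return i
    return None

starts = [1000, 4000]
ends = [3500, 7000]
print(active_cue_index(starts, ends, 2000))  # inside the first cue
print(active_cue_index(starts, ends, 3700))  # in the gap between cues
```

A binary search keeps the lookup cheap even when it runs on every playback tick.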

🔧 Version 2.1 Features

1. Enhanced Resize Tooltips ✨

Description: Real-time timestamp display while resizing cues.

How it works:

  • Tooltip appears above cursor during resize
  • Displays timestamp in VTT format (00:00:05.234)
  • Updates in real-time during movement
  • Disappears automatically when resize ends
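
The tooltip text is a time value rendered as HH:MM:SS.mmm. The conversion can be sketched as follows; the function name is hypothetical, and the input is assumed to be a non-negative number of seconds:

```python
def format_vtt_timestamp(seconds: float) -> str:
    """Format a non-negative time in seconds as HH:MM:SS.mmm."""
    # Work in whole milliseconds to avoid floating-point drift.
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"

print(format_vtt_timestamp(5.234))  # 00:00:05.234
```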

Benefits:

  • ✅ Maximum precision when adjusting timing
  • ✅ No need to look elsewhere on screen
  • ✅ Immediate visual feedback

Shortcuts: None (automatic)


2. Snap-to-Grid 📏

Description: Automatic alignment of cues to a time grid.

How it works:

  • Toggle with "⚡ Snap Grid" checkbox
  • Choose interval: 100ms, 250ms, 500ms, or 1000ms
  • Cues automatically align during resize
  • Edges (start/end) snap to intervals

Benefits:

  • ✅ Consistent timing across all cues
  • ✅ Avoids stray millisecond offsets between cues
  • ✅ Automatic professional alignment
  • ✅ Considerable time savings

Available intervals:

  • 100ms: Maximum precision, ideal for fast dialogue
  • 250ms: Good compromise between precision and speed
  • 500ms: Half-second alignment, for karaoke
  • 1000ms: Alignment to full seconds

Shortcuts: None
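
Snapping an edge to the grid amounts to rounding its time to the nearest multiple of the chosen interval. A minimal sketch under stated assumptions: times are integer milliseconds, and ties round up. Whether the editor rounds to nearest or always rounds down is not documented here, and the function name is hypothetical:

```python
def snap_to_grid(time_ms: int, interval_ms: int) -> int:
    """Round time_ms to the nearest multiple of interval_ms (ties round up)."""
    if interval_ms <= 0:
        raise ValueError("interval_ms must be positive")
    return ((time_ms + interval_ms // 2) // interval_ms) * interval_ms

# The same edge snapped to each of the four available intervals.
for interval in (100, 250, 500, 1000):
    print(interval, snap_to_grid(5234, interval))
```

Integer milliseconds keep the result exact; doing the same arithmetic on float seconds can produce values such as 5.250000000000001.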


3. Batch Text Editing 🔄

Description: Edit the text of multiple cues at once.

Access: "📝 Batch Edit" button in top bar

Available modes:

Mode 1: Find & Replace

  • Find: Text to search for
  • Replace with: Replacement text
  • Case Sensitive: Match letter case exactly (optional)
  • Regex: Regular expression support
  • Scope: All cues

Example:

Find: "music"
Replace with: "song"
Result: All occurrences of "music" become "song"
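
The Find & Replace options can be sketched as a single function. This is an illustration only: the function name and parameters are hypothetical, case-insensitive matching is assumed to be the default because the option is listed as optional, and the regex dialect shown is Python's (backreferences are written `\1`), which may differ from the one the editor uses:

```python
import re

def find_replace(texts, find, replace, case_sensitive=False, use_regex=False):
    """Return new cue texts with every match of `find` replaced."""
    # In plain mode the search text is escaped so it matches literally.
    pattern = find if use_regex else re.escape(find)
    flags = 0 if case_sensitive else re.IGNORECASE
    compiled = re.compile(pattern, flags)
    if use_regex:
        # Regex mode: backreferences in the replacement are honored.
        return [compiled.sub(replace, text) for text in texts]
    # Plain mode: the replacement is inserted verbatim.
    return [compiled.sub(lambda _match: replace, text) for text in texts]

cues = ["I love music", "Music is life"]
print(find_replace(cues, "music", "song"))
print(find_replace(cues, "music", "song", case_sensitive=True))
```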

Mode 2: Transform

Quick text transformations:

  • ⬆️ UPPERCASE: All uppercase
  • ⬇️ lowercase: All lowercase
  • 🔤 Capitalize Each Word: First Letter Of Each Word
  • 📝 Sentence case: Only the first letter capitalized

Example:

Original text: "hello world"
UPPERCASE: "HELLO WORLD"
Capitalize: "Hello World"
Sentence case: "Hello world"
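
The four transformations can be sketched as plain string functions. The helper names are hypothetical, and lowercasing the remaining letters in "Capitalize Each Word" and "Sentence case" is an assumption about how the editor treats mixed-case input:

```python
def capitalize_each_word(text: str) -> str:
    """Uppercase the first letter of each space-separated word."""
    return " ".join(w[:1].upper() + w[1:].lower() for w in text.split(" "))

def sentence_case(text: str) -> str:
    """Lowercase everything, then uppercase the first alphabetic character."""
    lowered = text.lower()
    for i, ch in enumerate(lowered):
        if ch.isalpha():
            return lowered[:i] + ch.upper() + lowered[i + 1:]
    return lowered

TRANSFORMS = {
    "UPPERCASE": str.upper,
    "lowercase": str.lower,
    "Capitalize Each Word": capitalize_each_word,
    "Sentence case": sentence_case,
}

for name, transform in TRANSFORMS.items():
    print(f"{name}: {transform('hello world')}")
```

Splitting on spaces instead of calling `str.title()` avoids capitalizing the letter after an apostrophe ("don't" would otherwise become "Don'T").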

Mode 3: Modify (Prefix/Suffix)

  • Prefix: Text to add at beginning
  • Suffix: Text to add at end
  • Range: Cue selection ("all", a span such as "1-5", or a list such as "2,4,6")

Range examples:

"all" → All cues
"1-5" → Cues 1 to 5
"1,3,5" → Cues 1, 3, and 5
"1-3,7-9" → Cues 1 to 3 and 7 to 9
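
A range string like the ones above can be expanded into cue numbers with a small parser. This is a sketch, not the editor's actual parser: the function name is hypothetical, cue numbers are 1-based as in the list display, and numbers beyond the last cue are assumed to be ignored silently:

```python
def parse_cue_range(spec: str, total: int) -> list[int]:
    """Expand "all", "1-5", "1,3,5" or "1-3,7-9" into sorted cue numbers."""
    spec = spec.strip().lower()
    if spec == "all":
        return list(range(1, total + 1))
    selected = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
        else:
            start = end = int(part)
        if start > end:
            start, end = end, start
        # Clamp to the cues that actually exist.
        selected.update(range(max(start, 1), min(end, total) + 1))
    return sorted(selected)

print(parse_cue_range("1-3,7-9", total=10))
```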

Usage example:

Prefix: "♪ "
Suffix: " ♪"
Original text: "Hello"
Result: "♪ Hello ♪"
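
Applying a prefix and suffix to a selection of cues can be sketched as below. The function name is hypothetical; `selected` is assumed to be a collection of 1-based cue numbers, with `None` meaning all cues:

```python
def add_prefix_suffix(texts, prefix="", suffix="", selected=None):
    """Wrap the selected cue texts; leave the others unchanged."""
    result = []
    for number, text in enumerate(texts, start=1):
        if selected is None or number in selected:
            result.append(f"{prefix}{text}{suffix}")
        else:
            result.append(text)
    return result

print(add_prefix_suffix(["Hello"], prefix="♪ ", suffix=" ♪"))
```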

Benefits:

  • ✅ Quick editing of dozens of cues
  • ✅ Mass corrections
  • ✅ Add musical formatting (♪)
  • ✅ Text normalization

Shortcuts: None


💾 Import / Export

VTT Export

  • Standard format: W3C-compliant WebVTT
  • Colors preserved: Each cue's color is written as a NOTE comment
  • Filename: Based on audio filename
  • Download: "💾 Export VTT" button

Generated format:

WEBVTT

NOTE color:rgb(29, 185, 84)

00:00:01.000 --> 00:00:03.500
First line

NOTE color:rgb(255, 100, 100)

00:00:04.000 --> 00:00:07.000
Second line
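
A file in this layout can be produced with a short serializer. The sketch below is an assumption about structure, not the editor's export code: cues are represented as dictionaries with hypothetical keys `start_ms`, `end_ms`, `text`, and `color`:

```python
def _timestamp(ms: int) -> str:
    """Format integer milliseconds as HH:MM:SS.mmm."""
    hours, rem = divmod(ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, millis = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{millis:03d}"

def export_vtt(cues) -> str:
    """Serialize cues to WebVTT, storing each color in a NOTE comment."""
    lines = ["WEBVTT", ""]
    for cue in cues:
        lines.append(f"NOTE color:{cue['color']}")
        lines.append("")  # a blank line ends the comment block
        lines.append(f"{_timestamp(cue['start_ms'])} --> {_timestamp(cue['end_ms'])}")
        lines.append(cue["text"])
        lines.append("")  # a blank line ends the cue
    return "\n".join(lines)

cues = [
    {"start_ms": 1000, "end_ms": 3500, "text": "First line", "color": "rgb(29, 185, 84)"},
    {"start_ms": 4000, "end_ms": 7000, "text": "Second line", "color": "rgb(255, 100, 100)"},
]
print(export_vtt(cues))
```

Because NOTE blocks are comments in WebVTT, players that do not know about the color convention simply skip them.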

VTT Import

  • Loading: "📄 Import VTT" button
  • Complete parsing: Timestamps, text, and colors
  • Validation: Format verification
  • Reconstruction: Automatic region creation
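
Reading such a file back means splitting it into blank-line-separated blocks, remembering the color from each NOTE block, and attaching it to the cue that follows. A minimal sketch with hypothetical names; it only accepts HH:MM:SS.mmm timestamps, although WebVTT also allows the hours to be omitted:

```python
import re

TIMING = re.compile(
    r"(\d{2,}):(\d{2}):(\d{2})\.(\d{3})\s+-->\s+(\d{2,}):(\d{2}):(\d{2})\.(\d{3})"
)

def _to_ms(h, m, s, ms):
    return ((int(h) * 60 + int(m)) * 60 + int(s)) * 1000 + int(ms)

def parse_vtt(source: str):
    """Parse WebVTT text into dicts with start_ms, end_ms, text, color."""
    blocks = re.split(r"\n\s*\n", source.replace("\r\n", "\n").strip())
    if not blocks[0].startswith("WEBVTT"):
        raise ValueError("missing WEBVTT header")
    cues, pending_color = [], None
    for block in blocks[1:]:
        lines = block.split("\n")
        if lines[0].startswith("NOTE"):
            match = re.match(r"NOTE\s+color:(.+)", lines[0])
            if match:
                pending_color = match.group(1).strip()
            continue
        for i, line in enumerate(lines):
            timing = TIMING.search(line)
            if timing:
                parts = timing.groups()
                cues.append({
                    "start_ms": _to_ms(*parts[:4]),
                    "end_ms": _to_ms(*parts[4:]),
                    "text": "\n".join(lines[i + 1:]),
                    "color": pending_color,
                })
                pending_color = None  # a color applies to one cue only
                break
    return cues

sample = "WEBVTT\n\nNOTE color:rgb(29, 185, 84)\n\n00:00:01.000 --> 00:00:03.500\nFirst line\n"
print(parse_vtt(sample))
```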

⏮️ Undo / Redo

History

  • Undo: "↩️ Undo" button or Ctrl+Z
  • Redo: "↪️ Redo" button or Ctrl+Y
  • Unlimited history: All actions saved
  • Tracked actions:
    • Add cue
    • Delete cue
    • Modify text
    • Modify timing
    • Batch editing
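
One common way to build this kind of history is a pair of stacks holding snapshots of the cue list. The sketch below illustrates that general technique; the class name is hypothetical and the editor may store its history differently (for example as individual commands rather than snapshots):

```python
class History:
    """Snapshot-based undo/redo with no limit on depth."""

    def __init__(self, initial_state):
        self.state = initial_state
        self._undo_stack = []
        self._redo_stack = []

    def apply(self, new_state):
        """Record a new state; any redo history is discarded."""
        self._undo_stack.append(self.state)
        self._redo_stack.clear()
        self.state = new_state

    def undo(self) -> bool:
        if not self._undo_stack:
            return False
        self._redo_stack.append(self.state)
        self.state = self._undo_stack.pop()
        return True

    def redo(self) -> bool:
        if not self._redo_stack:
            return False
        self._undo_stack.append(self.state)
        self.state = self._redo_stack.pop()
        return True

# States are immutable tuples so that snapshots cannot be changed later.
history = History(("First line",))
history.apply(("First line", "Second line"))  # add cue
history.undo()
print(history.state)
history.redo()
print(history.state)
```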

🎨 Customization

Colors

  • Fixed palette: 10 predefined colors
  • Visual selector: Dropdown with preview
  • Default color: Spotify Green (#1db954)
  • Persistence: Colors are saved in the exported VTT file
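
The palette is given in hex notation, while the exported NOTE comments use `rgb(...)` notation. The conversion between the two can be sketched as follows; the function name is hypothetical, and only six-digit hex colors are handled:

```python
def hex_to_rgb_string(hex_color: str) -> str:
    """Convert "#rrggbb" to "rgb(r, g, b)"."""
    value = hex_color.lstrip("#")
    if len(value) != 6:
        raise ValueError("expected a six-digit hex color")
    red, green, blue = (int(value[i:i + 2], 16) for i in (0, 2, 4))
    return f"rgb({red}, {green}, {blue})"

print(hex_to_rgb_string("#1db954"))  # rgb(29, 185, 84)
```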

Theme

  • Dark mode: Professional dark theme
  • Spotify colors: Green (#1db954) as accent
  • Contrasts: Optimized for readability
  • Icons: Emoji labels make buttons easy to recognize

⚠️ Error Detection

Overlaps

  • Automatic detection: Real-time
  • Orange highlighting: Overlapping cues
  • Visual warning: "⚠️ Overlap detected"
  • Conflict list: Easy to identify
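
Overlap detection can be sketched by sorting cues by start time and comparing each cue with the ones that begin before it ends. The function name is hypothetical, cues are assumed to be `(start_ms, end_ms)` pairs, and cues that merely touch (one ends exactly where the next begins) are assumed not to count as overlapping:

```python
def find_overlaps(cues):
    """Return sorted pairs of 1-based cue numbers whose times overlap."""
    order = sorted(range(len(cues)), key=lambda i: cues[i][0])
    overlaps = []
    for pos, i in enumerate(order):
        for j in order[pos + 1:]:
            # Later cues start even later, so stop at the first one
            # that begins at or after the end of cue i.
            if cues[j][0] >= cues[i][1]:
                break
            overlaps.append((min(i, j) + 1, max(i, j) + 1))
    return sorted(overlaps)

cues = [(1000, 3500), (3000, 5000), (5000, 6000)]
print(find_overlaps(cues))  # [(1, 2)]
```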

Validation

  • VTT format: Verification on import
  • Timestamps: Checks that start < end
  • Minimum duration: Alerts if too short
  • Empty text: Warning if no text
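
The per-cue checks listed above can be sketched as one function that returns a list of problems. The function name and messages are hypothetical, and the 300 ms minimum duration is an assumed placeholder, since the actual threshold is not stated:

```python
MIN_DURATION_MS = 300  # assumed value; the editor's real threshold is not documented here

def validate_cue(start_ms, end_ms, text, min_duration_ms=MIN_DURATION_MS):
    """Return a list of human-readable problems (empty if the cue is fine)."""
    problems = []
    if start_ms >= end_ms:
        problems.append("start must be before end")
    elif end_ms - start_ms < min_duration_ms:
        problems.append("duration too short")
    if not text.strip():
        problems.append("empty text")
    return problems

print(validate_cue(1000, 1100, ""))
```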

🚀 Performance

  • Large files: Supports audio of 1 hour or longer
  • Hundreds of cues: Smooth handling of 200+ cues
  • Optimized rendering: Virtual scrolling for list
  • No lag: Real-time interactions

🌐 Compatibility

Supported Browsers

  • Chrome/Edge: 90+
  • Firefox: 88+
  • Safari: 14+
  • Opera: 76+

Audio Formats

  • MP3: Recommended
  • WAV: Full support
  • OGG: Full support
  • M4A: Browser-dependent support

📊 Statistics

The interface displays:

  • Total cues: Number of subtitles
  • Audio duration: HH:MM:SS format
  • Current position: Playback time
  • Active cue: Number and timestamp
