[BUG] Update Transcript Parsing to support Valid WebVTT formats #440

Dananji · 2024-02-29T15:32:46Z

Description

Transcript component cannot parse WebVTT files with additional information in the header. These files are properly interpreted as captions while the transcript component freezes when trying to display them.

Example of a valid vtt file from W3C that doesn't work in Transcript component:

WEBVTT

REGION
id:fred
width:40%
lines:3
regionanchor:0%,100%
viewportanchor:10%,90%
scroll:up

REGION
id:bill
width:40%
lines:3
regionanchor:100%,100%
viewportanchor:90%,90%
scroll:up

00:00:00.000 --> 00:00:20.000 region:fred align:left
<v Fred>Hi, my name is Fred

00:00:02.500 --> 00:00:22.500 region:bill align:right
<v Bill>Hi, I’m Bill

00:00:05.000 --> 00:00:25.000 region:fred align:left
<v Fred>Would you like to get a coffee?

NOTE: When parsing WebVTT does the parser allow arbitrary text before timed text? Is there a way to identify this?

Done Looks Like

Transcript component parses valid WebVTT files
Only timed text blocks are displayed to end user
Region and styling blocks are ignored; not used to style text

Related resources

The text was updated successfully, but these errors were encountered:

elynema · 2024-04-08T13:33:30Z

Dananji not trying to validate or parse what is within the block, just identifying them and skipping them.

Styling should not be displayed by end user; supposed to be read by parser and used to display text. Region is used for caption display only, so will be ignored for transcript text.

elynema · 2024-04-08T13:39:37Z

@joncameron Dananji's suggestion for this first pass was that styling and region info be ignored in the transcript context. Region info at least is intended for caption display, and does not pertain to transcript display. That sound ok?

elynema · 2024-04-09T13:39:53Z

Notes at top of transcript or with the transcript will be shown as plain text, but they do not have any timing component so span the time and text columns.

Dananji · 2024-04-26T20:32:33Z

This can be tested on Ramp demo site.

joncameron · 2024-05-16T19:58:38Z

Works great; I created an issue for a small rendering bug I saw while testing: #500.

Dananji added bug 🐛 Something isn't working transcripts Transcript component related labels Feb 29, 2024

Dananji self-assigned this Apr 5, 2024

Dananji mentioned this issue Apr 9, 2024

Vtt parse fix #476

Merged

This was referenced May 10, 2024

New Ramp build avalonmediasystem/avalon#5829

Merged

Transcript component parses 'note' within cue-text as plain text #499

Open

joncameron mentioned this issue May 16, 2024

First VTT Cue Timestamp Doesn't Render Correctly in Transcript #500

Open

1 task

joncameron closed this as completed May 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Update Transcript Parsing to support Valid WebVTT formats #440

[BUG] Update Transcript Parsing to support Valid WebVTT formats #440

Dananji commented Feb 29, 2024 •

edited by elynema

elynema commented Apr 8, 2024 •

edited

elynema commented Apr 8, 2024

elynema commented Apr 9, 2024

Dananji commented Apr 26, 2024

joncameron commented May 16, 2024

[BUG] Update Transcript Parsing to support Valid WebVTT formats #440

[BUG] Update Transcript Parsing to support Valid WebVTT formats #440

Comments

Dananji commented Feb 29, 2024 • edited by elynema

Description

Done Looks Like

Related resources

elynema commented Apr 8, 2024 • edited

elynema commented Apr 8, 2024

elynema commented Apr 9, 2024

Dananji commented Apr 26, 2024

joncameron commented May 16, 2024

Dananji commented Feb 29, 2024 •

edited by elynema

elynema commented Apr 8, 2024 •

edited