Experimental: I have not (yet) researched how acceptable synthesised speech is to end-users for audio description on the Web.
Organisations frequently produce short videos for social media and the Web, with many short cuts and few gaps in the audio track. Even with the benefit of planning, it would be hard to find spaces for standard audio description.
The proposed solution combines:
- Extended audio description
- WebVTT tracks for 'descriptions'
- WebVTT files containing JSON metadata
- Speech Synthesis Web API (widely supported)
Work-In-Progress!
An example of WebVTT containing extended audio description and metadata:
WEBVTT
...
4
00:00:16.000 --> 00:00:17.000
The next line is extended audio description!
5
00:00:18.000 --> 00:00:18.400
Now the yacht is sailing down wind, parallel with the shore.
6
00:00:18.600 --> 00:00:19.100
{
"pauseMedia": 3000
}
7
00:00:21.000 --> 00:00:22.000
End of extended audio description.More example VTT.
- License: MIT