index.bs

<pre class='metadata'>
Title: WebVTT: The Web Video Text Tracks Format
H1: WebVTT: The Web Video Text Tracks Format
Shortname: webvtt1
Status: CG-DRAFT
Group: texttracks
ED: https://w3c.github.io/webvtt/
Level: 1
Editor: Simon Pieters, Opera Software ASA http://www.opera.com/, simonp@opera.com
Editor: Courtney Kennedy, Apple Inc. http://www.apple.com/, ckennedy@apple.com
Former Editor: Silvia Pfeiffer, NICTA http://nicta.com.au/, silviapfeiffer1@gmail.com
Former Editor: Philip Jägenstedt, Opera Software ASA http://www.opera.com/, philipj@opera.com
Former Editor: Ian Hickson, Google http://www.google.com/, ian@hixie.ch
!Participate: <a href=https://github.com/w3c/webvtt>GitHub w3c/webvtt</a> (<a href=https://github.com/w3c/webvtt/issues/new>new issue</a>, <a href=https://github.com/w3c/webvtt/issues>open issues</a>, <a href=https://www.w3.org/Bugs/Public/buglist.cgi?product=TextTracks%20CG&component=WebVTT&resolution=--->legacy open bugs</a>)
!Commits: <a href=https://github.com/w3c/webvtt/commits>GitHub w3c/webvtt/commits</a>
Test Suite: https://github.com/w3c/web-platform-tests/tree/master/webvtt
Abstract: This specification defines WebVTT, the Web Video Text Tracks format. Its main use is for marking up external text track resources in connection with the HTML &lt;track> element.
Abstract: WebVTT files provide captions or subtitles for video content, and also text video descriptions [[MAUR]], chapters for content navigation, and more generally any form of metadata that is time-aligned with audio or video content.
Boilerplate: omit conformance, omit feedback-header

</pre>

<pre class='anchors'>
urlPrefix: https://html.spec.whatwg.org/multipage/
    type: dfn
        urlPrefix: infrastructure.html
            text: ascii digits
            text: split a string on spaces
            text: skip whitespace
            text: alphanumeric ascii characters
            text: space character
        urlPrefix: embedded-content.html
            text: text track kind
            text: text track cue
            text: text track list of cues
            text: text track
            text: list of text tracks
            text: media element
            text: text track mode
            text: text track showing
            text: rules for updating the text track rendering
            text: text track cue active flag
            text: text track cue text
            text: text track cue display state
            text: current playback position
            text: text track cue identifier
            text: text track cue pause-on-exit flag
            text: rules for extracting the chapter title
            text: text track cue start time
            text: text track cue end time
            text: expose a user interface to the user
            text: text track cue order
    type: element-attr
        urlPrefix: dom.html
            text: title; url: #attr-title
            text: lang; url: #attr-lang
            text: class; url: #classes
urlPrefix: https://encoding.spec.whatwg.org/
    type: dfn
        text: utf-8 decode
urlPrefix: https://heycam.github.io/webidl/
    type: exception
        text: IndexSizeError
</pre>

<pre class=link-defaults>
spec:dom-ls; type:interface; text:Document
spec:css-ruby-1; type:value; text:ruby-base
spec:css-color-4; type:property; text:color
spec:css-fonts-3; type:property; text:font-style
spec:css-fonts-3; type:property; text:font-weight
spec:css-ruby-1; type:value; text:ruby
spec:css-ruby-1; type:value; text:ruby-text
spec:css21; type:property; text:min-height
spec:css21; type:property; text:max-height
spec:css-flexbox-1; type:value; text:inline-flex
</pre>

<pre class=biblio>
{
    "MAUR": {
        "authors": [ "Shane McCarron", "Michael Cooper", "Mark Sadecki" ],
        "href": "http://www.w3.org/TR/media-accessibility-reqs/",
        "title": "Media Accessibility User Requirements",
        "status": "WD",
        "publisher": "W3C"
    }
}
</pre>

<h2 id=introduction>Introduction</h2>

<p><i>This section is non-normative.</i></p>

<p>The <dfn>WebVTT</dfn> (Web Video Text Tracks) format is intended for marking up external text
track resources.</p>

<p>The main use for WebVTT files is captioning or subtitling video content. Here is a sample file
that captions an interview:</p>

<pre>
WEBVTT

00:11.000 --> 00:13.000
&lt;v Roger Bingham>We are in New York City

00:13.000 --> 00:16.000
&lt;v Roger Bingham>We're actually at the Lucern Hotel, just down the street

00:16.000 --> 00:18.000
&lt;v Roger Bingham>from the American Museum of Natural History

00:18.000 --> 00:20.000
&lt;v Roger Bingham>And with me is Neil deGrasse Tyson

00:20.000 --> 00:22.000
&lt;v Roger Bingham>Astrophysicist, Director of the Hayden Planetarium

00:22.000 --> 00:24.000
&lt;v Roger Bingham>at the AMNH.

00:24.000 --> 00:26.000
&lt;v Roger Bingham>Thank you for walking down here.

00:27.000 --> 00:30.000
&lt;v Roger Bingham>And I want to do a follow-up on the last conversation we did.

00:30.000 --> 00:31.500 align:end size:50%
&lt;v Roger Bingham>When we e-mailed&mdash;

00:30.500 --> 00:32.500 align:start size:50%
&lt;v Neil deGrasse Tyson>Didn't we talk about enough in that conversation?

00:32.000 --> 00:35.500 align:end size:50%
&lt;v Roger Bingham>No! No no no no; 'cos 'cos obviously 'cos

00:32.500 --> 00:33.500 align:start size:50%
&lt;v Neil deGrasse Tyson>&lt;i>Laughs&lt;/i>

00:35.500 --> 00:38.000
&lt;v Roger Bingham>You know I'm so excited my glasses are falling off here.
</pre>

<h3 id=introduction-multiple-lines>Cues with multiple lines</h3>

<p><i>This section is non-normative.</i></p>

<p>Line breaks in cues are honored. User agents will also insert extra line breaks if necessary to
fit the cue in the cue's width. In general, therefore, authors are encouraged to write cues all on
one line except when a line break is definitely necessary.</p>

<div class="example">

 <p>These captions on a public service announcement video demonstrate line breaking:</p>

 <pre>
 WEBVTT

 00:01.000 --> 00:04.000
 Never drink liquid nitrogen.

 00:05.000 --> 00:09.000
 &mdash; It will perforate your stomach.
 &mdash; You could die.

 00:10.000 --> 00:14.000
 The Organisation for Sample Public Service Announcements accepts no liability for the content of this advertisement, or for the consequences of any actions taken on the basis of the information provided.
 </pre>

 <p>The first cue is simple, it will probably just display on one line. The second will take two
 lines, one for each speaker. The third will wrap to fit the width of the video, possibly taking
 multiple lines. For example, the three cues could look like this:</p>

 <!-- 50 -->
 <pre>
 &nbsp;          Never drink liquid nitrogen.

         &mdash; It will perforate your stomach.
                 &mdash; You could die.

     The Organisation for Sample Public Service
     Announcements accepts no liability for the
     content of this advertisement, or for the
      consequences of any actions taken on the
         basis of the information provided.
 </pre>

 <p>If the width of the cues is smaller, the first two cues could wrap as well, as in the following
 example. Note how the second cue's explicit line break is still honored, however:</p>

 <!-- 25 -->
 <pre>
 &nbsp;     Never drink
     liquid nitrogen.

   &mdash; It will perforate
       your stomach.
     &mdash; You could die.

   The Organisation for
   Sample Public Service
   Announcements accepts
   no liability for the
      content of this
   advertisement, or for
    the consequences of
   any actions taken on
     the basis of the
   information provided.
 </pre>

 <p>Also notice how the wrapping is done so as to keep the line lengths balanced.</p>

</div>

<h3 id=introduction-comments>Comments</h3>

<p><i>This section is non-normative.</i></p>

<p>Comments can be included in WebVTT files.</p>

<p>Comments are just blocks that are preceded by a blank line, start with the word
"<code>NOTE</code>" (followed by a space or newline), and end at the first blank line.</p>

<div class="example">

 <p>Here, a one-line comment is used to note a possible problem with a cue.</p>

 <pre>
 WEBVTT

 00:01.000 --> 00:04.000
 Never drink liquid nitrogen.

 NOTE I'm not sure the timing is right on the following cue.

 00:05.000 --> 00:09.000
 &mdash; It will perforate your stomach.
 &mdash; You could die.
 </pre>

</div>

<div class="example">

 <p>In this example, the author has written many comments.</p>

 <pre>
 WEBVTT

 NOTE
 This file was written by Jill. I hope
 you enjoy reading it. Some things to
 bear in mind:
 - I was lip-reading, so the cues may
 not be 100% accurate
 - I didn't pay too close attention to
 when the cues should start or end.

 00:01.000 --> 00:04.000
 Never drink liquid nitrogen.

 NOTE check next cue

 00:05.000 --> 00:09.000
 &mdash; It will perforate your stomach.
 &mdash; You could die.

 NOTE end of file
 </pre>

</div>

<h3 id=introduction-other-features>Other features</h3>

<p><i>This section is non-normative.</i></p>

<p>WebVTT also supports some less-often used features.</p>

<div class="example">

 <p>In this example, the cues have an identifier:</p>

 <pre>
 WEBVTT

 1
 00:00.000 --> 00:02.000
 That's an, an, that's an L!

 crédit de transcription
 00:04.000 --> 00:05.000
 Transcrit par Célestes&trade;
 </pre>

 <p>This allows a style sheet to specifically target the cues (notice the use of CSS character
 escape sequences):</p>

 <pre>
 ::cue(#\31) { color: green; }
 ::cue(#crédit\ de\ transcription) { color: red; }
 </pre>

</div>

<div class="example">

 <p>In this example, each cue says who is talking using voice spans. In the first cue, the span
 specifying the speaker is also annotated with two classes, "first" and "loud". In the third cue,
 there is also some italics text (not associated with a specific speaker). The last cue is annotated
 with just the class "loud".</p>

 <pre>
 WEBVTT

 00:00.000 --> 00:02.000
 &lt;v.first.loud Esme>It's a blue apple tree!

 00:02.000 --> 00:04.000
 &lt;v Mary>No way!

 00:04.000 --> 00:06.000
 &lt;v Esme>Hee!&lt;/v> &lt;i>laughter&lt;/i>

 00:06.000 --> 00:08.000
 &lt;v.loud Mary>That's awesome!
 </pre>

 <p>Notice that as a special exception, the voice spans don't have to be closed if they cover the
 entire cue text.</p>

 <p>Style sheets can style these spans:</p>

 <pre>
 ::cue(v[voice="Esme"]) { color: blue }
 ::cue(v[voice="Mary"]) { color: green }
 ::cue(i) { font-style: italic }
 ::cue(.loud) { font-size: 2em }
 </pre>

</div>

<div class="example">
 <p>This example shows how to position cues at explicit positions in the video viewport.</p>

 <pre>
 WEBVTT

 00:00:00.000 --> 00:00:04.000 position:10%,start align:start size:35%
 Where did he go?

 00:00:03.000 --> 00:00:06.500 position:90% align:end size:35%
 I think he went down this lane.

 00:00:04.000 --> 00:00:06.500 position:45%,end align:middle size:35%
 What are you waiting for?
 </pre>

 <p>Since the cues in these examples are horizontal, the "position" setting refers to a percentage
 of the width of the video viewpoint. If the text were vertical, the "position" setting would refer
 to the height of the viewport.</p>

 <p>The "start" or "end" only refers to the physical side of the box to which the "position" setting
 applies, in a way which is agnostic regarding the horizontal or vertical direction of the cue. It
 does not affect or relate to the direction or position of the text itself within the box.</p>

 <p>The cues cover only 35% of the video viewport's width - that's the <a lt="WebVTT cue box">cue
 box</a>'s "size" for all three cues.</p>

 <p>The first cue has its <a lt="WebVTT cue box">cue box</a> positioned at the 10% mark. The "start"
 and "end" within the "position" setting indicates which side of the <a lt="WebVTT cue box">cue
 box</a> the position refers to. Since in this case the text is horizontal, "start" refers to the
 left side of the box, and the cue box is thus positioned between the 10% and the 45% mark of the
 video viewport's width, probably underneath a speaker on the left of the video image. If the cue
 was vertical, "start" positioning would be from the top of the video viewport's height and the <a
 lt="WebVTT cue box">cue box</a> would cover 35% of the video viewport's height.</p>

 <p>The text within the first cue's cue box is aligned using the "align" cue setting. For
 left-to-right rendered text, "start" alignment is the left of that box, for right-to-left rendered
 text the right of the box. So, independent of the directionality of the text, it will stay
 underneath that speaker. Note that "start" alignment of the cue box is the default for start
 aligned text, so does not need to be specified in "position".</p>

 <p>The second cue has its <a lt="WebVTT cue box">cue box</a> right aligned at the 90% mark of the
 video viewport width ("end" aligned text right aligns the box). The same effect can be achieved
 with "position:55%,start", which explicitly positions the cue box. The third cue has middle aligned
 text within the same positioned cue box as the first cue.</p>

</div>

<div class="example">
 <p>This example shows two regions containing rollup captions for two different speakers. Fred's
 cues scroll up in a region in the left half of the video, Bill's cues scroll up in a region on the
 right half of the video. Fred's first cue disappears at 12.5sec even though it is defined until
 20sec because its region is limited to 3 lines and at 12.5sec a fourth cue appears:</p>

 <pre>
 WEBVTT
 Region: id=fred width=40% lines=3 regionanchor=0%,100% viewportanchor=10%,90% scroll=up
 Region: id=bill width=40% lines=3 regionanchor=100%,100% viewportanchor=90%,90% scroll=up

 00:00:00.000 --> 00:00:20.000 region:fred align:left
 &lt;v Fred>Hi, my name is Fred

 00:00:02.500 --> 00:00:22.500 region:bill align:right
 &lt;v Bill>Hi, I'm Bill

 00:00:05.000 --> 00:00:25.000 region:fred align:left
 &lt;v Fred>Would you like to get a coffee?

 00:00:07.500 --> 00:00:27.500 region:bill align:right
 &lt;v Bill>Sure! I've only had one today.

 00:00:10.000 --> 00:00:30.000 region:fred align:left
 &lt;v Fred>This is my fourth!

 00:00:12.500 --> 00:00:32.500 region:fred align:left
 &lt;v Fred>OK, let's go.
 </pre>

 <p>Note that regions are only defined for horizontal cues.</p>

</div>


<h2 id=conformance>Conformance</h2>

<h3 id=conformance-for-authors>Conformance for authors</h3>

<p>The <a href="#syntax">Syntax</a> section of this specification defines what consists a valid
WebVTT document. Authors need to follow the <a href="#syntax">Syntax</a> specification and are
encouraged to use a validator.</p>

<p>The <a href="#parsing">Parsing</a> section of this specification defines in some detail the
required processing for valid and also for invalid documents. It is a little more tolerant to author
errors than the syntax allows, so as to reject less documents and provide for extensibility.
However, authors must not take advantage of it. Only documents that follow the <a
href="#syntax">Syntax</a> specification are valid.</p>


<h3 id=unicode-normalization>Unicode normalization</h3>

<p>Implementations of this specification must not normalize Unicode text during processing.</p>

<p>For example, a cue with the identifier consisting of the characters U+0041 LATIN CAPITAL LETTER A
and U+212B ANGSTROM SIGN will not match a selector targeting a cue with an ID consisting of the
character U+00C5 LATIN CAPITAL LETTER A WITH RING ABOVE.</p>


<h3 id=document-conformance>Document conformance</h3>

<p>All diagrams, examples, and notes in this specification are non-normative, as are all sections
explicitly marked non-normative. Everything else in this specification is normative.</p>

<p>The key words "MUST", "MUST NOT", "SHOULD", "SHOULD NOT", "MAY", and "OPTIONAL" in the normative
parts of this document are to be interpreted as described in RFC2119. The key word "OPTIONALLY" in
the normative parts of this document is to be interpreted with the same normative meaning as "MAY"
and "OPTIONAL". For readability, these words do not appear in all uppercase letters in this
specification. [[!RFC2119]]</p>

<p>Requirements phrased in the imperative as part of algorithms (such as "strip any leading space
characters" or "return false and abort these steps") are to be interpreted with the meaning of the
key word ("must", "should", "may", etc) used in introducing the algorithm.</p>

<p>Conformance requirements phrased as algorithms or specific steps may be implemented in any
manner, so long as the end result is equivalent. (In particular, the algorithms defined in this
specification are intended to be easy to follow, and not intended to be performant.)</p>


<h2 id=data-model>Data model</h2>
<!-- Add some content here about cues and serialisation format in general -->
<!-- Describe metadata, caption/subtitle, chapter & description cues -->


<h3 id=cues>WebVTT cues</h3>

<p>A <dfn>WebVTT cue</dfn> is a <a>text track cue</a> that additionally consist of the following:
[[!HTML]]</p>

<dl>

 <dt><dfn lt="WebVTT cue box">A cue box</dfn></dt>
 <dd>
  <p>The cue box of a <a>WebVTT cue</a> is a box within which the text of all lines of the cue is to
  be rendered.</p>

  <p class="note">The position of the <a lt="WebVTT cue box">cue box</a> within the video viewport's
  dimensions depends on the value of the <a>WebVTT cue position</a> and the <a>WebVTT cue
  line</a>.</p>

  <p class="note">Lines are wrapped within the <a lt="WebVTT cue box">cue box</a>'s <a lt="WebVTT
  cue size">size</a> if lines' lengths make this necessary.</p>

 </dd>

 <dt><dfn lt="WebVTT cue writing direction">A writing direction</dfn></dt>
 <dd>
  <p>A writing direction, either</p>
  <ul>

   <li><dfn lt="WebVTT cue horizontal writing direction">horizontal</dfn> (a line extends
   horizontally and is offset vertically from the video viewport's top edge, with consecutive lines
   displayed below each other),</li>

   <li> <dfn lt="WebVTT cue vertical growing left writing direction">vertical growing left</dfn> (a
   line extends vertically and is offset horizontally from the video viewport's
   right edge, with consecutive lines displayed to the left of each other<!-- used for east
   asian-->), or</li>

   <li><dfn lt="WebVTT cue vertical growing right writing direction">vertical growing right</dfn> (a
   line extends vertically and is offset horizontally from the video viewport's left edge, with
   consecutive lines displayed to the right of each other<!-- used for mongolian -->).</li>

  </ul>

  <p>If the <a lt="WebVTT cue writing direction">writing direction</a> is <a lt="WebVTT cue
  horizontal writing direction">horizontal</a>, then the <a lt="WebVTT cue line">line</a>
  percentages are relative to the height of the video, and <a lt="WebVTT cue position">position</a>
  and <a lt="WebVTT cue size">size</a> percentages are relative to the width of the video.</p>

  <p>Otherwise, <a lt="WebVTT cue line">line</a> percentages are relative to the width of the video,
  and <a lt="WebVTT cue position">position</a> and <a lt="WebVTT cue size">size</a> percentages are
  relative to the height of the video.</p>

  <p>The <a lt="WebVTT cue writing direction">writing direction</a> defaults to <a lt="WebVTT cue
  horizontal writing direction">horizontal</a>.</p>

 </dd>

 <dt><dfn lt="WebVTT cue snap-to-lines flag">A snap-to-lines flag</dfn></dt>
 <dd>

  <p>A boolean indicating whether the <a lt="WebVTT cue line">line</a> is an integer number of lines
  (using the line dimensions of the first line of the cue), or whether it is a percentage of the
  dimension of the video. The flag is set when lines are counted, unset otherwise.</p>

  <p>Cues whose <a>WebVTT cue snap-to-lines flag</a> is set will be placed within the title-safe
  area on user agents that use overscan. Cues with the flag unset will be offset as requested
  (modulo overlap avoidance if multiple cues are in the same place).</p>

  <p>By default, the <a lt="WebVTT cue snap-to-lines flag">snap-to-lines flag</a> is set to
  true.</p>

 </dd>

 <dt><dfn lt="WebVTT cue line">A line</dfn></dt>
 <dd>
  <p>The <a lt="WebVTT cue line">line</a> defines positioning of the <a lt="WebVTT cue box">cue
  box</a>.</p>

  <p>The <a lt="WebVTT cue line">line</a> offsets the <a lt="WebVTT cue box">cue box</a> from the
  top, the right or left of the video viewport as defined by the <a lt="WebVTT cue writing
  direction">writing direction</a>, the <a lt="WebVTT cue snap-to-lines flag">snap-to-lines
  flag</a>, or the lines occupied by any other showing tracks.</p>

  <p>The <a lt="WebVTT cue line">line</a> is set either as a number of lines, a percentage of the
  video viewport height or width, or as the special value <dfn lt="WebVTT cue line
  automatic">auto</dfn>, which means the offset is to depend on the other showing tracks.</p>

  <p>A <a>WebVTT cue</a> has a <dfn lt="cue computed line">computed line</dfn> whose value is that
  returned by the following algorithm, which is defined in terms of the other aspects of the
  cue:</p>

  <ol>

   <li><p>If the <a lt="WebVTT cue line">line</a> is numeric, the <a>WebVTT cue snap-to-lines
   flag</a> of the <a>WebVTT cue</a> is not set, and the <a lt="WebVTT cue line">line</a> is
   negative or greater than 100, then return 100 and abort these steps.</p></li>

   <li><p>If the <a lt="WebVTT cue line">line</a> is numeric, return the value of the <a>WebVTT cue
   line</a> and abort these steps. (Either the <a>WebVTT cue snap-to-lines flag</a> is set, so any
   value, not just those in the range 0..100, is valid, or the value is in the range 0..100 and is
   thus valid regardless of the value of that flag.)</p></li>

   <li><p>If the <a>WebVTT cue snap-to-lines flag</a> of the <a>WebVTT cue</a> is not set, return
   the value 100 and abort these steps. (The <a lt="WebVTT cue line">line</a> is the special value
   <a lt="WebVTT cue line automatic">auto</a>.)</p></li>

   <li><p>Let <var>cue</var> be the <a>WebVTT cue</a>.</p></li>

   <li><p>If <var>cue</var> is not in a <a lt="text track list of cues">list of cues</a> of a
   <a>text track</a>, or if that <a>text track</a> is not in the <a>list of text tracks</a> of a
   <a>media element</a>, return &#x2212;1 and abort these steps.</p></li>

   <li><p>Let <var>track</var> be the <a>text track</a> whose <a lt="text track list of cues">list
   of cues</a> the <var>cue</var> is in.</p></li>

   <li><p>Let <var>n</var> be the number of <a lt="text track">text tracks</a> whose <a>text track
   mode</a> is <a lt="text track showing">showing</a> and that are in the <a>media element</a>'s
   <a>list of text tracks</a> before <var>track</var>.</p></li>

   <li><p>Increment <var>n</var> by one.</p></li>

   <li><p>Negate <var>n</var>.</p></li>

   <li><p>Return <var>n</var>.</p></li>

  </ol>

 </dd>

 <dt><dfn lt="WebVTT cue line alignment">A line alignment</dfn></dt>
 <dd>
  <p>An alignment for the <a lt="WebVTT cue box">cue box</a>'s <a lt="WebVTT cue line">line</a>, one
  of:</p>

  <dl>

   <dt><dfn lt="WebVTT cue line start alignment">Start alignment</dfn></dt>
   <dd>The <a lt="WebVTT cue box">cue box</a>'s top side (for <a lt="WebVTT cue horizontal writing
   direction">horizontal</a> cues), left side (for <a lt="WebVTT cue vertical growing right writing
   direction">vertical growing right</a>), or right side (for <a lt="WebVTT cue vertical growing
   left writing direction">vertical growing left</a>) is aligned at the <a lt="WebVTT cue
   line">line</a>.</dd>

   <dt><dfn lt="WebVTT cue line middle alignment">Middle alignment</dfn></dt>
   <dd>The <a lt="WebVTT cue box">cue box</a> is centered at the <a lt="WebVTT cue
   line">line</a>.</dd>

   <dt><dfn lt="WebVTT cue line end alignment">End alignment</dfn></dt>
   <dd>The <a lt="WebVTT cue box">cue box</a>'s bottom side (for <a lt="WebVTT cue horizontal
   writing direction">horizontal</a> cues), right side (for <a lt="WebVTT cue vertical growing right
   writing direction">vertical growing right</a>), or left side (for <a lt="WebVTT cue vertical
   growing left writing direction">vertical growing left</a>) is aligned at the <a lt="WebVTT cue
   line">line</a>.</dd>

  </dl>

  <p>A <a>WebVTT cue</a> has a default <a>WebVTT cue line alignment</a> of <a lt="WebVTT cue line
  start alignment">start</a>.</p>

 </dd>

 <dt><dfn lt="WebVTT cue position">A position</dfn></dt>
 <dd>
  <p>The <a lt="WebVTT cue position">position</a> defines the indent of the <a lt="WebVTT cue
  box">cue box</a> in the direction defined by the <a lt="WebVTT cue writing direction">writing
  direction</a>.</p>

  <p>The <a lt="WebVTT cue position">position</a> is either a number giving the position of the <a
  lt="WebVTT cue box">cue box</a> as a percentage value or the special value <dfn lt="WebVTT cue
  automatic position">auto</dfn>, which means the position is to depend on the <a lt="WebVTT cue
  text alignment">text alignment</a> of the cue.</p>

  <p>If the cue is not within a <a lt="WebVTT region">region</a>, the percentage value is to be
  interpreted as a percentage of the video dimensions, otherwise as a percentage of the region
  dimensions.</p>

  <p>A <a>WebVTT cue</a> has a <dfn lt="cue computed position">computed position</dfn> whose value
  is that returned by the following algorithm, which is defined in terms of the other aspects of the
  cue:</p>

  <ol>

   <li><p>If the <a lt="WebVTT cue position">position</a> is numeric, then return the value of the
   <a lt="WebVTT cue position">position</a> and abort these steps. (Otherwise, the <a lt="WebVTT cue
   position">position</a> is the special value <a lt="WebVTT cue automatic
   position">auto</a>.)</p></li>

   <li><p>If the <a lt="WebVTT cue text alignment">cue text alignment</a> is <a lt="WebVTT cue start
   alignment">start</a> or <a lt="WebVTT cue left alignment">left</a>, return 0 and abort these
   steps.</p></li>

   <li><p>If the <a lt="WebVTT cue text alignment">cue text alignment</a> is <a lt="WebVTT cue end
   alignment">end</a> or <a lt="WebVTT cue right alignment">right</a>, return 100 and abort these
   steps.</p></li>

   <li><p>If the <a lt="WebVTT cue text alignment">cue text alignment</a> is <a lt="WebVTT cue
   middle alignment">middle</a>, return 50 and abort these steps.</p></li>

  </ol>

  <p class="note">Since the default value of the <a>WebVTT cue position alignment</a> is <a
  lt="WebVTT cue middle alignment">middle</a>, if there is no <a>WebVTT cue text alignment</a>
  setting for a cue, the <a>WebVTT cue position</a> defaults to 50%.</p>

  <p class="note">Even for <a lt="WebVTT cue horizontal writing direction">horizontal</a> cues with
  right-to-left <i>paragraph direction</i> text, the <a lt="WebVTT cue box">cue box</a> is
  positioned from the left edge of the video viewport. This allows defining a rendering space
  template which can be filled with either left-to-right or right-to-left <i>paragraph direction</i>
  text. If such a <a lt="WebVTT cue box">cue box</a> template is created with <a lt="WebVTT cue
  start alignment">start</a> or <a lt="WebVTT cue end alignment">end</a> aligned text, it is best to
  also specify a <a lt="WebVTT cue size">size</a> since otherwise the text may flip from one side of
  the video viewport to the other.</p>

 </dd>

 <dt><dfn lt="WebVTT cue position alignment">A position alignment</dfn></dt>
 <dd>
  <p>An alignment for the <a lt="WebVTT cue box">cue box</a> in the dimension of the <a lt="WebVTT
  cue writing direction">writing direction</a>, describing what the <a lt="WebVTT cue
  position">position</a> is anchored to, one of:</p>

  <dl>

   <dt><dfn lt="WebVTT cue position start alignment">Start alignment</dfn></dt>
   <dd>The <a lt="WebVTT cue box">cue box</a>'s left side (for <a lt="WebVTT cue horizontal writing
   direction">horizontal</a> cues) or top side (otherwise) is aligned at the <a lt="WebVTT cue
   position">position</a>.</dd>

   <dt><dfn lt="WebVTT cue position middle alignment">Middle alignment</dfn></dt>
   <dd>The <a lt="WebVTT cue box">cue box</a> is centered at the <a lt="WebVTT cue
   position">position</a>.</dd>

   <dt><dfn lt="WebVTT cue position end alignment">End alignment</dfn></dt>
   <dd>The <a lt="WebVTT cue box">cue box</a>'s right side (for <a lt="WebVTT cue horizontal writing
   direction">horizontal</a> cues) or bottom side (otherwise) is aligned at the <a lt="WebVTT cue
   position">position</a>.</dd>

   <dt><dfn lt="WebVTT cue position automatic alignment">Auto alignment</dfn></dt>
   <dd>The <a lt="WebVTT cue box">cue box</a>'s alignment depends on the value of the <a lt="WebVTT
   cue text alignment">text alignment</a> of the cue.</dd>

  </dl>

  <p>A <a>WebVTT cue</a> has a <dfn lt="cue computed position alignment">computed position
  alignment</dfn> whose value is that returned by the following algorithm, which is defined in terms
  of other aspects of the cue:</p>

  <ol>

   <li><p>If the <a>WebVTT cue position alignment</a> is not <a lt="WebVTT cue position automatic
   alignment">auto</a>, then return the value of the <a>WebVTT cue position alignment</a> and abort
   these steps.</p></li>

   <li><p>If the <a>WebVTT cue text alignment</a> is <a lt="WebVTT cue start alignment">start</a> or
   <a lt="WebVTT cue left alignment">left</a>, return <a lt="WebVTT cue position start
   alignment">start</a> and abort these steps.</p></li>

   <li><p>If the <a>WebVTT cue text alignment</a> is <a lt="WebVTT cue end alignment">end</a> or <a
   lt="WebVTT cue right alignment">right</a>, return <a lt="WebVTT cue position end
   alignment">end</a> and abort these steps.</p></li>

   <li><p>If the <a>WebVTT cue text alignment</a> is <a lt="WebVTT cue middle alignment">middle</a>,
   return <a lt="WebVTT cue position middle alignment">middle</a> and abort these steps.</p></li>

  </ol>

  <p class="note">Since the <a lt="WebVTT cue position">position</a> always measures from the left
  of the video (for <a lt="WebVTT cue horizontal writing direction">horizontal</a> cues) or the top
  (otherwise), the <a>WebVTT cue position alignment</a> <a lt="WebVTT cue position start
  alignment">start value</a> varies between left and top for horizontal and vertical cues, but not
  between left and right even for changing <i>paragraph direction</i>.</p>

 </dd>

 <dt><dfn lt="WebVTT cue size">A size</dfn></dt>
 <dd>
  <p>A number giving the size of the <a lt="WebVTT cue box">cue box</a>, to be interpreted as a
  percentage of the video, as defined by the <a lt="WebVTT cue writing direction">writing
  direction</a>.</p>

  <p>By default, the <a>WebVTT cue size</a> is 100%.</p>

 </dd>

 <dt><dfn lt="WebVTT cue text alignment">A text alignment</dfn></dt>
 <dd>

  <p>An alignment for all lines of text within the <a lt="WebVTT cue box">cue box</a>, in the
  dimension of the <a lt="WebVTT cue writing direction">writing direction</a> and the <i>paragraph
  direction</i> [[!BIDI]], one of:</p>

  <dl>

   <dt><dfn lt="WebVTT cue start alignment">Start alignment</dfn></dt>
   <dd>The text is aligned towards the <i>paragraph direction</i> start side of the <a lt="WebVTT
   cue box">cue box</a>.</dd>

   <dt><dfn lt="WebVTT cue middle alignment">Middle alignment</dfn></dt>
   <dd>The text is aligned centered between the box's start and end sides.</dd>

   <dt><dfn lt="WebVTT cue end alignment">End alignment</dfn></dt>
   <dd>The text is aligned towards the <i>paragraph direction</i> end side of the <a lt="WebVTT cue
   box">cue box</a>.</dd>

   <dt><dfn lt="WebVTT cue left alignment">Left alignment</dfn></dt>
   <dd>The text is aligned to the box's left side.</dd>

   <dt><dfn lt="WebVTT cue right alignment">Right alignment</dfn></dt>
   <dd>The text is aligned to the box's right side.</dd>

  </dl>

  <p>By default, the value of the <a>WebVTT cue text alignment</a> is <a lt="WebVTT cue middle
  alignment">middle aligned</a>.</p>

 </dd>

 <dt><dfn lt="WebVTT cue region">A region</dfn></dt>
 <dd>
  <p>An optional <a>WebVTT region</a> to which a cue belongs.</p>
 </dd>

</dl>

<p>The associated <a>rules for updating the text track rendering</a> of <a lt="WebVTT cue">WebVTT
cues</a> are the <a>rules for updating the display of WebVTT text tracks</a>.</p>

<div class="impl">

 <p>When a <a>WebVTT cue</a> whose <a lt="text track cue active flag">active flag</a> is set has its
 <a lt="WebVTT cue writing direction">writing direction</a>, <a lt="WebVTT cue snap-to-lines
 flag">snap-to-lines flag</a>, <a lt="WebVTT cue line">line</a>, <a lt="WebVTT cue
 position">position</a>, <a lt="WebVTT cue size">size</a>, <a lt="WebVTT cue text alignment">text
 alignment</a>, <a lt="WebVTT cue region">region</a>, or <a lt="text track cue text">text</a> change
 value, then the user agent must empty the <a>text track cue display state</a>, and then immediately
 run the <a>text track</a>'s <a>rules for updating the display of WebVTT text tracks</a>.</p>

</div>


<h3 id=regions>WebVTT regions</h3>

<p>A <dfn>WebVTT region</dfn> represents a subpart of the video viewport and provides a rendering
area for <a lt="WebVTT cue">WebVTT cues</a>.</p>

<p>Each <a>WebVTT region</a> consists of:</p>

<dl>

 <dt><dfn lt="WebVTT region identifier">An identifier</dfn></dt>
 <dd>
  <p>An arbitrary string.</p>
 </dd>

 <dt><dfn lt="WebVTT region width">A width</dfn></dt>
 <dd>
  <p>A number giving the width of the box within which the text of each line of the containing cues
  is to be rendered, to be interpreted as a percentage of the video width. Defaults to 100.</p>
 </dd>

 <dt><dfn lt="WebVTT region lines">A lines value</dfn></dt>
 <dd>
  <p>A number giving the number of lines of the box within which the text of each line of the
  containing cues is to be rendered. Defaults to 3.</p>
 </dd>

 <dt><dfn lt="WebVTT region anchor">A region anchor point</dfn></dt>
 <dd>
  <p>Two numbers giving the x and y coordinates within the region which is anchored to the video
  viewport and does not change location even when the region does, e.g. because of font size
  changes. Defaults to (0,100), i.e. the bottom left corner of the region.</p>
 </dd>

 <dt><dfn lt="WebVTT region viewport anchor">A region viewport anchor point</dfn></dt>
 <dd>
  <p>Two numbers giving the x and y coordinates within the video viewport to which the region anchor
  point is anchored. Defaults to (0,100), i.e. the bottom left corner of the viewport.</p>
 </dd>

 <dt><dfn lt="WebVTT region scroll">A scroll value</dfn></dt>
 <dd>
  <p>One of the following:</p>
  <dl>
   <dt><dfn lt="WebVTT region scroll none">None</dfn></dt>
   <dd>Indicates that the cues in the region are not to scroll and instead stay fixed at the
   location they were first painted in.</dd>

   <dt><dfn lt="WebVTT region scroll up">Up</dfn></dt>
   <dd>Indicates that the cues in the region will be added at the bottom of the region and push any
   already displayed cues in the region up until all lines of the new cue are visible in the
   region.</dd>
   <!-- in the future we may introduce scroll="down"-->
  </dl>
 </dd>
</dl>

<div class="note">
 <p>The following diagram illustrates how anchoring of a region to a video viewport works. The black
 cross is the anchor, orange explains the anchor's offset within the region and green the anchor's
 offset within the viewport. Think of it as sticking a pin through a note onto a board:</p>
 <p><img src="webvtt-region-diagram.png" alt="Within the video viewport, there is a WebVTT region.
 Inside the region, there is an anchor point marked with a black cross. The vertical and horizontal
 distance from the video viewport's edges to the anchor is marked with green arrows, representing
 the region viewport anchor X and Y offsets. The vertical and horizontal distance from the region's
 edges to the anchor is marked with orange arrows, representing the region anchor X and Y offsets.
 The size of the region is represented by the region width for the horizontal axis, and region lines
 for the vertical axis."></p>
</div>

<p>For parsing, we also need the following:</p>

<dl>
 <dt><dfn lt="text track list of regions">A text track list of regions</dfn></dt>

 <dd>

  <p>A list of zero or more <a lt="WebVTT region">WebVTT regions</a>.</p>

 </dd>
</dl>


<h2 id=syntax>Syntax</h2>


<h3 id=file-structure>WebVTT file structure</h3>

<p>A <dfn>WebVTT file</dfn> must consist of a <a>WebVTT file body</a> encoded as UTF-8 and labeled
with the <a spec=html>MIME type</a> <code>text/vtt</code>. [[!RFC3629]]</p>

<p>A <dfn>WebVTT file body</dfn> consists of the following components, in the following order:</p>

<ol>

 <li>An optional U+FEFF BYTE ORDER MARK (BOM) character.</li>

 <li>The string "<code>WEBVTT</code>".</li>

 <li>Optionally, either a U+0020 SPACE character or a U+0009 CHARACTER TABULATION (tab) character
 followed by any number of characters that are not U+000A LINE FEED (LF) or U+000D CARRIAGE RETURN
 (CR) characters.</li> <!-- allows for Emacs line -->

 <li>Exactly one <a lt="WebVTT line terminator">WebVTT line terminators</a> to terminate the line
 with the file magic and separate it from the rest of the body.</li>

 <li>Zero or more <a lt="WebVTT metadata header">WebVTT metadata headers</a>.</li>

 <li>One or more <a lt="WebVTT line terminator">WebVTT line terminators</a> to terminate the header
 block and separate the cues from the file header.</li>

 <li>Zero or more <a lt="WebVTT cue block">WebVTT cue blocks</a> and <a lt="WebVTT comment
 block">WebVTT comment blocks</a> separated from each other by one or more <a lt="WebVTT line
 terminator">WebVTT line terminators</a>.</li>

 <li>Zero or more <a lt="WebVTT line terminator">WebVTT line terminators</a>.</li>

</ol>

<p>A <dfn>WebVTT line terminator</dfn> consists of one of the following:</p>

<ul class="brief">
 <li>A U+000D CARRIAGE RETURN U+000A LINE FEED (CRLF) character pair.</li>
 <li>A single U+000A LINE FEED (LF) character.</li>
 <li>A single U+000D CARRIAGE RETURN (CR) character.</li>
</ul>

<p>A <dfn>WebVTT metadata header</dfn> consists of the following components, in the given order:</p>

<ol>
 <li>A <a>WebVTT metadata header name</a>.</li>
 <li>A U+003A COLON (colon) character.</li>
 <li>A <a>WebVTT metadata header value</a>.</li>
 <li>A <a>WebVTT line terminator</a>.</li>
</ol>

<p>A <dfn>WebVTT metadata header name</dfn> and a <dfn>WebVTT metadata header value</dfn> each
consist of any sequence of one or more characters other than U+000A LINE FEED (LF) characters and
U+000D CARRIAGE RETURN (CR) characters except that the entire resulting string must not contain the
substring "<code>--></code>" (U+002D HYPHEN-MINUS, U+002D HYPHEN-MINUS, U+003E GREATER-THAN
SIGN).</p>

<p>A <dfn>WebVTT cue block</dfn> consists of the following components, in the given order:</p>

<ol>
 <li>Optionally, a <a>WebVTT cue identifier</a> followed by a <a>WebVTT line terminator</a>.</li>
 <li><a>WebVTT cue timings</a>.</li>
 <li>Optionally, one or more U+0020 SPACE characters or U+0009 CHARACTER TABULATION (tab) characters
 followed by a <a>WebVTT cue settings list</a>.</li>
 <li>A <a>WebVTT line terminator</a>.</li>
 <li>The <dfn>cue payload</dfn>: either <a>WebVTT cue text</a>, <a>WebVTT chapter title text</a>, or
 <a>WebVTT metadata text</a>, but it must not contain the substring "<code>--></code>" (U+002D
 HYPHEN-MINUS, U+002D HYPHEN-MINUS, U+003E GREATER-THAN SIGN).</li>
 <li>A <a>WebVTT line terminator</a>.</li>
</ol>

<p class="note">A <a>WebVTT cue block</a> corresponds to one piece of time-aligned text or data in
the <a>WebVTT file</a>, for example one subtitle. The <a>cue payload</a> is the text or data
associated with the cue.</p>

<p>A <dfn>WebVTT cue identifier</dfn> is any sequence of one or more characters not containing the
substring "<code>--></code>" (U+002D HYPHEN-MINUS, U+002D HYPHEN-MINUS, U+003E GREATER-THAN SIGN),
nor containing any U+000A LINE FEED (LF) characters or U+000D CARRIAGE RETURN (CR) characters.</p>

<p>A <a>WebVTT cue identifier</a> must be unique amongst all the <a lt="WebVTT cue
identifier">WebVTT cue identifiers</a> of all <a lt="WebVTT cue">WebVTT cues</a> of a <a>WebVTT
file</a>.</p>

<p class="note">A <a>WebVTT cue identifier</a> can be used to reference a specific cue, for example
from script or CSS.</p>

<p>The <dfn>WebVTT cue timings</dfn> part of a <a>WebVTT cue block</a> consists of the following
components, in the given order:</p>

<ol>

 <!-- we could allow leading and trailing spaces and tabs, and make the space between the arrow
 either optional or allow multiple spaces or tabs -->

 <li>A <a>WebVTT timestamp</a> representing the start time offset of the cue. The time represented
 by this <a>WebVTT timestamp</a> must be greater than or equal to the start time offsets of all
 previous cues in the file.</li>

 <li>One or more U+0020 SPACE characters or U+0009 CHARACTER TABULATION (tab) characters.</li>

 <li>The string "<code>--></code>" (U+002D HYPHEN-MINUS, U+002D HYPHEN-MINUS, U+003E GREATER-THAN
 SIGN).</li>

 <li>One or more U+0020 SPACE characters or U+0009 CHARACTER TABULATION (tab) characters.</li>

 <li>A <a>WebVTT timestamp</a> representing the end time offset of the cue. The time represented by
 this <a>WebVTT timestamp</a> must be greater than the start time offset of the cue.</li>

</ol>

<p class="note">The <a>WebVTT cue timings</a> give the start and end offsets of the <a>WebVTT cue
block</a>. Different cues can overlap. Cues are always listed ordered by their start time.</p>

<p>A <dfn>WebVTT timestamp</dfn> consists of the following components, in the given order:</p>

<ol>

 <li>
  Optionally (required if <var>hours</var> is non-zero):

  <ol>

   <li>Two or more <a spec=html>ASCII digits</a>, representing the <var>hours</var> as a base ten
   integer.</li>

   <li>A U+003A COLON character (:)</li>

  </ol>

 </li>

 <li>Two <a>ASCII digits</a>, representing the <var>minutes</var> as a base ten integer in the range
 0&nbsp;&le;&nbsp;<var>minutes</var>&nbsp;&le;&nbsp;59.</li>

 <li>A U+003A COLON character (:)</li>

 <li>Two <a>ASCII digits</a>, representing the <var>seconds</var> as a base ten integer in the range
 0&nbsp;&le;&nbsp;<var>seconds</var>&nbsp;&le;&nbsp;59.</li>

 <li>A U+002E FULL STOP character (.).</li>

 <li>Three <a>ASCII digits</a>, representing the thousandths of a second <var>seconds-frac</var> as
 a base ten integer.</li>

</ol>

<p class="note">A <a>WebVTT timestamp</a> is always interpreted relative to the <a>current playback
position</a> of the media data that the WebVTT file is to be synchronized with.</p>

<p>A <dfn>WebVTT cue settings list</dfn> consist of a sequence of zero or more <dfn lt="WebVTT cue
setting">WebVTT cue settings</dfn> in any order, separated from each other by one or more U+0020
SPACE characters or U+0009 CHARACTER TABULATION (tab) characters. Each setting consists of the
following components, in the order given:</p>

<ol>
 <li>A <a lt="WebVTT cue setting name">WebVTT cue setting name</a>.</li>
 <li>An optional U+003A COLON (colon) character.</li>
 <li>An optional <a lt="WebVTT cue setting value">WebVTT cue setting value</a>.</li>
</ol>

<p>A <dfn>WebVTT cue setting name</dfn> and a <dfn>WebVTT cue setting value</dfn> each consist of
any sequence of one or more characters other than U+000A LINE FEED (LF) characters and - U+000D
CARRIAGE RETURN (CR) characters except that the entire resulting string must not contain the
substring "<code>--></code>" (U+002D HYPHEN-MINUS, U+002D HYPHEN-MINUS, U+003E GREATER-THAN
SIGN).</p>

<p>A <dfn>WebVTT percentage</dfn> consists of the following components:</p>

<ol>
 <li>One or more <a>ASCII digits</a>.</li>
 <li>
  Optionally:
  <ol>
   <li>A U+002E DOT character (.).</li>
   <li>One or more <a>ASCII digits</a>.</li>
  </ol>
 </li>
 <li>A U+0025 PERCENT SIGN character (%).</li>
</ol>

<p>When interpreted as a number, a <a>WebVTT percentage</a> must be in the range 0..100.</p>

<p>A <dfn>WebVTT comment block</dfn> consists of the following components, in the given order:</p>

<ol>
 <li>The string "<code>NOTE</code>".</li>
 <li>
  Optionally, the following components, in the given order:
  <ol>
   <li>
    Either:
    <ul>
     <li>A U+0020 SPACE character or U+0009 CHARACTER TABULATION (tab) character.</li>
     <li>A <a>WebVTT line terminator</a>.</li>
    </ul>
   </li>
   <li>Any sequence of zero or more characters other than U+000A LINE FEED (LF) characters and
   U+000D CARRIAGE RETURN (CR) characters, each optionally separated from the next by a <a>WebVTT
   line terminator</a>, except that the entire resulting string must not contain the substring
   "<code>--></code>" (U+002D HYPHEN-MINUS, U+002D HYPHEN-MINUS, U+003E GREATER-THAN SIGN).</li>
  </ol>
 </li>
 <li>A <a>WebVTT line terminator</a>.</li>
</ol>

<p class="note">A <a>WebVTT comment block</a> is ignored by the parser.</p>


<h3 id=types-of-webvtt-cue-payload>Types of WebVTT cue payload</h3>


<h4 id=metadata-text>WebVTT metadata text</h4>

<p><dfn>WebVTT metadata text</dfn> consists of any sequence of zero or more characters other than
U+000A LINE FEED (LF) characters and U+000D CARRIAGE RETURN (CR) characters, each optionally
separated from the next by a <a>WebVTT line terminator</a>. (In other words, any text that does not
have two consecutive <a lt="WebVTT line terminator">WebVTT line terminators</a> and does not start
or end with a <a>WebVTT line terminator</a>.)</p>

<p><a>WebVTT metadata text</a> cues are only useful for scripted applications (using the
<code>metadata</code> <a>text track kind</a>).</p>


<h4 id=cue-text>WebVTT cue text</h4>

<p><dfn>WebVTT cue text</dfn> is <a>cue payload</a> that consists of zero or more <a>WebVTT cue
components</a>, in any order, each optionally separated from the next by a <a>WebVTT line
terminator</a>.</p>

<p>The <dfn>WebVTT cue components</dfn> are:</p>

<ul>

 <li>A <a>WebVTT cue class span</a>.</li>
 <li>A <a>WebVTT cue italics span</a>.</li>
 <li>A <a>WebVTT cue bold span</a>.</li>
 <li>A <a>WebVTT cue underline span</a>.</li>
 <li>A <a>WebVTT cue ruby span</a>.</li>
 <li>A <a>WebVTT cue voice span</a>.</li>
 <li>A <a>WebVTT cue language span</a>.</li>

 <li>A <a>WebVTT cue timestamp</a>.</li>

 <li>A <a>WebVTT cue text span</a>, representing the text of the cue.</li>

 <li>A <a>WebVTT cue amp escape</a>, representing a "&amp;" character in the text of the cue.</li>
 <li>A <a>WebVTT cue lt escape</a>, representing a "&lt;" character in the text of the cue.</li>
 <li>A <a>WebVTT cue gt escape</a>, representing a "&gt;" character in the text of the cue.</li>
 <li>A <a>WebVTT cue lrm escape</a>, representing a U+200E LEFT-TO-RIGHT MARK Unicode bidirectional
 formatting character in the text of the cue.</li>
 <li>A <a>WebVTT cue rlm escape</a>, representing a U+200F RIGHT-TO-LEFT MARK Unicode bidirectional
 formatting character in the text of the cue.</li>
 <li>A <a>WebVTT cue nbsp escape</a>, representing a U+00A0 NO-BREAK SPACE character in the text of
 the cue.</li>

</ul>

<p><dfn>WebVTT cue internal text</dfn> consists of an optional <a>WebVTT line terminator</a>,
followed by zero or more <a>WebVTT cue components</a>, in any order, each optionally followed by a
<a>WebVTT line terminator</a>.</p>

<p>A <dfn>WebVTT cue class span</dfn> consists of a <a>WebVTT cue span start tag</a>
"<code>c</code>" that disallows an annotation, <a>WebVTT cue internal text</a> representing cue
text, and a <a>WebVTT cue span end tag</a> "<code>c</code>".</p>

<p>A <dfn>WebVTT cue italics span</dfn> consists of a <a>WebVTT cue span start tag</a>
"<code>i</code>" that disallows an annotation, <a>WebVTT cue internal text</a> representing the
italicized text, and a <a>WebVTT cue span end tag</a> "<code>i</code>".</p>

<p>A <dfn>WebVTT cue bold span</dfn> consists of a <a>WebVTT cue span start tag</a> "<code>b</code>"
that disallows an annotation, <a>WebVTT cue internal text</a> representing the boldened text, and a
<a>WebVTT cue span end tag</a> "<code>b</code>".</p>

<p>A <dfn>WebVTT cue underline span</dfn> consists of a <a>WebVTT cue span start tag</a>
"<code>u</code>" that disallows an annotation, <a>WebVTT cue internal text</a> representing the
underlined text, and a <a>WebVTT cue span end tag</a> "<code>u</code>".</p>

<p>A <dfn>WebVTT cue ruby span</dfn> consists of the following components, in the order given:</p>

<ol>
 <li>A <a>WebVTT cue span start tag</a> "<code>ruby</code>" that disallows an annotation.</li>
 <li>
  One or more occurrences of the following group of components, in the order given:
  <ol>
   <li><a>WebVTT cue internal text</a>, representing the ruby base.</li>
   <li>A <a>WebVTT cue span start tag</a> "<code>rt</code>" that disallows an annotation.</li>
   <li>A <dfn>WebVTT cue ruby text span</dfn>: <a>WebVTT cue internal text</a>, representing the
   ruby text component of the ruby annotation.</li>
   <li>A <a>WebVTT cue span end tag</a> "<code>rt</code>". If this is the last occurrence of this
   group of components in the <a>WebVTT cue ruby span</a>, then this last end tag string may be
   omitted.</li>
  </ol>
 </li>
 <li>If the last end tag string was not omitted: Optionally, a <a>WebVTT line terminator</a>.</li>
 <li>If the last end tag string was not omitted: Zero or more U+0020 SPACE characters or U+0009
 CHARACTER TABULATION (tab) characters, each optionally followed by a <a>WebVTT line
 terminator</a>.</li>
 <li>A <a>WebVTT cue span end tag</a> "<code>ruby</code>".</li>
</ol>

<p class="note">Cue positioning controls the positioning of the baseline text, not the ruby
text.</p>

<p>A <dfn>WebVTT cue voice span</dfn> consists of the following components, in the order given:</p>

<ol>
 <li>A <a>WebVTT cue span start tag</a> "<code>v</code>" that requires an annotation; the annotation
 represents the name of the voice.</li>
 <li><a>WebVTT cue internal text</a>.</li>
 <li>A <a>WebVTT cue span end tag</a> "<code>v</code>". If this <a>WebVTT cue voice span</a> is the
 only <a lt="WebVTT cue components">component</a> of its <a>WebVTT cue text</a> sequence, then the
 end tag may be omitted for brevity.</li>
</ol>

<p>A <dfn>WebVTT cue language span</dfn> consists of the following components, in the order
given:</p>

<ol>
 <li>A <a>WebVTT cue span start tag</a> "<code>lang</code>" that requires an annotation; the
 annotation represents the language of the following component, and must be a valid BCP 47 language
 tag. [[!BCP47]]</li>
 <li><a>WebVTT cue internal text</a>.</li>
 <li>A <a>WebVTT cue span end tag</a> "<code>lang</code>".</li>
</ol>

<p>A <dfn>WebVTT cue span start tag</dfn> has a <var>tag name</var> and either <!--allows,-->
requires<!--,--> or disallows an annotation, and consists of the following components, in the order
given:</p>

<ol>

 <li>A U+003C LESS-THAN SIGN character (&lt;).</li>

 <li>The <var>tag name</var>.</li>

 <li>
  Zero or more occurrences of the following sequence:

  <ol>

   <li>U+002E FULL STOP character (.)</li>

   <li>One or more characters other than U+0009 CHARACTER TABULATION (tab) characters, U+000A LINE
   FEED (LF) characters, U+000D CARRIAGE RETURN (CR) characters, U+0020 SPACE characters, U+0026
   AMPERSAND characters (&amp;), U+003C LESS-THAN SIGN characters (&lt;), U+003E GREATER-THAN SIGN
   characters (>), and U+002E FULL STOP characters (.), representing a class that describes the cue
   span's significance.</li>

  </ol>

 </li>

 <li>
  <p>If the start tag requires an annotation: a U+0020 SPACE character or a U+0009 CHARACTER
  TABULATION (tab) character, followed by one or more of the following components, the concatenation
  of their representations having a value that contains at least one character other than U+0020
  SPACE and U+0009 CHARACTER TABULATION (tab) characters:</p>

  <ul>
   <li><a>WebVTT cue span start tag annotation text</a>, representing the text of the
   annotation.</li>
   <li>A <a>WebVTT cue amp escape</a>, representing a "&amp;" character in the text of the
   annotation.</li>
   <li>A <a>WebVTT cue lt escape</a>, representing a "&lt;" character in the text of the
   annotation.</li>
   <li>A <a>WebVTT cue gt escape</a>, representing a "&gt;" character in the text of the
   annotation.</li>
   <li>A <a>WebVTT cue lrm escape</a>, representing a U+200E LEFT-TO-RIGHT MARK Unicode
   bidirectional formatting character in the text of the cue.</li>
   <li>A <a>WebVTT cue rlm escape</a>, representing a U+200F RIGHT-TO-LEFT MARK Unicode
   bidirectional formatting character in the text of the cue.</li>
   <li>A <a>WebVTT cue nbsp escape</a>, representing a U+00A0 NO-BREAK SPACE character in the text
   of the cue.</li>
  </ul>

 </li>

 <li>A U+003E GREATER-THAN SIGN character (&gt;).</li>

</ol>

<p>A <dfn>WebVTT cue span end tag</dfn> has a <var>tag name</var> and consists of the following
components, in the order given:</p>

<ol>
 <li>A U+003C LESS-THAN SIGN character (&lt;).</li>
 <li>U+002F SOLIDUS character (/).</li>
 <li>The <var>tag name</var>.</li>
 <li>A U+003E GREATER-THAN SIGN character (&gt;).</li>
</ol>

<p>A <dfn>WebVTT cue timestamp</dfn> consists of a U+003C LESS-THAN SIGN character (&lt;), followed
by a <a>WebVTT timestamp</a> representing the time that the given point in the cue becomes active,
followed by a U+003E GREATER-THAN SIGN character (>). The time represented by the <a>WebVTT
timestamp</a> must be greater than the times represented by any previous <a lt="WebVTT cue
timestamp">WebVTT cue timestamps</a> in the cue, as well as greater than the cue's start time
offset, and less than the cue's end time offset.</p>

<p>A <dfn>WebVTT cue text span</dfn> consists of one or more characters other than U+000A LINE FEED
(LF) characters, U+000D CARRIAGE RETURN (CR) characters, U+0026 AMPERSAND characters (&amp;), and
U+003C LESS-THAN SIGN characters (&lt;).</p>

<p><dfn>WebVTT cue span start tag annotation text</dfn> consists of one or more characters other
than U+000A LINE FEED (LF) characters, U+000D CARRIAGE RETURN (CR) characters, U+0026 AMPERSAND
characters (&amp;), and U+003E GREATER-THAN SIGN characters (&gt;).</p>

<p>A <dfn>WebVTT cue amp escape</dfn> is the five character string "<code>&amp;amp;</code>".</p>

<p>A <dfn>WebVTT cue lt escape</dfn> is the four character string "<code>&amp;lt;</code>".</p>

<p>A <dfn>WebVTT cue gt escape</dfn> is the four character string "<code>&amp;gt;</code>".</p>

<p>A <dfn>WebVTT cue lrm escape</dfn> is the five character string "<code>&amp;lrm;</code>".</p>

<p>A <dfn>WebVTT cue rlm escape</dfn> is the five character string "<code>&amp;rlm;</code>".</p>

<p>A <dfn>WebVTT cue nbsp escape</dfn> is the six character string "<code>&amp;nbsp;</code>".</p>


<h3 id=region-definition>WebVTT region definition</h3>

<p>A <a>WebVTT cue settings list</a> may contain a reference to a <a>WebVTT region</a>. To define a
region, a <a>WebVTT region metadata header</a> is specified.</p>

<p>A <dfn>WebVTT region metadata header</dfn> is a special kind of <a>WebVTT metadata header</a>
where both of the following apply:</p>

<ul>
 <li>The <a>WebVTT metadata header name</a> is the string "<code>Region</code>".</li>
 <li>The <a>WebVTT metadata header value</a> is a <a>WebVTT region setting list</a>.</li>
</ul>

<p>The <dfn>WebVTT region setting list</dfn> of a <a>WebVTT region metadata header</a> consists of
zero or more of the following components, in any order, separated from each other by one or more
U+0020 SPACE characters or U+0009 CHARACTER TABULATION (tab) characters. Each component must not be
included more than once per <a>WebVTT region setting list</a> string.</p>

<ul>
 <li>A <a>WebVTT region identifier setting</a>.</li>
 <li>A <a>WebVTT region width setting</a>.</li>
 <li>A <a>WebVTT region lines setting</a>.</li>
 <li>A <a>WebVTT region anchor setting</a>.</li>
 <li>A <a>WebVTT region viewport anchor setting</a>.</li>
 <li>A <a>WebVTT region scroll setting</a>.</li>
</ul>

<p class="note">The <a>WebVTT region setting list</a> gives configuration options regarding the
dimensions, positioning and anchoring of the region. For example, it allows a group of cues within a
region to be anchored in the center of the region and the center of the video viewport. In this
example, when the font size grows, the region grows uniformly in all directions from the center.</p>

<p>A <dfn>WebVTT region identifier setting</dfn> consists of the following components, in the order
given:</p>
<ol>
 <li><p>The string "<code>id</code>".</p></li>
 <li><p>A U+003D EQUALS SIGN character (=).</p></li>
 <li><p>An arbitrary string of one or more characters other than U+0020 SPACE or U+0009 CHARACTER
 TABULATION character. The string must not contain the substring "<code>--></code>" (U+002D
 HYPHEN-MINUS, U+002D HYPHEN-MINUS, U+003E GREATER-THAN SIGN).</p></li>
</ol>

<p>A <a>WebVTT region identifier setting</a> must be unique amongst all the <a lt="WebVTT region
identifier setting">WebVTT region identifier settings</a> of all <a lt="WebVTT region">WebVTT
regions</a> of a <a>WebVTT file</a>.</p>

<p class ="note">The <a>WebVTT region identifier setting</a> gives a name to the region so it can be
referenced by the cues that belong to the region.</p>

<p>A <dfn>WebVTT region width setting</dfn> consists of the following components, in the order
given:</p>
<ol>
 <li><p>The string "<code>width</code>".</p></li>
 <li><p>A U+003D EQUALS SIGN character (=).</p></li>
 <li><p>A <a>WebVTT percentage</a>.</p></li>
</ol>
<p class ="note">The <a>WebVTT region width setting</a> provides a fixed width as a percentage of
the video width for the region into which cues are rendered and based on which alignment is
calculated.</p>

<p>A <dfn>WebVTT region lines setting</dfn> consists of the following components, in the order
given:</p>
<ol>
 <li><p>The string "<code>lines</code>".</p></li>
 <li><p>A U+003D EQUALS SIGN character (=).</p></li>
 <li><p>One or more <a>ASCII digits</a>.</p></li>
</ol>
<p class ="note">The <a>WebVTT region lines setting</a> provides a fixed height as a number of lines
for the region into which cues are rendered. As such, it defines the height of the roll-up region if
it is a scroll region.</p>

<p>A <dfn>WebVTT region anchor setting</dfn> consists of the following components, in the order
given:</p>
<ol>
 <li><p>The string "<code>regionanchor</code>".</p></li>
 <li><p>A U+003D EQUALS SIGN character (=).</p></li>
 <li><p>A <a>WebVTT percentage</a>.</p></li>
 <li><p>A U+002C COMMA character (,).</p></li>
 <li><p>A <a>WebVTT percentage</a>.</p></li>
</ol>
<p class ="note">The <a>WebVTT region anchor setting</a> provides a tuple of two percentages that
specify the point within the region box that is fixed in location. The first percentage measures the
x-dimension and the second percentage y-dimension from the top left corner of the region box. If no
<a>WebVTT region anchor setting</a> is given, the anchor defaults to 0%, 100% (i.e. the bottom left
corner).</p>

<p>A <dfn>WebVTT region viewport anchor setting</dfn> consists of the following components, in the
order given:</p>
<ol>
 <li><p>The string "<code>viewportanchor</code>".</p></li>
 <li><p>A U+003D EQUALS SIGN character (=).</p></li>
 <li><p>A <a>WebVTT percentage</a>.</p></li>
 <li><p>A U+002C COMMA character (,).</p></li>
 <li><p>A <a>WebVTT percentage</a>.</p></li>
</ol>
<p class ="note">The <a>WebVTT region viewport anchor setting</a> provides a tuple of two
percentages that specify the point within the video viewport that the region anchor point is
anchored to. The first percentage measures the x-dimension and the second percentage measures the
y-dimension from the top left corner of the video viewport box. If no viewport anchor is given, it
defaults to 0%, 100% (i.e. the bottom left corner).</p>

<p class ="note">For browsers, the region maps to an absolute positioned CSS box relative to the
video viewport, i.e. there is a relative positioned box that represents the video viewport relative
to which the regions are absolutely positioned. Overflow is hidden.</p>

<p>A <dfn>WebVTT region scroll setting</dfn> consists of the following components, in the order
given:</p>
<ol>
 <li><p>The string "<code>scroll</code>".</p></li>
 <li><p>A U+003D EQUALS SIGN character (=).</p></li>
 <li><p>The string "<code>up</code>".</p></li>
</ol>
<p class ="note">The <a>WebVTT region scroll setting</a> specifies whether cues rendered into the
region are allowed to move out of their initial rendering place and roll up, i.e. move towards the
top of the video viewport. If the scroll setting is omitted, cues do not move from their rendered
position.</p>

<p class="note">Cues are added to a region one line at a time below existing cue lines. When an
existing rendered cue line is removed, and it was above another already rendered cue line, that cue
line moves into its space, thus scrolling in the given direction. If there is not enough space for a
new cue line to be added to a region, the top-most cue line is pushed off the visible region (thus
slowly becoming invisible as it moves into overflow:hidden). This eventually makes space for the new
cue line and allows it to be added.</p>

<p class="note">When there is no scroll direction, cue lines are added in the empty line closest to
the line in the bottom of the region. If no empty line is available, the oldest line is
replaced.</p>


<h3 id=cue-settings>WebVTT cue settings</h3>

<p>A <a>WebVTT cue settings list</a> consists of zero or more of the following settings. Each
setting must not be included more than once per <a>WebVTT cue settings list</a>.</p>

<ul class="brief">
 <li>A <a>WebVTT vertical text cue setting</a>.</li>
 <li>A <a>WebVTT line cue setting</a>.</li>
 <li>A <a>WebVTT position cue setting</a>.</li>
 <li>A <a>WebVTT size cue setting</a>.</li>
 <li>A <a>WebVTT alignment cue setting</a>.</li>
 <li>A <a>WebVTT region cue setting</a>.</li>
</ul>

<p class="note">A <a>WebVTT cue settings list</a> gives configuration options regarding the position
and alignment of the cue box and the cue text within. For example, it allows a cue box to be aligned
to the left or positioned at the top right with the cue text within middle aligned.</p>

<p>A <dfn>WebVTT vertical text cue setting</dfn> is a <a>WebVTT cue setting</a> that consists of the
following components, in the order given:</p>

<ol>
 <li>The string "<code>vertical</code>" as the <a>WebVTT cue setting name</a>.</li>
 <li><p>A U+003A COLON character (:).</p></li>
 <li>One of the following strings as the <a>WebVTT cue setting value</a>: "<code>rl</code>",
 "<code>lr</code>".</li>
</ol>

<p class="note">A <a>WebVTT vertical text cue setting</a> configures the cue to use vertical text
layout rather than horizontal text layout. Vertical text layout is sometimes used in Japanese, for
example. The default is horizontal layout.</p>

<p>A <dfn>WebVTT line cue setting</dfn> consists of the following components, in the order
given:</p>

<ol>
 <li><p>The string "<code>line</code>" as the <a>WebVTT cue setting name</a>.</p></li>
 <li><p>A U+003A COLON character (:).</p></li>
 <li>
  As the <a>WebVTT cue setting value</a>:
  <ol>
   <li>
    an offset value, either:
    <dl>
     <dt>To represent a specific offset relative to the video viewport</dt>
     <dd>
      <p>A <a>WebVTT percentage</a>.</p>
     </dd>
     <dt>Or to represent a line number</dt>
     <dd>
      <ol>
       <li>Optionally a U+002D HYPHEN-MINUS character (-).</li>
       <li>One or more <a>ASCII digits</a>.</li>
      </ol>
     </dd>
    </dl>
   </li>
   <li>
    An optional alignment value consisting of the following components:
    <ol>
     <li>A U+002C COMMA character (,).</li>
     <li>One of the following strings: "<code>start</code>", "<code>middle</code>",
     "<code>end</code>"</li>
    </ol>
   </li>
  </ol>
 </li>
</ol>

<p class="note">A <a>WebVTT line cue setting</a> configures the offset of the cue box from the video
viewport's edge in the direction opposite to the <a lt="WebVTT cue writing direction">writing
direction</a>. For horizontal cues, this is the vertical offset from the top of the video viewport.
The offset is for the <a lt="WebVTT cue line start alignment">start</a>, <a lt="WebVTT cue line
middle alignment">middle</a>, or <a lt="WebVTT cue line end alignment">end</a> of the cue box,
depending on the <a>WebVTT cue line alignment</a> value - <a lt="WebVTT cue line start
alignment">start</a> by default. The offset can be given either as a percentage of the video
dimension or as a line number. Line numbers are based on the size of the first line of the cue.
Positive line numbers count from the start of the video viewport (the first line is numbered 0),
negative line numbers from the end of the viewport (the last line is numbered &#x2212;1).</p>

<p>A <dfn>WebVTT position cue setting</dfn> consists of the following components, in the order
given:</p>

<ol>
 <li><p>The string "<code>position</code>" as the <a>WebVTT cue setting name</a>.</p></li>
 <li><p>A U+003A COLON character (:).</p></li>
 <li>
  As the <a>WebVTT cue setting value</a>:
  <ol>
   <li>a position value consisting of: a <a>WebVTT percentage</a>.</li>
   <li>
    an optional alignment value consisting of:
    <ol>
     <li>A U+002C COMMA character (,).</li>
     <li>One of the following strings: "<code>start</code>", "<code>middle</code>",
     "<code>end</code>"</li>
    </ol>
   </li>
  </ol>
 </li>
</ol>

<p class="note">A <a>WebVTT position cue setting</a> configures the indent position of the <a
lt="WebVTT cue box">cue box</a> in the direction orthogonal to the <a>WebVTT line cue setting</a>.
For horizontal cues, this is the horizontal position. The cue position is given as a percentage of
the video viewport. The positioning is for the <a lt="WebVTT cue position start
alignment">start</a>, <a lt="WebVTT cue position middle alignment">middle</a>, or <a lt="WebVTT cue
position end alignment">end</a> of the cue box, depending on the cue's <a lt="cue computed position
alignment">computed position alignment</a>, which is overridden by the <a>WebVTT position cue
setting</a>.</p>

<p>A <dfn>WebVTT size cue setting</dfn> consists of the following components, in the order
given:</p>

<ol>
 <li><p>The string "<code>size</code>" as the <a>WebVTT cue setting name</a>.</p></li>
 <li><p>A U+003A COLON character (:).</p></li>
 <li><p>As the <a>WebVTT cue setting value</a>: a <a>WebVTT percentage</a>.</p></li>
</ol>

<p class="note">A <a>WebVTT size cue setting</a> configures the size of the <a lt="WebVTT cue
box">cue box</a> in the same direction as the <a>WebVTT position cue setting</a>. For horizontal
cues, this is the width of the <a lt="WebVTT cue box">cue box</a>. It is given as a percentage of
the width of the viewport.</p>

<p>A <dfn>WebVTT alignment cue setting</dfn> consists of the following components, in the order
given:</p>

<ol>
 <li><p>The string "<code>align</code>" as the <a>WebVTT cue setting name</a>.</p></li>
 <li><p>A U+003A COLON character (:).</p></li>
 <li>One of the following strings as the <a>WebVTT cue setting value</a>: "<code>start</code>",
 "<code>middle</code>", "<code>end</code>", "<code>left</code>", "<code>right</code>"</li>
</ol>

<p class="note">A <a>WebVTT alignment cue setting</a> configures the alignment of the text within
the cue. The keywords are relative to the text direction; for left-to-right English text,
"<code>start</code>" means left-aligned.</p>

<p>A <dfn>WebVTT region cue setting</dfn> consists of the following components, in the order
given:</p>

<ol>
 <li><p>The string "<code>region</code>" as the <a>WebVTT cue setting name</a>.</p></li>
 <li><p>A U+003A COLON character (:).</p></li>
 <li><p>As the <a>WebVTT cue setting value</a>: an arbitrary string of one or more characters other
 than U+0020 SPACE or U+0009 CHARACTER TABULATION character. The string must not contain the
 substring "<code>--></code>" (U+002D HYPHEN-MINUS, U+002D HYPHEN-MINUS, U+003E GREATER-THAN
 SIGN).</p></li>
</ol>

<p class ="note">A <a>WebVTT region cue setting</a> configures a cue to become part of a region by
referencing the region's identifier unless the cue has a <a lt="WebVTT vertical text cue
setting">"vertical"</a>, <a lt="WebVTT line cue setting">"line"</a> or <a lt="WebVTT size cue
setting">"size"</a> cue setting. If a cue is part of a region, its cue settings for <a lt="WebVTT
position cue setting">"position"</a> and <a lt="WebVTT alignment cue setting">"align"</a> are
applied to the line boxes in the cue relative to the region box.</p>


<h3 id=properties-of-cue-sequences>Properties of cue sequences</h3>


<h4 id=file-using-only-nested-cues>WebVTT file using only nested cues</h4>

<p>A <a>WebVTT file</a> whose cues all follow the following rules is said to be a <dfn>WebVTT file
using only nested cues</dfn>:</p>

<p>given any two cues <var>cue1</var> and <var>cue2</var> with start and end time offsets <var>(x1,
y1)</var> and <var>(x2, y2)</var> respectively,</p>

<ul>
 <li>either <var>cue1</var> lies fully within <var>cue2</var>, i.e. <var>x1 >= x2</var> and <var>y1
 &lt;= y2</var></li>
 <li>or <var>cue1</var> fully contains <var>cue2</var>, i.e. <var>x1 &lt;= x2</var> and <var>y1 >=
 y2</var>.</li>
</ul>

<div class="example">

 <p>The following example matches this definition:</p>

 <pre>
 WEBVTT

 00:00.000 --> 01:24.000
 Introduction

 00:00.000 --> 00:44.000
 Topics

 00:44.000 --> 01:19.000
 Presenters

 01:24.000 --> 05:00.000
 Scrolling Effects

 01:35.000 --> 03:00.000
 Achim's Demo

 03:00.000 --> 05:00.000
 Timeline Panel
 </pre>

 <p>Notice how you can express the cues in this WebVTT file as a tree structure:</p>

 <ul>
  <li>
   WebVTT file
   <ul>
    <li>
     Introduction
     <ul>
      <li>Topics</li>
      <li>Presenters</li>
     </ul>
    </li>
    <li>
     Scrolling Effects
     <ul>
      <li>Achim's Demo</li>
      <li>Timeline Panel</li>
     </ul>
    </li>
   </ul>
  </li>
 </ul>

 <p>If the file has cues that can't be expressed in this fashion, then they don't match the
 definition of a <a>WebVTT file using only nested cues</a>. For example:</p>

 <pre>
 WEBVTT

 00:00.000 --> 01:00.000
 The First Minute

 00:30.000 --> 01:30.000
 The Final Minute
 </pre>

 <p>In this ninety-second example, the two cues partly overlap, with the first ending before the
 second ends and the second starting before the first ends. This therefore is not a <a>WebVTT file
 using only nested cues</a>.</p>

</div>


<h3 id=types-of-webvtt-files>Types of WebVTT files</h3>

<p>The syntax definition of WebVTT files allows authoring of a wide variety of WebVTT files with a
mix of cues. However, only a small subset of WebVTT file types are typically authored.</p>

<p>Conformance checkers, when validating <a>WebVTT</a> files, may offer to restrict syntax checking
for validating these types.</p>


<h4 id=file-using-metadata-content>WebVTT file using metadata content</h4>

<p>A <a>WebVTT file</a> whose cues all have a <a>cue payload</a> that is <a>WebVTT metadata text</a>
is said to be a <dfn>WebVTT file using metadata content</dfn>.</p>


<h4 id=file-using-chapter-title-text>WebVTT file using chapter title text</h4>

<p><dfn>WebVTT chapter title text</dfn> is <a>WebVTT cue text</a> that makes use only of zero or
more of the following components, each optionally separated from the next by a <a>WebVTT line
terminator</a>:</p>

<ul>
 <li><a>WebVTT cue text span</a></li>
 <li><a>WebVTT cue amp escape</a></li>
 <li><a>WebVTT cue lt escape</a></li>
 <li><a>WebVTT cue gt escape</a></li>
 <li><a>WebVTT cue lrm escape</a></li>
 <li><a>WebVTT cue rlm escape</a></li>
 <li><a>WebVTT cue nbsp escape</a></li>
</ul>

<p>A <dfn>WebVTT file using chapter title text</dfn> is a <a>WebVTT file using only nested cues</a>
whose cues all have a <a>cue payload</a> that is <a>WebVTT chapter title text</a>.</p>


<h4 id=file-using-cue-text>WebVTT file using cue text</h4>

<p>A <a>WebVTT file</a> whose cues all have a <a>cue payload</a> that is <a>WebVTT cue text</a> is
said to be a <dfn>WebVTT file using cue text</dfn>.</p>


<h2 id=parsing>Parsing</h2>


<h3 id=file-parsing>WebVTT file parsing</h3>

<p>A <dfn>WebVTT parser</dfn>, given an input byte stream and a <a>text track list of cues</a>
<var>output</var>, must decode the byte stream using the <a lt="UTF-8 decode">UTF-8 decode</a>
algorithm, and then must parse the resulting string according to the <a>WebVTT parser algorithm</a>
below. This results in <a lt="WebVTT cue">WebVTT cues</a> being added to <var>output</var>.
[[!RFC3629]]</p>

<p>A <a>WebVTT parser</a>, specifically its conversion and parsing steps, is typically run
asynchronously, with the input byte stream being updated incrementally as the resource is
downloaded; this is called an <dfn>incremental WebVTT parser</dfn>.</p>

<p>A <a>WebVTT parser</a> verifies a file signature before parsing the provided byte stream. If the
stream lacks this WebVTT file signature, then the parser aborts.</p>

<p>The <dfn>WebVTT parser algorithm</dfn> is as follows:</p>

<ol>

 <li>
  <p>Let <var>input</var> be the string being parsed, after conversion to Unicode, and with the
  following transformations applied:</p>

  <ul>

   <li><p>Replace all U+0000 NULL characters by U+FFFD REPLACEMENT CHARACTERs.</p></li>

   <li><p>Replace each U+000D CARRIAGE RETURN U+000A LINE FEED (CRLF) character pair by a single
   U+000A LINE FEED (LF) character.</p></li>

   <li><p>Replace all remaining U+000D CARRIAGE RETURN characters by U+000A LINE FEED (LF)
   characters.</p></li>

  </ul>

 </li>

 <li><p>Let <var>position</var> be a pointer into <var>input</var>, initially pointing at the start
 of the string. In an <a>incremental WebVTT parser</a>, when this algorithm (or further algorithms
 that it uses) moves the <var>position</var> pointer, the user agent must wait until appropriate
 further characters from the byte stream have been added to <var>input</var> before moving the
 pointer, so that the algorithm never reads past the end of the <var>input</var> string. Once the
 byte stream has ended, and all characters have been added to <var>input</var>, then the
 <var>position</var> pointer may, when so instructed by the algorithms, be moved past the end of
 <var>input</var>.</p></li>

 <li>Let <var>line</var> be a string variable. Unset the <var>already collected line</var>
 flag.</li>

 <!-- SIGNATURE CHECK -->

 <li><p><a spec=html>Collect a sequence of characters</a> that are <em>not</em> U+000A LINE FEED
 (LF) characters. Let <var>line</var> be those characters, if any.</p></li>

 <li><p>If <var>line</var> is less than six characters long, then abort these steps. The file does
 not start with the correct <a>WebVTT file</a> signature and was therefore not successfully
 processed.</p></li>

 <li><p>If <var>line</var> is exactly six characters long but does not exactly equal
 "<code>WEBVTT</code>", then abort these steps. The file does not start with the correct <a>WebVTT
 file</a> signature and was therefore not successfully processed.</p></li>

 <li><p>If <var>line</var> is more than six characters long but the first six characters do not
 exactly equal "<code>WEBVTT</code>", or the seventh character is neither a U+0020 SPACE character
 nor a U+0009 CHARACTER TABULATION (tab) character, then abort these steps. The file does not start
 with the correct <a>WebVTT file</a> signature and was therefore not successfully
 processed.</p></li>

 <li><p>If <var>position</var> is past the end of <var>input</var>, then abort these steps. The file
 was successfully processed, but it contains no useful data and so no <a lt="WebVTT cue">WebVTT
 cues</a> were added to <var>output</var>.</p></li>

 <li><p>The character indicated by <var>position</var> is a U+000A LINE FEED (LF) character. Advance
 <var>position</var> to the next character in <var>input</var>.</p></li>

 <li><p><i>Header</i>: <a spec=html>Collect a sequence of characters</a> that are <em>not</em>
 U+000A LINE FEED (LF) characters. Let <var>line</var> be those characters, if any.</p></li>

 <!-- METADATA HEADER PARSING -->

 <li><p>Let <var>regions</var> be a <a>text track list of regions</a>.</p></li>

 <li>
  <i>Metadata header loop</i>: If <var>line</var> is not the empty string, run the following
  substeps:

  <ol>

   <li><p><i>Metadata header creation</i>: Let <var>metadata</var> be a new <a>WebVTT metadata
   header</a>.</p></li>

   <li><p>Let <a lt="WebVTT metadata header name">metadata's name</a> be the empty string.</p></li>

   <li><p>Let <a lt="WebVTT metadata header value">metadata's value</a> be the empty
   string.</p></li>

   <li><p>If <var>line</var> contains the character ":" (A U+003A COLON), then set <a lt="WebVTT
   metadata header name">metadata's name</a> to the substring of <var>line</var> before the first
   ":" character and <a lt="WebVTT metadata header value">metadata's value</a> to the substring
   after this character.</p></li>

   <li>
    <p>If <a lt="WebVTT metadata header name">metadata's name</a> equals "Region":</p>

    <ol>
     <li><i>Region creation</i>: Let <var>region</var> be a new <a>WebVTT region</a>.</li>
     <li>Let <var>region</var>'s <a lt="WebVTT region identifier">identifier</a> be the empty
     string.</li>
     <li>Let <var>region</var>'s <a lt="WebVTT region width">width</a> be 100.</li>
     <li>Let <var>region</var>'s <a lt="WebVTT region lines">lines</a> be 3.</li>
     <li>Let <var>region</var>'s <a lt="WebVTT region anchor">anchor point</a> be (0,100).</li>
     <li>Let <var>region</var>'s <a lt="WebVTT region viewport anchor">viewport anchor point</a> be
     (0,100).</li>
     <li>Let <var>region</var>'s <a lt="WebVTT region scroll">scroll value</a> be <a lt="WebVTT
     region scroll none">NONE</a>.</li>
     <li><a>Collect WebVTT region settings</a> from <a lt="WebVTT metadata header value">metadata's
     value</a> using <var>region</var> for the results.</li>
     <li><i>Region processing</i>: Construct a <a>WebVTT Region Object</a> from
     <var>region</var>.</li>
     <li>Append <var>region</var> to the <a>text track list of regions</a> <var>regions</var>.</li>
    </ol>
   </li>
  </ol>
 </li>

 <!-- FIXME: right now ignores all WebVTT metadata headers that don't specify regions. -->

 <li><p>If <var>position</var> is past the end of <var>input</var>, then jump to the step labeled
 <i>end</i>.</p></li>

 <li><p>The character indicated by <var>position</var> is a U+000A LINE FEED (LF) character. Advance
 <var>position</var> to the next character in <var>input</var>.</p></li>

 <li><p>If <var>line</var> contains the three-character substring "<code>--></code>" (U+002D
 HYPHEN-MINUS, U+002D HYPHEN-MINUS, U+003E GREATER-THAN SIGN), then set the <var>already collected
 line</var> flag and jump to the step labeled <i>cue loop</i>.</p></li>

 <li><p>If <var>line</var> is not the empty string, then jump back to the step labeled
 <i>header</i>.</p></li>

 <li><p><i>Cue loop</i>: If the <var>already collected line</var> flag is set, then jump to the step
 labeled <var>cue creation</var>.</p></li>

 <li><p><a spec=html>Collect a sequence of characters</a> that are U+000A LINE FEED (LF)
 characters.</p></li>

 <li><p><a spec=html>Collect a sequence of characters</a> that are <em>not</em> U+000A LINE FEED
 (LF) characters. Let <var>line</var> be those characters, if any.</p></li>

 <li><p>If <var>line</var> is the empty string, then jump to the step labeled <i>end</i>. (In such a
 case, <var>position</var> is also forcibly past the end of <var>input</var><!-- since we've just
 collected newlines, so we have none of those, and we've failed to collect anything that's not a
 newline, so we have none of that either, meaning we have nothing. -->.)</p></li>

 <li>
  <p><i>Cue creation</i>: Let <var>cue</var> be a new <a>WebVTT cue</a> and initialize it as
  follows:</p>

  <ol>
   <li><p>Let <var>cue</var>'s <a>text track cue identifier</a> be the empty string.</p></li>

   <li><p>Let <var>cue</var>'s <a>text track cue pause-on-exit flag</a> be false.</p></li>

   <li><p>Let <var>cue</var>'s <a>WebVTT cue region</a> be null.</p></li>

   <li><p>Let <var>cue</var>'s <a>WebVTT cue writing direction</a> be <a lt="WebVTT cue horizontal
   writing direction">horizontal</a>.</p></li>

   <li><p>Let <var>cue</var>'s <a>WebVTT cue snap-to-lines flag</a> be true.</p></li>

   <li><p>Let <var>cue</var>'s <a>WebVTT cue line</a> be <a lt="WebVTT cue line
   automatic">auto</a>.</p></li>

   <li><p>Let <var>cue</var>'s <a>WebVTT cue line alignment</a> be <a lt="WebVTT cue line start
   alignment">start alignment</a>.</p></li>

   <li><p>Let <var>cue</var>'s <a>WebVTT cue position</a> be <a lt="WebVTT cue automatic
   position">auto</a>.</p></li>

   <li><p>Let <var>cue</var>'s <a>WebVTT cue position alignment</a> be <a lt="WebVTT cue position
   automatic alignment">auto</a>.</p></li>

   <li><p>Let <var>cue</var>'s <a>WebVTT cue size</a> be 100.</p></li>

   <li><p>Let <var>cue</var>'s <a>WebVTT cue text alignment</a> be <a lt="WebVTT cue middle
   alignment">middle alignment</a>.</p></li>

   <li><p>Let <var>cue</var>'s <a>text track cue text</a> be the empty string.</p></li>
  </ol>

 </li>

 <li><p>If <var>line</var> contains the three-character substring "<code>--></code>" (U+002D
 HYPHEN-MINUS, U+002D HYPHEN-MINUS, U+003E GREATER-THAN SIGN), then jump to the step labeled
 <i>timings</i> below.</p></li>

 <li><p>Let <var>cue</var>'s <a>text track cue identifier</a> be <var>line</var>.</p></li>

 <li><p>If <var>position</var> is past the end of <var>input</var>, then discard <var>cue</var> and
 jump to the step labeled <i>end</i>.</p></li>

 <li><p>If the character indicated by <var>position</var> is a U+000A LINE FEED (LF) character,
 advance <var>position</var> to the next character in <var>input</var>.</p></li>

 <li><p><a spec=html>Collect a sequence of characters</a> that are <em>not</em> U+000A LINE FEED
 (LF) characters. Let <var>line</var> be those characters, if any.</p></li>

 <li><p>If <var>line</var> is the empty string, then discard <var>cue</var> and jump to the step
 labeled <i>cue loop</i>.</p></li>

 <li><p><i>Timings</i>: Unset the <var>already collected line</var> flag.</p></li>

 <li><p><a>Collect WebVTT cue timings and settings</a> from <var>line</var> using <var>regions</var>
 for <var>cue</var>. If that fails, jump to the step labeled <i>bad cue</i>.</p></li>

 <li><p>Let <var>cue text</var> be the empty string.</p></li>

 <li><p><i>Cue text loop</i>: If <var>position</var> is past the end of <var>input</var>, then jump
 to the step labeled <i>cue text processing</i>.</p></li>

 <li><p>If the character indicated by <var>position</var> is a U+000A LINE FEED (LF) character,
 advance <var>position</var> to the next character in <var>input</var>.</p></li>

 <li><p><a spec=html>Collect a sequence of characters</a> that are <em>not</em> U+000A LINE FEED
 (LF) characters. Let <var>line</var> be those characters, if any.</p></li>

 <li><p>If <var>line</var> is the empty string, then jump to the step labeled <i>cue text
 processing</i>.</p></li>

 <li><p>If <var>line</var> contains the three-character substring "<code>--></code>" (U+002D
 HYPHEN-MINUS, U+002D HYPHEN-MINUS, U+003E GREATER-THAN SIGN), then set the <var>already collected
 line</var> flag and jump to the step labeled <i>cue text processing</i>.</p></li>

 <li><p>If <var>cue text</var> is not empty, append a U+000A LINE FEED (LF) character to <var>cue
 text</var>.</p></li>

 <li><p>Let <var>cue text</var> be the concatenation of <var>cue text</var> and
 <var>line</var>.</p></li>

 <li><p>Return to the step labeled <i>cue text loop</i>.</p></li>

 <li><p><i>Cue text processing</i>: Let the <a>text track cue text</a> of <var>cue</var> be <var>cue
 text</var>, and let the <a>rules for extracting the chapter title</a> be the <a>WebVTT rules for
 extracting the chapter title</a>.</p></li>

 <li><p>Add <var>cue</var> to the <a>text track list of cues</a> <var>output</var>.</p></li>

 <li><p>Jump to the step labeled <i>cue loop</i>.</p></li>

 <li><p><i>Bad cue</i>: Discard <var>cue</var>.</p></li>

 <li><p><i>Bad cue loop</i>: If <var>position</var> is past the end of <var>input</var>, then jump
 to the step labeled <i>end</i>.</p></li>

 <li><p>If the character indicated by <var>position</var> is a U+000A LINE FEED (LF) character,
 advance <var>position</var> to the next character in <var>input</var>.</p></li>

 <li><p><a spec=html>Collect a sequence of characters</a> that are <em>not</em> U+000A LINE FEED
 (LF) characters. Let <var>line</var> be those characters, if any.</p></li>

 <li><p>If <var>line</var> contains the three-character substring "<code>--></code>" (U+002D
 HYPHEN-MINUS, U+002D HYPHEN-MINUS, U+003E GREATER-THAN SIGN), then set the <var>already collected
 line</var> flag and jump to the step labeled <i>cue loop</i>.</p></li>

 <li><p>If <var>line</var> is the empty string, then jump to the step labeled <i>cue
 loop</i>.</p></li>

 <li><p>Otherwise, jump to the step labeled <i>bad cue loop</i>.</p></li>

 <li><p><i>End</i>: The file has ended. Abort these steps. The <a>WebVTT parser</a> has finished.
 The file was successfully processed.</p></li>

</ol>


<h3 id=region-settings-parsing>WebVTT region settings parsing</h3>

<p>When the <a>WebVTT parser</a> requires that the user agent <dfn>collect WebVTT region
settings</dfn> from a string <var>input</var> for a <a>text track</a>, the user agent must run the
following algorithm.</p>

<p>A <dfn>WebVTT region object</dfn> is a conceptual construct to represent a <a>WebVTT region</a>
that is used as a root node for <a lt="List of WebVTT node objects">lists of WebVTT node
objects</a>. This algorithm returns a list of <a lt="WebVTT region object">WebVTT Region
Objects</a>.</p>

<ol>
 <li><p>Let <var>settings</var> be the result of <a lt="split a string on spaces">splitting
 <var>input</var> on spaces</a>.</p></li>

 <li>
  For each token <var>setting</var> in the list <var>settings</var>, run the following substeps:

  <ol>
   <li><p>If <var>setting</var> does not contain a U+003D EQUALS SIGN character (=), or if the first
   U+003D EQUALS SIGN character (=) in <var>setting</var> is either the first or last character of
   <var>setting</var>, then jump to the step labeled <i>next setting</i>.</p></li>

   <li><p>Let <var>name</var> be the leading substring of <var>setting</var> up to and excluding the
   first U+003D EQUALS SIGN character (=) in that string.</p></li>

   <li><p>Let <var>value</var> be the trailing substring of <var>setting</var> starting from the
   character immediately after the first U+003D EQUALS SIGN character (=) in that string.</p></li>

   <li>
    <p>Run the appropriate substeps that apply for the value of <var>name</var>, as follows:</p>

    <dl>
     <dt><p>If <var>name</var> is a <a>case-sensitive</a> match for "<code>id</code>"</p></dt>
     <dd><p>Let <var>region</var>'s <a lt="WebVTT region identifier">identifier</a> be
     <var>value</var>.</p></dd>

     <dt><p>Otherwise if <var>name</var> is a <a>case-sensitive</a> match for
     "<code>width</code>"</p></dt>
     <dd><p>If <a>parse a percentage string</a> from <var>value</var> returns a
     <var>percentage</var>, let <var>region</var>'s <a>WebVTT region width</a> be
     <var>percentage</var>.</p></dd>

     <dt>Otherwise if <var>name</var> is a <a>case-sensitive</a> match for "<code>lines</code>"</dt>
     <dd>
      <ol>
       <li><p>If <var>value</var> contains any characters other than <a>ASCII digits</a>, then jump
       to the step labeled <i>next setting</i>.</p></li>

       <li><p>Interpret <var>value</var> as an integer, and let <var>number</var> be that
       number.</p></li>

       <li><p>Let <var>region</var>'s <a>WebVTT region lines</a> be <var>number</var>.</p></li>
      </ol>
     </dd>

     <dt>Otherwise if <var>name</var> is a <a>case-sensitive</a> match for
     "<code>regionanchor</code>"</dt>
     <dd>
      <ol>
       <li><p>If <var>value</var> does not contain a U+002C COMMA character (,), then jump to the
       step labeled <i>next setting</i>.</p></li>

       <li><p>Let <var>anchorX</var> be the leading substring of <var>value</var> up to and
       excluding the first U+002C COMMA character (,) in that string.</p></li>

       <li><p>Let <var>anchorY</var> be the trailing substring of <var>value</var> starting from the
       character immediately after the first U+002C COMMA character (,) in that string.</p></li>

       <li><p>If <a>parse a percentage string</a> from <var>anchorX</var> or <a>parse a percentage
       string</a> from <var>anchorY</var> don't return a <var>percentage</var>, then jump to the
       step labeled <i>next setting</i>.</p></li>

       <li><p>Let <var>region</var>'s <a lt="WebVTT region anchor">WebVTT region anchor point</a> be
       the tuple of the <var>percentage</var> values calculated from <var>anchorX</var> and
       <var>anchorY</var>.</p></li>
      </ol>
     </dd>

     <dt>Otherwise if <var>name</var> is a <a>case-sensitive</a> match for
     "<code>viewportanchor</code>"</dt>
     <dd>
      <ol>
       <li><p>If <var>value</var> does not contain a U+002C COMMA character (,), then jump to the
       step labeled <i>next setting</i>.</p></li>

       <li><p>Let <var>viewportanchorX</var> be the leading substring of <var>value</var> up to and
       excluding the first U+002C COMMA character (,) in that string.</p></li>

       <li><p>Let <var>viewportanchorY</var> be the trailing substring of <var>value</var> starting
       from the character immediately after the first U+002C COMMA character (,) in that
       string.</p></li>

       <li><p>If <a>parse a percentage string</a> from <var>viewportanchorX</var> or <a>parse a
       percentage string</a> from <var>viewportanchorY</var> don't return a <var>percentage</var>,
       then jump to the step labeled <i>next setting</i>.</p></li>

       <li><p>Let <var>region</var>'s <a lt="WebVTT region viewport anchor">WebVTT region anchor
       point</a> be the tuple of the <var>percentage</var> values calculated from
       <var>viewportanchorX</var> and <var>viewportanchorY</var>.</p></li>
      </ol>
     </dd>

     <dt>Otherwise if <var>name</var> is a <a>case-sensitive</a> match for
     "<code>scroll</code>"</dt>
     <dd>
      <ol>
       <li><p>If <var>value</var> is a <a>case-sensitive</a> match for the string "<code>up</code>",
       then let <var>region</var>'s <a lt="WebVTT region scroll">scroll value</a> be "<a lt="WebVTT
       region scroll up">scroll up</a>".</p></li>
      </ol>
     </dd>
    </dl>
   </li>

   <li><i>Next setting</i>: Continue to the next setting, if any.</li>
  </ol>
 </li>
</ol>

<p>The rules to <dfn>parse a percentage string</dfn> are as follows. This will return either a
number in the range 0..100, or nothing. If at any point the algorithm says that it "fails", this
means that it is aborted at that point and returns nothing.</p>

<ol>
 <li><p>Let <var>input</var> be the string being parsed.</p></li>

 <li><p>If <var>input</var> contains any characters other than U+0025 PERCENT SIGN characters (%),
 U+002E DOT characters (.) and <a>ASCII digits</a>, then fail.</p></li>

 <li><p>If <var>input</var> does not contain at least one <a lt="ASCII digits">ASCII digit</a>, then
 fail.</p></li>

 <li><p>If <var>input</var> contains more than one U+002E DOT character (.), then fail.</p></li>

 <li><p>If any character in <var>input</var> other than the last character is a U+0025 PERCENT SIGN
 character (%), then fail.</p></li>

 <li><p>If the last character in <var>input</var> is not a U+0025 PERCENT SIGN character (%), then
 fail.</p></li>

 <li><p>Ignoring the trailing percent sign, interpret <var>input</var> as a real number. Let that
 number be the <var>percentage</var>.</p></li>

 <li><p>If <var>percentage</var> is outside the range 0..100, then fail.</p></li>

 <li><p>Return <var>percentage</var>.</p></li>
</ol>


<h3 id=cue-timings-and-settings-parsing>WebVTT cue timings and settings parsing</h3>

<p>When the algorithm above requires that the user agent <dfn>collect WebVTT cue timings and
settings</dfn> from a string <var>input</var> using a <a>text track list of regions</a>
<var>regions</var> for a <a>WebVTT cue</a> <var>cue</var>, the user agent must run the following
algorithm.</p>

<ol>

 <li><p>Let <var>input</var> be the string being parsed.</p></li>

 <li><p>Let <var>position</var> be a pointer into <var>input</var>, initially pointing at the start
 of the string.</p></li>

 <li><p><a>Skip whitespace</a>.</p></li>

 <li><p><a>Collect a WebVTT timestamp</a>. If that algorithm fails, then abort these steps and
 return failure. Otherwise, let <var>cue</var>'s <a>text track cue start time</a> be the collected
 time.</p></li>

 <li><p><a>Skip whitespace</a>.</p></li>

 <!-- we can't be beyond the end of the string until we've seen the arrow, since we know the arrow
 is in the string and nothing we've done so far would move us past the first "-". -->

 <li><p>If <!--<var>position</var> is beyond the end of <var>input</var> or if--> the character at
 <var>position</var> is not a U+002D HYPHEN-MINUS character (-) then abort these steps and return
 failure. Otherwise, move <var>position</var> forwards one character.</p></li>

 <li><p>If <!--<var>position</var> is beyond the end of <var>input</var> or if--> the character at
 <var>position</var> is not a U+002D HYPHEN-MINUS character (-) then abort these steps and return
 failure. Otherwise, move <var>position</var> forwards one character.</p></li>

 <li><p>If <!--<var>position</var> is beyond the end of <var>input</var> or if--> the character at
 <var>position</var> is not a U+003E GREATER-THAN SIGN character (>) then abort these steps and
 return failure. Otherwise, move <var>position</var> forwards one character.</p></li>

 <li><p><a>Skip whitespace</a>.</p></li>

 <li><p><a>Collect a WebVTT timestamp</a>. If that algorithm fails, then abort these steps and
 return failure. Otherwise, let <var>cue</var>'s <a>text track cue end time</a> be the collected
 time.</p></li>

 <li><p>Let <var>remainder</var> be the trailing substring of <var>input</var> starting at
 <var>position</var>.</p></li>

 <li><p><a>Parse the WebVTT cue settings</a> from <var>remainder</var> using <var>regions</var> for
 <var>cue</var>.</p></li>

</ol>

<p>When the user agent is to <dfn>parse the WebVTT cue settings</dfn> from a string <var>input</var>
using a <a>text track list of regions</a> <var>regions</var> for a <a>text track cue</a>
<var>cue</var>, the user agent must run the following steps:</p>

<ol>

 <li><p>Let <var>settings</var> be the result of <a lt="split a string on spaces">splitting
 <var>input</var> on spaces</a>.</p></li>

 <li>

  <p>For each token <var>setting</var> in the list <var>settings</var>, run the following
  substeps:</p>

  <ol>

   <li><p>If <var>setting</var> does not contain a U+003A COLON character (:), or if the first
   U+003A COLON character (:) in <var>setting</var> is either the first or last character of
   <var>setting</var>, then jump to the step labeled <i>next setting</i>.</p></li>

   <li><p>Let <var>name</var> be the leading substring of <var>setting</var> up to and excluding the
   first U+003A COLON character (:) in that string.</p></li>

   <li><p>Let <var>value</var> be the trailing substring of <var>setting</var> starting from the
   character immediately after the first U+003A COLON character (:) in that string.</p></li>

   <li>

    <p>Run the appropriate substeps that apply for the value of <var>name</var>, as follows:</p>

    <dl>

     <dt>If <var>name</var> is a <a>case-sensitive</a> match for "<code>region</code>"</dt>

     <dd>
      <ol>
       <li><p>Let <var>cue</var>'s <a>WebVTT cue region</a> be the last <a>WebVTT region</a> in
       <var>regions</var> whose <a>WebVTT region identifier</a> is <var>value</var>, if any, or null
       otherwise.</p></li>
      </ol>
     </dd>

     <dt>If <var>name</var> is a <a>case-sensitive</a> match for "<code>vertical</code>"</dt>

     <dd>

      <ol>

       <li><p>If <var>value</var> is a <a>case-sensitive</a> match for the string "<code>rl</code>",
       then let <var>cue</var>'s <a>WebVTT cue writing direction</a> be <a lt="WebVTT cue vertical
       growing left writing direction">vertical growing left</a>.</p></li>

       <li><p>Otherwise, if <var>value</var> is a <a>case-sensitive</a> match for the string
       "<code>lr</code>", then let <var>cue</var>'s <a>WebVTT cue writing direction</a> be <a
       lt="WebVTT cue vertical growing right writing direction">vertical growing right</a>.</p></li>

      </ol>

     </dd>

     <dt>If <var>name</var> is a <a>case-sensitive</a> match for "<code>line</code>"</dt>

     <dd>

      <ol>

       <li><p>If <var>value</var> contains a U+002C COMMA character (,), then let <var>linepos</var>
       be the leading substring of <var>value</var> up to and excluding the first U+002C COMMA
       character (,) in that string and let <var>linealign</var> be the trailing substring of
       <var>value</var> starting from the character immediately after the first U+002C COMMA
       character (,) in that string.</p></li>

       <li><p>Otherwise let <var>linepos</var> be the full <var>value</var> string and
       <var>linealign</var> be the empty string.</p></li>

       <li><p>If <var>linepos</var> does not contain at least one <a lt="ASCII digits">ASCII
       digit</a>, then jump to the step labeled <i>next setting</i>.</p></li>

       <li>
        <dl>
         <dt><p>If the last character in <var>linepos</var> is a U+0025 PERCENT SIGN character
         (%)</p></dt>

         <dd><p>If <a>parse a percentage string</a> from <var>linepos</var> doesn't fail, let
         <var>number</var> be the returned <var>percentage</var>, otherwise jump to the step labeled
         <i>next setting</i>.</p></dd>

         <dt><p>Otherwise</p></dt>

         <dd>
          <ol>
           <li><p>If <var>linepos</var> contains any characters other than U+002D HYPHEN-MINUS
           characters (-) and <a>ASCII digits</a>, then jump to the step labeled <i>next
           setting</i>.</p></li>

           <li><p>If any character in <var>linepos</var> other than the first character is a U+002D
           HYPHEN-MINUS character (-), then jump to the step labeled <i>next setting</i>.</p></li>

           <li><p>Interpret <var>linepos</var> as a (potentially signed) integer, and let
           <var>number</var> be that number.</p></li>
          </ol>
         </dd>
        </dl>
       </li>

       <li><p>Let <var>cue</var>'s <a>WebVTT cue line</a> be <var>number</var>.</p></li>

       <li><p>If the last character in <var>linepos</var> is a U+0025 PERCENT SIGN character (%),
       then let <var>cue</var>'s <a>WebVTT cue snap-to-lines flag</a> be false. Otherwise, let it be
       true.</p></li>

       <li><p>If <var>linealign</var> is a <a>case-sensitive</a> match for the string
       "<code>start</code>", then let <var>cue</var>'s <a>WebVTT cue line alignment</a> be <a
       lt="WebVTT cue line start alignment">start alignment</a>.</p></li>

       <li><p>If <var>linealign</var> is a <a>case-sensitive</a> match for the string
       "<code>middle</code>", then let <var>cue</var>'s <a>WebVTT cue line alignment</a> be <a
       lt="WebVTT cue line middle alignment">middle alignment</a>.</p></li>

       <li><p>If <var>linealign</var> is a <a>case-sensitive</a> match for the string
       "<code>end</code>", then let <var>cue</var>'s <a>WebVTT cue line alignment</a> be <a
       lt="WebVTT cue line end alignment">end alignment</a>.</p></li>

      </ol>

     </dd>

     <dt>If <var>name</var> is a <a>case-sensitive</a> match for "<code>position</code>"</dt>

     <dd>

      <ol>

       <li><p>If <var>value</var> contains a U+002C COMMA character (,), then let <var>colpos</var>
       be the leading substring of <var>value</var> up to and excluding the first U+002C COMMA
       character (,) in that string and let <var>colalign</var> be the trailing substring of
       <var>value</var> starting from the character immediately after the first U+002C COMMA
       character (,) in that string.</p></li>

       <li><p>Otherwise let <var>colpos</var> be the full <var>value</var> string and
       <var>colalign</var> be the empty string.</p></li>

       <li><p>If <a>parse a percentage string</a> from <var>colpos</var> doesn't fail, let
       <var>number</var> be the returned <var>percentage</var>, otherwise jump to the step labeled
       <i>next setting</i> (<a lt="WebVTT cue position">position</a>'s value remains the special
       value <a lt="WebVTT cue automatic position">auto</a>).</p></li>

       <li><p>Let <var>cue</var>'s <a lt="WebVTT cue position">position</a> be
       <var>number</var>.</p></li>

       <li><p>If <var>colalign</var> is a <a>case-sensitive</a> match for the string
       "<code>start</code>", then let <var>cue</var>'s <a>WebVTT cue position alignment</a> be <a
       lt="WebVTT cue position start alignment">start alignment</a>.</p></li>

       <li><p>If <var>colalign</var> is a <a>case-sensitive</a> match for the string
       "<code>middle</code>", then let <var>cue</var>'s <a>WebVTT cue position alignment</a> be <a
       lt="WebVTT cue position middle alignment">middle alignment</a>.</p></li>

       <li><p>If <var>colalign</var> is a <a>case-sensitive</a> match for the string
       "<code>end</code>", then let <var>cue</var>'s <a>WebVTT cue position alignment</a> be <a
       lt="WebVTT cue position end alignment">end alignment</a>.</p></li>

      </ol>

     </dd>

     <dt>If <var>name</var> is a <a>case-sensitive</a> match for "<code>size</code>"</dt>

     <dd>

      <ol>

       <li><p>If <a>parse a percentage string</a> from <var>value</var> doesn't fail, let
       <var>number</var> be the returned <var>percentage</var>, otherwise jump to the step labeled
       <i>next setting</i>.</p></li>

       <li><p>Let <var>cue</var>'s <a>WebVTT cue size</a> be <var>number</var>.</p></li>

      </ol>

     </dd>

     <dt>If <var>name</var> is a <a>case-sensitive</a> match for "<code>align</code>"</dt>

     <dd>

      <ol>

       <li><p>If <var>value</var> is a <a>case-sensitive</a> match for the string
       "<code>start</code>", then let <var>cue</var>'s <a>WebVTT cue text alignment</a> be <a
       lt="WebVTT cue start alignment">start alignment</a>.</p></li>

       <li><p>If <var>value</var> is a <a>case-sensitive</a> match for the string
       "<code>middle</code>", then let <var>cue</var>'s <a>WebVTT cue text alignment</a> be <a
       lt="WebVTT cue middle alignment">middle alignment</a>.</p></li>

       <li><p>If <var>value</var> is a <a>case-sensitive</a> match for the string
       "<code>end</code>", then let <var>cue</var>'s <a>WebVTT cue text alignment</a> be <a
       lt="WebVTT cue end alignment">end alignment</a>.</p></li>

       <li><p>If <var>value</var> is a <a>case-sensitive</a> match for the string
       "<code>left</code>", then let <var>cue</var>'s <a>WebVTT cue text alignment</a> be <a
       lt="WebVTT cue left alignment">left alignment</a>.</p></li>

       <li><p>If <var>value</var> is a <a>case-sensitive</a> match for the string
       "<code>right</code>", then let <var>cue</var>'s <a>WebVTT cue text alignment</a> be <a
       lt="WebVTT cue right alignment">right alignment</a>.</p></li>

      </ol>

     </dd>

    </dl>

   </li>

   <li><p><i>Next setting</i>: Continue to the next token, if any.</p></li> <!-- this step is just
   here to give the algorithms above a clean way to 'break' -->

  </ol>

 </li>

</ol>

<p>When this specification says that a user agent is to <dfn>collect a WebVTT timestamp</dfn>, the
user agent must run the following steps:</p>

<ol>

 <li><p>Let <var>input</var> and <var>position</var> be the same variables as those of the same name
 in the algorithm that invoked these steps.</p></li>

 <li><p>Let <var>most significant units</var> be <i>minutes</i>.</p></li>

 <li><p>If <var>position</var> is past the end of <var>input</var>, return an error and abort these
 steps.</p></li>

 <li><p>If the character indicated by <var>position</var> is not an <a spec=html>ASCII digit</a>,
 then return an error and abort these steps.</p></li>

 <li><p><a spec=html>Collect a sequence of characters</a> that are <a spec=html>ASCII digits</a>,
 and let <var>string</var> be the collected substring.</p></li>

 <li><p>Interpret <var>string</var> as a base-ten integer. Let <var>value<sub>1</sub></var> be that
 integer.</p></li>

 <li><p>If <var>string</var> is not exactly two characters in length, or if
 <var>value<sub>1</sub></var> is greater than 59, let <var>most significant units</var> be
 <i>hours</i>.</p></li>

 <li><p>If <var>position</var> is beyond the end of <var>input</var> or if the character at
 <var>position</var> is not a U+003A COLON character (:), then return an error and abort these
 steps. Otherwise, move <var>position</var> forwards one character.</p></li>

 <li><p><a spec=html>Collect a sequence of characters</a> that are <a spec=html>ASCII digits</a>,
 and let <var>string</var> be the collected substring.</p></li>

 <li><p>If <var>string</var> is not exactly two characters in length, return an error and abort
 these steps.</p></li>

 <li><p>Interpret <var>string</var> as a base-ten integer. Let <var>value<sub>2</sub></var> be that
 integer.</p></li>

 <li>

  <p>If <var>most significant units</var> is <i>hours</i>, or if <var>position</var> is not beyond
  the end of <var>input</var> and the character at <var>position</var> is a U+003A COLON character
  (:), run these substeps:</p>

  <ol>

   <li><p>If <var>position</var> is beyond the end of <var>input</var> or if the character at
   <var>position</var> is not a U+003A COLON character (:), then return an error and abort these
   steps. Otherwise, move <var>position</var> forwards one character.</p></li>

   <li><p><a spec=html>Collect a sequence of characters</a> that are <a spec=html>ASCII digits</a>,
   and let <var>string</var> be the collected substring.</p></li>

   <li><p>If <var>string</var> is not exactly two characters in length, return an error and abort
   these steps.</p></li>

   <li><p>Interpret <var>string</var> as a base-ten integer. Let <var>value<sub>3</sub></var> be
   that integer.</p></li>

  </ol>

  <p>Otherwise (if <var>most significant units</var> is not <i>hours</i>, and either
  <var>position</var> is beyond the end of <var>input</var>, or the character at <var>position</var>
  is not a U+003A COLON character (:)), let <var>value<sub>3</sub></var> have the value of
  <var>value<sub>2</sub></var>, then <var>value<sub>2</sub></var> have the value of
  <var>value<sub>1</sub></var>, then let <var>value<sub>1</sub></var> equal zero.</p>

 </li>

 <li><p>If <var>position</var> is beyond the end of <var>input</var> or if the character at
 <var>position</var> is not a U+002E FULL STOP character (.), then return an error and abort these
 steps. Otherwise, move <var>position</var> forwards one character.</p></li>

 <li><p><a spec=html>Collect a sequence of characters</a> that are <a spec=html>ASCII digits</a>,
 and let <var>string</var> be the collected substring.</p></li>

 <li><p>If <var>string</var> is not exactly three characters in length, return an error and abort
 these steps.</p></li>

 <li><p>Interpret <var>string</var> as a base-ten integer. Let <var>value<sub>4</sub></var> be that
 integer.</p></li>

 <li><p>If <var>value<sub>2</sub></var> is greater than 59 or if <var>value<sub>3</sub></var> is
 greater than 59, return an error and abort these steps.</p></li>

 <!-- no need to check if <var>value<sub>4</sub></var> is greater than 999, since we know it had
 exactly three characters in the range 0-9, so we know it's a number in the range 0-999 -->

 <li><p>Let <var>result</var> be <var>value<sub>1</sub></var>&times;60&times;60 +
 <var>value<sub>2</sub></var>&times;60 + <var>value<sub>3</sub></var> +
 <var>value<sub>4</sub></var>&#x2215;1000. <!-- &#x00f7; is the division sign if people prefer that
 to the slash --></p></li>

 <li><p>Return <var>result</var>.</p></li>

</ol>


<h3 id=cue-text-parsing-rules><dfn>WebVTT cue text parsing rules</dfn></h3>

<p>A <dfn>WebVTT Node Object</dfn> is a conceptual construct used to represent components of
<a>WebVTT cue text</a> so that its processing can be described without reference to the underlying
syntax.</p>

<p>There are two broad classes of <a lt="WebVTT Node Object">WebVTT Node Objects</a>: <a lt="WebVTT
Internal Node Object">WebVTT Internal Node Objects</a> and <a lt="WebVTT Leaf Node Object">WebVTT
Leaf Node Objects</a>.</p>

<p><dfn lt="WebVTT Internal Node Object">WebVTT Internal Node Objects</dfn> are those that can
contain further <a lt="WebVTT Node Object">WebVTT Node Objects</a>. They are conceptually similar to
elements in HTML or the DOM. <a lt="WebVTT Internal Node Object">WebVTT Internal Node Objects</a>
have an ordered list of child <a lt="WebVTT Node Object">WebVTT Node Objects</a>. The <a>WebVTT
Internal Node Object</a> is said to be the <i>parent</i> of the children. Cycles do not occur; the
parent-child relationships so constructed form a tree structure. <a lt="WebVTT Internal Node
Object">WebVTT Internal Node Objects</a> also have an ordered list of class names, known as their
<dfn lt="WebVTT Node Object's applicable classes">applicable classes</dfn>, and a language, known as
their <dfn lt="WebVTT Node Object's applicable language">applicable language</dfn>, which is to be
interpreted as a BCP 47 language tag. [[!BCP47]]</p>

<p>There are several concrete classes of <a lt="WebVTT Internal Node Object">WebVTT Internal Node
Objects</a>:</p>

<dl>

 <dt><dfn lt="List of WebVTT Node Objects">Lists of WebVTT Node Objects</dfn></dt>
 <dd>
  <p>These are used as root nodes for trees of <a lt="WebVTT Node Object">WebVTT Node
  Objects</a>.</p>
 </dd>

 <dt><dfn lt="WebVTT Class Object">WebVTT Class Objects</dfn></dt>
 <dd>
  <p>These represent spans of text (a <a>WebVTT cue class span</a>) in <a>WebVTT cue text</a>, and
  are used to annotate parts of the cue with <a lt="WebVTT Node Object's applicable
  classes">applicable classes</a> without implying further meaning (such as italics or bold).</p>
 </dd>

 <dt><dfn lt="WebVTT Italic Object">WebVTT Italic Objects</dfn></dt>
 <dd>
  <p>These represent spans of italic text (a <a>WebVTT cue italics span</a>) in <a>WebVTT cue
  text</a>.</p>
 </dd>

 <dt><dfn lt="WebVTT Bold Object">WebVTT Bold Objects</dfn></dt>
 <dd>
  <p>These represent spans of bold text (a <a>WebVTT cue bold span</a>) in <a>WebVTT cue
  text</a>.</p>
 </dd>

 <dt><dfn lt="WebVTT Underline Object">WebVTT Underline Objects</dfn></dt>
 <dd>
  <p>These represent spans of underline text (a <a>WebVTT cue underline span</a>) in <a>WebVTT cue
  text</a>.</p>
 </dd>

 <dt><dfn lt="WebVTT Ruby Object">WebVTT Ruby Objects</dfn></dt>
 <dd>
  <p>These represent spans of ruby (a <a>WebVTT cue ruby span</a>) in <a>WebVTT cue text</a>.</p>
 </dd>

 <dt><dfn lt="WebVTT Ruby Text Object">WebVTT Ruby Text Objects</dfn></dt>
 <dd>
  <p>These represent spans of ruby text (a <a>WebVTT cue ruby text span</a>) in <a>WebVTT cue
  text</a>.</p>
 </dd>

 <dt><dfn lt="WebVTT Voice Object">WebVTT Voice Objects</dfn></dt>
 <dd>
  <p>These represent spans of text associated with a specific voice (a <a>WebVTT cue voice span</a>)
  in <a>WebVTT cue text</a>. A <a>WebVTT Voice Object</a> has a value, which is the name of the
  voice.</p>
 </dd>

 <dt><dfn lt="WebVTT Language Object">WebVTT Language Objects</dfn></dt>
 <dd>
  <p>These represent spans of text (a <a>WebVTT cue language span</a>) in <a>WebVTT cue text</a>,
  and are used to annotate parts of the cue where the <a lt="WebVTT Node Object's applicable
  language">applicable language</a> might be different than the surrounding text's, without implying
  further meaning (such as italics or bold).</p>
 </dd>

</dl>

<p><dfn lt="WebVTT Leaf Node Object">WebVTT Leaf Node Objects</dfn> are those that contain data,
such as text, and cannot contain child <a lt="WebVTT Node Object">WebVTT Node Objects</a>.</p>

<p>There are two concrete classes of <a lt="WebVTT Leaf Node Object">WebVTT Leaf Node
Objects</a>:</p>

<dl>

 <dt><dfn lt="WebVTT Text Object">WebVTT Text Objects</dfn></dt>
 <dd>
  <p>A fragment of text. A <a>WebVTT Text Object</a> has a value, which is the text it
  represents.</p>
 </dd>

 <dt><dfn lt="WebVTT Timestamp Object">WebVTT Timestamp Objects</dfn></dt>
 <dd>
  <p>A timestamp. A <a>WebVTT Timestamp Object</a> has a value, in seconds and fractions of a
  second, which is the time represented by the timestamp.</p>
 </dd>

</dl>

<p>To parse a string <var>input</var> supposedly containing <a>WebVTT cue text</a>, user agents must
use the following algorithm. This algorithm returns a <a>list of WebVTT Node Objects</a>.</p>

<ol>

 <li><p>Let <var>input</var> be the string being parsed.</p></li>

 <li><p>Let <var>position</var> be a pointer into <var>input</var>, initially pointing at the start
 of the string.</p></li>

 <li><p>Let <var>result</var> be a <a>list of WebVTT Node Objects</a>, initially empty.</p></li>

 <li><p>Let <var>current</var> be the <a>WebVTT Internal Node Object</a> <var>result</var>.</p></li>

 <li><p>Let <var>language stack</var> be a stack of language tags, initially empty.</p></li>

 <li><p><i>Loop</i>: If <var>position</var> is past the end of <var>input</var>, return
 <var>result</var> and abort these steps.</p></li>

 <li><p>Let <var>token</var> be the result of invoking the <a>WebVTT cue text
 tokenizer</a>.</p></li>

 <li>

  <p>Run the appropriate steps given the type of <var>token</var>:</p>

  <dl>

   <dt>If <var>token</var> is a string</dt>
   <dd>

    <ol>

     <li><p>Create a <a>WebVTT Text Object</a> whose value is the value of the string token
     <var>token</var>.</p></li>

     <li><p>Append the newly created <a>WebVTT Text Object</a> to <var>current</var>.</p></li>

    </ol>

   </dd>

   <dt>If <var>token</var> is a start tag</dt>
   <dd>

    <p>How the start tag token <var>token</var> is processed depends on its tag name, as
    follows:</p>

    <dl>

     <dt>If the tag name is "<code>c</code>"</dt>
     <dd>
      <p><a lt="attach a WebVTT Internal Node Object">Attach</a> a <a>WebVTT Class Object</a>.</p>
     </dd>

     <dt>If the tag name is "<code>i</code>"</dt>
     <dd>
      <p><a lt="attach a WebVTT Internal Node Object">Attach</a> a <a>WebVTT Italic Object</a>.</p>
     </dd>

     <dt>If the tag name is "<code>b</code>"</dt>
     <dd>
      <p><a lt="attach a WebVTT Internal Node Object">Attach</a> a <a>WebVTT Bold Object</a>.</p>
     </dd>

     <dt>If the tag name is "<code>u</code>"</dt>
     <dd>
      <p><a lt="attach a WebVTT Internal Node Object">Attach</a> a <a>WebVTT Underline
      Object</a>.</p>
     </dd>

     <dt>If the tag name is "<code>ruby</code>"</dt>
     <dd>
      <p><a lt="attach a WebVTT Internal Node Object">Attach</a> a <a>WebVTT Ruby Object</a>.</p>
     </dd>

     <dt>If the tag name is "<code>rt</code>"</dt>
     <dd>
      <p>If <var>current</var> is a <a>WebVTT Ruby Object</a>, then <a lt="attach a WebVTT Internal
      Node Object">attach</a> a <a>WebVTT Ruby Text Object</a>.</p>
     </dd>

     <dt>If the tag name is "<code>v</code>"</dt>
     <dd>
      <p><a lt="attach a WebVTT Internal Node Object">Attach</a> a <a>WebVTT Voice Object</a>, and
      set its value to the token's annotation string, or the empty string if there is no annotation
      string.</p>
     </dd>

     <dt>If the tag name is "<code>lang</code>"</dt>
     <dd>
      <p>Push the value of the token's annotation string, or the empty string if there is no
      annotation string, onto the <var>language stack</var>; then <a lt="attach a WebVTT Internal
      Node Object">attach</a> a <a>WebVTT Language Object</a>.</p>
     </dd>

     <dt>Otherwise</dt>
     <dd>
      <p>Ignore the token.</p>
     </dd>

    </dl>

    <p>When the steps above say to <dfn>attach a WebVTT Internal Node Object</dfn> of a particular
    concrete class, the user agent must run the following steps:</p>

    <ol>

     <li><p>Create a new <a>WebVTT Internal Node Object</a> of the specified concrete
     class.</p></li>

     <li><p>Set the new object's list of <a lt="WebVTT Node Object's applicable classes">applicable
     classes</a> to the list of classes in the token, excluding any classes that are the empty
     string.</p></li>

     <li><p>Set the new object's <a lt="WebVTT Node Object's applicable language">applicable
     language</a> to the top entry on the <var>language stack</var>, if the stack is not
     empty.</p></li>

     <li><p>Append the newly created node object to <var>current</var>.</p></li>

     <li><p>Let <var>current</var> be the newly created node object.</p></li>

    </ol>

   </dd>

   <dt>If <var>token</var> is an end tag</dt>
   <dd>

    <p>If any of the following conditions is true, then let <var>current</var> be the parent node of
    <var>current</var>.</p>

    <ul class="brief">

     <li>The tag name of the end tag token <var>token</var> is "<code>c</code>" and
     <var>current</var> is a <a>WebVTT Class Object</a>.</li>

     <li>The tag name of the end tag token <var>token</var> is "<code>i</code>" and
     <var>current</var> is a <a>WebVTT Italic Object</a>.</li>

     <li>The tag name of the end tag token <var>token</var> is "<code>b</code>" and
     <var>current</var> is a <a>WebVTT Bold Object</a>.</li>

     <li>The tag name of the end tag token <var>token</var> is "<code>u</code>" and
     <var>current</var> is a <a>WebVTT Underline Object</a>.</li>

     <li>The tag name of the end tag token <var>token</var> is "<code>ruby</code>" and
     <var>current</var> is a <a>WebVTT Ruby Object</a>.</li>

     <li>The tag name of the end tag token <var>token</var> is "<code>rt</code>" and
     <var>current</var> is a <a>WebVTT Ruby Text Object</a>.</li>

     <li>The tag name of the end tag token <var>token</var> is "<code>v</code>" and
     <var>current</var> is a <a>WebVTT Voice Object</a>.</li>

    </ul>

    <p>Otherwise, if the tag name of the end tag token <var>token</var> is "<code>lang</code>" and
    <var>current</var> is a <a>WebVTT Language Object</a>, then let <var>current</var> be the parent
    node of <var>current</var>, and pop the top value from the <var>language stack</var>.</p>

    <p>Otherwise, if the tag name of the end tag token <var>token</var> is "<code>ruby</code>" and
    <var>current</var> is a <a>WebVTT Ruby Text Object</a>, then let <var>current</var> be the
    parent node of the parent node of <var>current</var>.</p>

    <p>Otherwise, ignore the token.</p>

   </dd>

   <dt>If <var>token</var> is a timestamp tag</dt>
   <dd>

    <ol>

     <li><p>Let <var>input</var> be the tag value.</p></li>

     <li><p>Let <var>position</var> be a pointer into <var>input</var>, initially pointing at the
     start of the string.</p></li>

     <li><p><a>Collect a WebVTT timestamp</a>.</p></li>

     <li>

      <p>If that algorithm does not fail, and if <var>position</var> now points at the end of
      <var>input</var> (i.e. there are no trailing characters after the timestamp), then create a
      <a>WebVTT Timestamp Object</a> whose value is the collected time, then append it to
      <var>current</var>.</p>

      <p>Otherwise, ignore the token.</p>

     </li>

    </ol>

   </dd>

  </dl>

 </li>

 <li><p>Jump to the step labeled <i>loop</i>.</p></li>

</ol>

<p>The <dfn>WebVTT cue text tokenizer</dfn> is as follows. It emits a token, which is either a
string (whose value is a sequence of characters), a start tag (with a tag name, a list of classes,
and optionally an annotation), an end tag (with a tag name), or a timestamp tag (with a tag
value).</p>

<ol>

 <li><p>Let <var>input</var> and <var>position</var> be the same variables as those of the same name
 in the algorithm that invoked these steps.</p></li>

 <li><p>Let <var>tokenizer state</var> be <a>WebVTT data state</a>.</p></li>

 <li><p>Let <var>result</var> be the empty string.</p></li>

 <li><p>Let <var>buffer</var> be the empty string.</p></li>

 <li><p>Let <var>classes</var> be an empty list.</p></li>

 <li>

  <p><i>Loop</i>: If <var>position</var> is past the end of <var>input</var>, let <var>c</var> be an
  end-of-file marker. Otherwise, let <var>c</var> be the character in <var>input</var> pointed to by
  <var>position</var>.</p>

  <p class="note">An end-of-file marker is not a Unicode character, it is used to end the
  tokenizer.</p>

 </li>

 <li>

  <p>Jump to the state given by <var>tokenizer state</var>:</p>

  <dl>

   <dt><dfn>WebVTT data state</dfn></dt>

   <dd>

    <p>Jump to the entry that matches the value of <var>c</var>:</p>

    <dl>

     <dt>U+0026 AMPERSAND (&amp;)</dt>
     <dd>
      <p>Set <var>buffer</var> to <var>c</var>, set <var>tokenizer state</var> to the <a>WebVTT
      escape state</a>, and jump to the step labeled <i>next</i>.</p>
     </dd>

     <dt>U+003C LESS-THAN SIGN (&lt;)</dt>
     <dd>
      <p>If <var>result</var> is the empty string, then set <var>tokenizer state</var> to the
      <a>WebVTT tag state</a> and jump to the step labeled <i>next</i>.</p>
      <p>Otherwise, return a string token whose value is <var>result</var> and abort these
      steps.</p>
     </dd>

     <dt>End-of-file marker</dt>
     <dd>
      <p>Return a string token whose value is <var>result</var> and abort these steps.</p>
     </dd>

     <dt>Anything else</dt>
     <dd>
      <p>Append <var>c</var> to <var>result</var> and jump to the step labeled <i>next</i>.</p>
     </dd>

    </dl>

   </dd>

   <dt><dfn>WebVTT escape state</dfn></dt>

   <dd>

    <p>Jump to the entry that matches the value of <var>c</var>:</p>

    <dl>

     <dt>U+0026 AMPERSAND (&amp;)</dt>
     <dd>
      <p>Append <var>buffer</var> to <var>result</var>, set <var>buffer</var> to <var>c</var>, and
      jump to the step labeled <i>next</i>.</p>
     </dd>

     <dt><a>Alphanumeric ASCII characters</a></dt>
     <dd>
      <p>Append <var>c</var> to <var>buffer</var> and jump to the step labeled <i>next</i>.</p>
     </dd>

     <dt>U+003B SEMICOLON character (;)</dt>
     <dd>

      <p>First, examine the value of <var>buffer</var>:</p>

      <p>If <var>buffer</var> is the string "<code>&amp;amp</code>", then append a U+0026 AMPERSAND
      character (&amp;) to <var>result</var>.</p>

      <p>If <var>buffer</var> is the string "<code>&amp;lt</code>", then append a U+003C LESS-THAN
      SIGN character (&lt;) to <var>result</var>.</p>

      <p>If <var>buffer</var> is the string "<code>&amp;gt</code>", then append a U+003E
      GREATER-THAN SIGN character (&gt;) to <var>result</var>.</p>

      <p>If <var>buffer</var> is the string "<code>&amp;lrm</code>", then append a U+200E
      LEFT-TO-RIGHT MARK character to <var>result</var>.</p>

      <p>If <var>buffer</var> is the string "<code>&amp;rlm</code>", then append a U+200F
      RIGHT-TO-LEFT MARK character to <var>result</var>.</p>

      <p>If <var>buffer</var> is the string "<code>&amp;nbsp</code>", then append a U+00A0 NO-BREAK
      SPACE character to <var>result</var>.</p>

      <p>Otherwise, append <var>buffer</var> followed by a U+003B SEMICOLON character (;) to
      <var>result</var>.</p>

      <p>Then, in any case, set <var>tokenizer state</var> to the <a>WebVTT data state</a>, and jump
      to the step labeled <i>next</i>.</p>

     </dd>

     <dt>U+003C LESS-THAN SIGN (&lt;)</dt>
     <dt>End-of-file marker</dt>
     <dd>
      <p>Append <var>buffer</var> to <var>result</var>, return a string token whose value is
      <var>result</var>, and abort these steps.</p>
     </dd>

     <dt>Anything else</dt>
     <dd>
      <p>Append <var>buffer</var> to <var>result</var>, append <var>c</var> to <var>result</var>,
      set <var>tokenizer state</var> to the <a>WebVTT data state</a>, and jump to the step labeled
      <i>next</i>.</p>
     </dd>

    </dl>

   </dd>

   <dt><dfn>WebVTT tag state</dfn></dt>

   <dd>

    <p>Jump to the entry that matches the value of <var>c</var>:</p>

    <dl>

     <dt>U+0009 CHARACTER TABULATION (tab) character</dt>
     <dt>U+000A LINE FEED (LF) character</dt>
     <dt>U+000C FORM FEED (FF) character</dt>
     <dt>U+0020 SPACE character</dt>
     <dd>
      <!-- assert: >result< is the empty string -->
      <p>Set <var>tokenizer state</var> to the <a>WebVTT start tag annotation state</a>, and jump to
      the step labeled <i>next</i>.</p>
     </dd>

     <dt>U+002E FULL STOP character (.)</dt>
     <dd>
      <!-- assert: >result< is the empty string -->
      <p>Set <var>tokenizer state</var> to the <a>WebVTT start tag class state</a>, and jump to the
      step labeled <i>next</i>.</p>
     </dd>

     <dt>U+002F SOLIDUS character (/)</dt>
     <dd>
      <p>Set <var>tokenizer state</var> to the <a>WebVTT end tag state</a>, and jump to the step
      labeled <i>next</i>.</p>
     </dd>

     <dt><a>ASCII digits</a></dt>
     <dd>
      <p>Set <var>result</var> to <var>c</var>, set <var>tokenizer state</var> to the <a>WebVTT
      timestamp tag state</a>, and jump to the step labeled <i>next</i>.</p>
     </dd>

     <dt>U+003E GREATER-THAN SIGN character (>)</dt>
     <dd>
      <p>Advance <var>position</var> to the next character in <var>input</var>, then jump to the
      next "end-of-file marker" entry below.</p>
     </dd>

     <dt>End-of-file marker</dt>
     <dd>
      <p>Return a start tag whose tag name is the empty string, with no classes and no annotation,
      and abort these steps.</p>
     </dd>

     <dt>Anything else</dt>
     <dd>
      <p>Set <var>result</var> to <var>c</var>, set <var>tokenizer state</var> to the <a>WebVTT
      start tag state</a>, and jump to the step labeled <i>next</i>.</p>
     </dd>

    </dl>

   </dd>

   <dt><dfn>WebVTT start tag state</dfn></dt>

   <dd>

    <p>Jump to the entry that matches the value of <var>c</var>:</p>

    <dl>

     <dt>U+0009 CHARACTER TABULATION (tab) character</dt>
     <dt>U+000C FORM FEED (FF) character</dt>
     <dt>U+0020 SPACE character</dt>
     <dd>
      <p>Set <var>tokenizer state</var> to the <a>WebVTT start tag annotation state</a>, and jump to
      the step labeled <i>next</i>.</p>
     </dd>

     <dt>U+000A LINE FEED (LF) character</dt>
     <dd>
      <p>Set <var>buffer</var> to <var>c</var>, set <var>tokenizer state</var> to the <a>WebVTT
      start tag annotation state</a>, and jump to the step labeled <i>next</i>.</p>
     </dd>

     <dt>U+002E FULL STOP character (.)</dt>
     <dd>
      <p>Set <var>tokenizer state</var> to the <a>WebVTT start tag class state</a>, and jump to the
      step labeled <i>next</i>.</p>
     </dd>

     <dt>U+003E GREATER-THAN SIGN character (>)</dt>
     <dd>
      <p>Advance <var>position</var> to the next character in <var>input</var>, then jump to the
      next "end-of-file marker" entry below.</p>
     </dd>

     <dt>End-of-file marker</dt>
     <dd>
      <p>Return a start tag whose tag name is <var>result</var>, with no classes and no annotation,
      and abort these steps.</p>
     </dd>

     <dt>Anything else</dt>
     <dd>
      <p>Append <var>c</var> to <var>result</var> and jump to the step labeled <i>next</i>.</p>
     </dd>

    </dl>

   </dd>

   <dt><dfn>WebVTT start tag class state</dfn></dt>

   <dd>

    <p>Jump to the entry that matches the value of <var>c</var>:</p>

    <dl>

     <dt>U+0009 CHARACTER TABULATION (tab) character</dt>
     <dt>U+000C FORM FEED (FF) character</dt>
     <dt>U+0020 SPACE character</dt>
     <dd>
      <p>Append to <var>classes</var> an entry whose value is <var>buffer</var>, set
      <var>buffer</var> to the empty string, set <var>tokenizer state</var> to the <a>WebVTT start
      tag annotation state</a>, and jump to the step labeled <i>next</i>.</p>
     </dd>

     <dt>U+000A LINE FEED (LF) character</dt>
     <dd>
      <p>Append to <var>classes</var> an entry whose value is <var>buffer</var>, set
      <var>buffer</var> to <var>c</var>, set <var>tokenizer state</var> to the <a>WebVTT start tag
      annotation state</a>, and jump to the step labeled <i>next</i>.</p>
     </dd>

     <dt>U+002E FULL STOP character (.)</dt>
     <dd>
      <p>Append to <var>classes</var> an entry whose value is <var>buffer</var>, set
      <var>buffer</var> to the empty string, and jump to the step labeled <i>next</i>.</p>
     </dd>

     <dt>U+003E GREATER-THAN SIGN character (>)</dt>
     <dd>
      <p>Advance <var>position</var> to the next character in <var>input</var>, then jump to the
      next "end-of-file marker" entry below.</p>
     </dd>

     <dt>End-of-file marker</dt>
     <dd>
      <p>Append to <var>classes</var> an entry whose value is <var>buffer</var>, then return a start
      tag whose tag name is <var>result</var>, with the classes given in <var>classes</var> but no
      annotation, and abort these steps.</p>
     </dd>

     <dt>Anything else</dt>
     <dd>
      <p>Append <var>c</var> to <var>buffer</var> and jump to the step labeled <i>next</i>.</p>
     </dd>

    </dl>

   </dd>

   <dt><dfn>WebVTT start tag annotation state</dfn></dt>

   <dd>

    <p>Jump to the entry that matches the value of <var>c</var>:</p>

    <dl>

     <dt>U+003E GREATER-THAN SIGN character (>)</dt>
     <dd>
      <p>Advance <var>position</var> to the next character in <var>input</var>, then jump to the
      next "end-of-file marker" entry below.</p>
     </dd>

     <dt>End-of-file marker</dt>
     <dd>
      <p>Remove any leading or trailing <a lt="space character">space characters</a> from
      <var>buffer</var>, and replace any sequence of one or more consecutive <a lt="space
      character">space characters</a> in <var>buffer</var> with a single U+0020 SPACE character;
      then, return a start tag whose tag name is <var>result</var>, with the classes given in
      <var>classes</var>, and with <var>buffer</var> as the annotation, and abort these steps.</p>
     </dd>

     <dt>Anything else</dt>
     <dd>
      <p>Append <var>c</var> to <var>buffer</var> and jump to the step labeled <i>next</i>.</p>
     </dd>

    </dl>

   </dd>

   <dt><dfn>WebVTT end tag state</dfn></dt>

   <dd>

    <p>Jump to the entry that matches the value of <var>c</var>:</p>

    <dl>

     <!-- should we ignore anything after spaces, tabs, and line feeds? -->

     <dt>U+003E GREATER-THAN SIGN character (>)</dt>
     <dd>
      <p>Advance <var>position</var> to the next character in <var>input</var>, then jump to the
      next "end-of-file marker" entry below.</p>
     </dd>

     <dt>End-of-file marker</dt>
     <dd>
      <p>Return an end tag whose tag name is <var>result</var> and abort these steps.</p>
     </dd>

     <dt>Anything else</dt>
     <dd>
      <p>Append <var>c</var> to <var>result</var> and jump to the step labeled <i>next</i>.</p>
     </dd>

    </dl>

   </dd>

   <dt><dfn>WebVTT timestamp tag state</dfn></dt>

   <dd>

    <p>Jump to the entry that matches the value of <var>c</var>:</p>

    <dl>

     <dt>U+003E GREATER-THAN SIGN character (>)</dt>
     <dd>
      <p>Advance <var>position</var> to the next character in <var>input</var>, then jump to the
      next "end-of-file marker" entry below.</p>
     </dd>

     <dt>End-of-file marker</dt>
     <dd>
      <p>Return a timestamp tag whose tag name is <var>result</var> and abort these steps.</p>
     </dd>

     <dt>Anything else</dt>
     <dd>
      <p>Append <var>c</var> to <var>result</var> and jump to the step labeled <i>next</i>.</p>
     </dd>

    </dl>

   </dd>

  </dl>

 </li>

 <li><p><i>Next</i>: Advance <var>position</var> to the next character in <var>input</var>.</p></li>

 <li><p>Jump to the step labeled <i>loop</i>.</p></li>

</ol>


<h3 id=dom-construction-rules><dfn>WebVTT cue text DOM construction rules</dfn></h3>

<p class="note">For the purpose of retrieving a <a>WebVTT cue</a>'s content via the
{{VTTCue/getCueAsHTML()}} method of the {{VTTCue}} interface, it needs to be parsed to a
{{DocumentFragment}}. This section describes how.</p>

<p>To convert a <a>list of WebVTT Node Objects</a> to a DOM tree for {{Document}} <var>owner</var>,
user agents must create a tree of DOM nodes that is isomorphous to the tree of <a lt="WebVTT Node
Object">WebVTT Node Objects</a>, with the following mapping of <a lt="WebVTT Node Object">WebVTT
Node Objects</a> to DOM nodes:</p>

<table class="complex data">
 <thead>
  <tr>
   <th><a>WebVTT Node Object</a></th>
   <th>DOM node</th>
  </tr>
 </thead>
 <tbody>
  <tr>
   <td class=long><a>List of WebVTT Node Objects</a></td>
   <td class=long>{{DocumentFragment}} node.</td>
  </tr>
  <tr>
   <td class=long><a>WebVTT Region Object</a></td>
   <td class=long>{{DocumentFragment}} node.</td>
  </tr>
  <tr>
   <td class=long><a>WebVTT Class Object</a></td>
   <td class=long>HTML <a spec=html element>span</a> element.</td>
  </tr>
  <tr>
   <td class=long><a>WebVTT Italic Object</a></td>
   <td class=long>HTML <a spec=html element>i</a> element.</td>
  </tr>
  <tr>
   <td class=long><a>WebVTT Bold Object</a></td>
   <td class=long>HTML <a spec=html element>b</a> element.</td>
  </tr>
  <tr>
   <td class=long><a>WebVTT Underline Object</a></td>
   <td class=long>HTML <a spec=html element>u</a> element.</td>
  </tr>
  <tr>
   <td class=long><a>WebVTT Ruby Object</a></td>
   <td class=long>HTML <a spec=html element>ruby</a> element.</td>
  </tr>
  <tr>
   <td class=long><a>WebVTT Ruby Text Object</a></td>
   <td class=long>HTML <a spec=html element>rt</a> element.</td>
  </tr>
  <tr>
   <td class=long><a>WebVTT Voice Object</a></td>
   <td class=long>HTML <a spec=html element>span</a> element with a <a spec=html
   element-attr>title</a> attribute set to the <a>WebVTT Voice Object</a>'s value.</td>
  </tr>
  <tr>
   <td class=long><a>WebVTT Language Object</a></td>
   <td class=long>HTML <a spec=html element>span</a> element with a <a spec=html
   element-attr>lang</a> attribute set to the <a>WebVTT Language Object</a>'s <a lt="WebVTT Node
   Object's applicable language">applicable language</a>.</td>
  </tr>
  <tr>
   <td class=long><a>WebVTT Text Object</a></td>
   <td class=long>{{Text}} node whose {{CharacterData/data}} is the value of the <a>WebVTT Text
   Object</a>.</td>
  </tr>
  <tr>
   <td class=long><a>WebVTT Timestamp Object</a></td>
   <td class=long>{{ProcessingInstruction}} node whose {{ProcessingInstruction/target}} is
   "<code>timestamp</code>" and whose {{CharacterData/data}} is a <a>WebVTT timestamp</a>
   representing the value of the <a>WebVTT Timestamp Object</a>, with all optional components
   included, with one leading zero if the <var>hours</var> component is less than ten, and with no
   leading zeros otherwise.</td>
  </tr>
 </tbody>
</table>

<p>HTML elements created as part of the mapping described above must have their
{{Node/namespaceURI}} set to the <a>HTML namespace</a>, use the appropriate IDL interface as defined
in the HTML specification, and, if the corresponding <a>WebVTT Internal Node Object</a> has any <a
lt="WebVTT Node Object's applicable classes">applicable classes</a>, must have a <a spec=html
element-attr>class</a> attribute set to the string obtained by concatenating all those classes, each
separated from the next by a single U+0020 SPACE character.</p>

<p>The {{Node/ownerDocument}} attribute of all nodes in the DOM tree must be set to the given
document <var>owner</var>.</p>

<p>All characteristics of the DOM nodes that are not described above or dependent on characteristics
defined above must be left at their initial values.</p>


<h3 id=rules-for-extracting-the-chapter-title>WebVTT rules for extracting the chapter title</h3>

<p>The <dfn>WebVTT rules for extracting the chapter title</dfn> are as follows:</p>

<ol>

 <li><p>Let <var>nodes</var> be the <a>list of WebVTT Node Objects</a> obtained by applying the
 <a>WebVTT cue text parsing rules</a> to the <var>cue</var>'s <a>text track cue text</a>.</p></li>

 <li><p>Return the concatenation of the values of each <a>WebVTT Text Object</a> in
 <var>nodes</var>, in a pre-order, depth-first traversal, excluding <a lt="WebVTT Ruby Text
 Object">WebVTT Ruby Text Objects</a> and their descendants.</p></li>

</ol>


<h2 id=rendering>Rendering</h2>

<p class="note">This section describes in some detail how to visually render WebVTT cues in a user
agent. It also specifies extra CSS functionality made available to manipulate the visual rendering.
The processing model is quite tightly linked to media elements in HTML. When supporting WebVTT in
media players that don't support CSS, equivalent visual rendering will need to be implemented.</p>


<h3 id=processing-model>Processing model</h3>

<p>The <dfn>rules for updating the display of WebVTT text tracks</dfn> render the <a lt="text
track">text tracks</a> of a <a>media element</a> (specifically, a <a element>video</a> element), or
of another playback mechanism, by applying the steps below. All the <a lt="text track">text
tracks</a> that use these rules for a given <a>media element</a>, or other playback mechanism, are
rendered together, to avoid overlapping subtitles from multiple tracks.</p>

<p class="note">In HTML, audio elements don't have a visual rendering area and therefore, this
algorithm will abort for audio elements. When authors do create WebVTT captions or subtitles for
audio resources, they need to publish them in a video element for rendering by the user agent.</p>

<p>The output of the steps below is a set of CSS boxes that covers the rendering area of the
<a>media element</a> or other playback mechanism, which user agents are expected to render in a
manner suiting the user.</p>

<p>The rules are as follows:</p>

<ol>

 <li><p>If the <a>media element</a> is an <a element>audio</a> element, or is another playback
 mechanism with no rendering area, abort these steps.</p></li>

 <li><p>Let <var>video</var> be the <a>media element</a> or other playback mechanism.</p></li>

 <li><p>Let <var>output</var> be an empty list of absolutely positioned CSS block boxes.</p></li>

 <li><p>If the user agent is <a lt="expose a user interface to the user">exposing a user
 interface</a> for <var>video</var>, add to <var>output</var> one or more completely transparent
 positioned CSS block boxes that cover the same region as the user interface.</p></li>

 <li><p>If the last time these rules were run, the user agent was not <a lt="expose a user interface
 to the user">exposing a user interface</a> for <var>video</var>, but now it is, optionally let
 <var>reset</var> be true. Otherwise, let <var>reset</var> be false.</p></li>

 <li><p>Let <var>tracks</var> be the subset of <var>video</var>'s <a>list of text tracks</a> that
 have as their <a>rules for updating the text track rendering</a> these <a>rules for updating the
 display of WebVTT text tracks</a>, and whose <a>text track mode</a> is <a lt="text track
 showing">showing</a>.</p></li>

 <li><p>Let <var>cues</var> be an empty list of <a lt="text track cue">text track cues</a>.</p></li>

 <li><p>For each track <var>track</var> in <var>tracks</var>, append to <var>cues</var> all the <a
 lt="text track cue">cues</a> from <var>track</var>'s <a lt="text track list of cues">list of
 cues</a> that have their <a>text track cue active flag</a> set.</p></li>

 <li><p>Let <var>regions</var> be an empty list of <a lt="WebVTT region">WebVTT
 regions</a>.</p></li>

 <li><p>For each track <var>track</var> in <var>tracks</var>, append to <var>regions</var> all the
 <a lt="WebVTT region">regions</a> from <var>track</var>'s <a lt="text track list of regions">list
 of regions</a>.</p></li>

 <li><p>If <var>reset</var> is false, then, for each <a>WebVTT region</a> <var>region</var> in
 <var>regions</var> let <var>regionNode</var> be a <a>WebVTT region object</a>.</p></li>

 <li>
  <p>Apply the following steps for each <var>regionNode</var>:</p>

  <ol>
   <li>
    <p>Prepare some variables for the application of CSS properties to <var>regionNode</var> as
    follows:</p>

    <ul>
     <li><p>Let <var>regionWidth</var> be the <a>WebVTT region width</a>. Let <var>width</var> be
     '<var>regionWidth</var> vw' (''vw'' is a CSS unit). [[!CSS-VALUES]]</p></li>

     <li><p>Let <var>lineHeight</var> be '5.33vh' (''vh'' is a CSS unit) [[!CSS-VALUES]] and
     <var>regionHeight</var> be the <a>WebVTT region lines</a>. Let <var>lines</var> be
     '<var>lineHeight</var> multiplied by <var>regionHeight</var>.</p></li>

     <li><p>Let <var>viewportAnchorX</var> be the x dimension of the <a>WebVTT region anchor</a> and
     <var>regionAnchorX</var> be the x dimension of the <a>WebVTT region anchor</a>. Let
     <var>leftOffset</var> be <var>regionAnchorX</var> multiplied by <var>width</var> divided by
     100.0. Let <var>left</var> be <var>leftOffset</var> subtracted from '<var>viewportAnchorX</var>
     vw'.</p></li>

     <li><p>Let <var>viewportAnchorY</var> be the y dimension of the <a>WebVTT region anchor</a> and
     <var>regionAnchorY</var> be the y dimension of the <a>WebVTT region anchor</a>. Let
     <var>topOffset</var> be <var>regionAnchorY</var> multiplied by <var>lines</var> divided by
     100.0. Let <var>top</var> be <var>topOffset</var> subtracted from '<var>viewportAnchorY</var>
     vh'.</p></li>
    </ul>
   </li>

   <li>
    <p>Apply the terms of the CSS specifications to <var>regionNode</var> within the following
    constraints, thus obtaining a CSS box <var>box</var> positioned relative to an initial
    containing block:</p>
    <ol>
     <li><p>No style sheets are associated with <var>regionNode</var>. (The regionNodes are
     subsequently restyled using style sheets after their boxes are generated, as described
     below.)</p></li>
     <li><p>Properties on <var>regionNode</var> have their values set as defined in the next
     section. (That section uses some of the variables whose values were calculated earlier in this
     algorithm.)</p></li>
     <li><p>The viewport (and initial containing block) is video's rendering area.</p></li>
    </ol>
   </li>

   <li><p>Add the CSS box <var>box</var> to <var>output</var>.</p></li>
  </ol>
 </li>

 <li>
  <p>If <var>reset</var> is false, then, for each <a>WebVTT cue</a> <var>cue</var> in
  <var>cues</var>: if <var>cue</var>'s <a>text track cue display state</a> has a set of CSS boxes,
  then:</p>

  <ul>
   <li><p>If <var>cue</var>'s <a>WebVTT cue region</a> is not null, add those boxes to that region's
   <var>box</var> and remove <var>cue</var> from <var>cues</var>.</p></li>
   <li><p>Otherwise, add those boxes to <var>output</var> and remove <var>cue</var> from
   <var>cues</var>.</p></li>
  </ul>

 </li>

 <li>

  <p>For each <a>WebVTT cue</a> <var>cue</var> in <var>cues</var> that has not yet had corresponding
  CSS boxes added to <var>output</var>, in <a>text track cue order</a>, run the following
  substeps:</p>

  <ol>

   <li><p>Let <var>nodes</var> be the <a>list of WebVTT Node Objects</a> obtained by applying the
   <a>WebVTT cue text parsing rules</a> to the <var>cue</var>'s <a>text track cue text</a>.</p></li>

   <li>
    <p>If <var>cue</var>'s <a>WebVTT cue region</a> is null, run the following substeps:</p>

    <ol>

     <li><a>Apply WebVTT cue settings</a> to obtain CSS boxes <var>boxes</var> from
     <var>nodes</var>.</li>

     <li><p>Let <var>cue</var>'s <a>text track cue display state</a> have the CSS boxes in
     <var>boxes</var>.</p></li>

     <li><p>Add the CSS boxes in <var>boxes</var> to <var>output</var>.</p></li>

    </ol>
   </li>

   <li>
    <p>Otherwise, run the following substeps:</p>
    <ol>
     <li><p>Let <var>region</var> be <var>cue</var>'s <a>WebVTT cue region</a>.</p></li>

     <li><p>If <var>region</var>'s <a>WebVTT region scroll</a> setting is '<code>up</code>' and
     <var>region</var> already has one child, set <var>region</var>'s 'transition-property' to 'top'
     and 'transition-duration' to '0.433s'.</p></li>

     <!-- The following steps are the subpart of the "apply WebVTT cue settings" algorithm that
     applies to regions -->
     <li>
      <p>Apply the Unicode Bidirectional Algorithm's Paragraph Level steps to the concatenation of
      the values of each <a>WebVTT Text Object</a> in <var>nodes</var>, in a pre-order, depth-first
      traversal, excluding <a lt="WebVTT Ruby Text Object">WebVTT Ruby Text Objects</a> and their
      descendants, to determine the <i>paragraph embedding level</i> of the first Unicode paragraph
      of the cue. [[!BIDI]]</p>
      <p class="note">Within a cue, paragraph boundaries are only denoted by Type B characters, such
      as U+000A LINE FEED (LF), U+0085 NEXT LINE (NEL), and U+2029 PARAGRAPH SEPARATOR. (This means
      each line of the cue is reordered as if it was a separate paragraph.)</p>
     </li>

     <li><p>If the <i>paragraph embedding level</i> determined in the previous step is even (the
     <i>paragraph direction</i> is left-to-right), let <var>direction</var> be "ltr", otherwise, let
     it be "rtl".</p></li>

     <li><p>Let <var>offset</var> be <var>cue</var>'s <a lt="cue computed position">computed
     position</a> multiplied by <var>region</var>'s <a>WebVTT region width</a> and divided by 100
     (i.e. interpret it as a percentage of the region width).</p></li>

     <li>
      <p>Adjust <var>offset</var> using <var>cue</var>'s <a lt="cue computed position
      alignment">computed position alignment</a> as follows:</p>
      <dl class="switch">
       <dt>If the <a lt="cue computed position alignment">computed position alignment</a> is <a
       lt="WebVTT cue position middle alignment">middle alignment</a></dt>
       <dd><p>Subtract half of <var>region</var>'s <a>WebVTT region width</a> from
       <var>offset</var>.</p></dd>

       <dt>If the <a lt="cue computed position alignment">computed position alignment</a> is <a
       lt="WebVTT cue position end alignment">end alignment</a></dt>
       <dd><p>Subtract <var>region</var>'s <a>WebVTT region width</a> from
       <var>offset</var>.</p></dd>
      </dl>
     </li>

     <li><p>Let <var>left</var> be '<var>offset</var>&#x2009;%'. ('%' is a CSS unit.)
     [[!CSS-VALUES]]</p></li>

     <li>
      <p>Apply the terms of the CSS specifications to <var>nodes</var> with the same constraints
      that are used when they are applied to <var>nodes</var> of a <var>cue</var> that is not part
      of a region.</p>

      <p>Let <var>boxes</var> be the boxes generated as descendants of the initial containing block,
      along with their positions.</p>
     </li>

     <li><p>If there are no line boxes in <var>boxes</var>, skip the remainder of these substeps for
     <var>cue</var>. The cue is ignored.</p></li>

     <li><p>Let <var>cue</var>'s <a>text track cue display state</a> have the CSS boxes in
     <var>boxes</var>.</p></li>

     <li><p>Add the CSS boxes in <var>boxes</var> to <var>region</var>.</p></li>

     <li><p>If the CSS boxes <var>boxes</var> together have a height less than the height of the
     <var>region</var> box, let <var>diff</var> be the absolute difference between the two height
     values. Increase <var>top</var> by <var>diff</var> and re-apply it to
     <var>regionNode</var>.</p></li>
    </ol>
   </li>

  </ol>
 </li>

 <li><p>Return <var>output</var>.</p></li>

</ol>

<p>User agents may allow the user to override the above algorithm's positioning of cues, e.g. by
dragging them to another location on the <a element>video</a>, or even off the <a element>video</a>
entirely.</p>

<p>When the algorithm above requires that the user agent <dfn>apply WebVTT cue settings</dfn> to
obtain CSS boxes from a <a>list of WebVTT Node Objects</a> <var>nodes</var>, the user agent must run
the following algorithm.</p>

<ol>

 <li>

  <p>Apply the Unicode Bidirectional Algorithm's Paragraph Level steps to the concatenation of the
  values of each <a>WebVTT Text Object</a> in <var>nodes</var>, in a pre-order, depth-first
  traversal, excluding <a lt="WebVTT Ruby Text Object">WebVTT Ruby Text Objects</a> and their
  descendants, to determine the <i>paragraph embedding level</i> of the first Unicode paragraph of
  the cue. [[!BIDI]]</p>

  <p class="note">Within a cue, paragraph boundaries are only denoted by Type B characters, such as
  U+000A LINE FEED (LF), U+0085 NEXT LINE (NEL), and U+2029 PARAGRAPH SEPARATOR. (This means each
  line of the cue is reordered as if it was a separate paragraph.)</p>

 </li>

 <li>

  <p>If the <i>paragraph embedding level</i> determined in the previous step is even (the
  <i>paragraph direction</i> is left-to-right), let <var>direction</var> be "ltr", otherwise, let it
  be "rtl".</p>

 </li>

 <li><p>If the <a>WebVTT cue writing direction</a> is <a lt="WebVTT cue horizontal writing
 direction">horizontal</a>, then let <var>writing-mode</var> be "horizontal-tb". Otherwise, if the
 <a>WebVTT cue writing direction</a> is <a lt="WebVTT cue vertical growing left writing
 direction">vertical growing left</a>, then let <var>writing-mode</var> be "vertical-rl". Otherwise,
 the <a>WebVTT cue writing direction</a> is <a lt="WebVTT cue vertical growing right writing
 direction">vertical growing right</a>; let <var>writing-mode</var> be "vertical-lr".</p></li>

 <li>

  <p>Determine the value of <var>maximum size</var> for <var>cue</var> as per the appropriate rules
  from the following list:</p>

  <dl class="switch">

   <dt>If the <a lt="cue computed position alignment">computed position alignment</a> is <a
   lt="WebVTT cue position start alignment">start</a></dt>
   <dd>
    <p>Let <var>maximum size</var> be the <a lt="cue computed position">computed position</a>
    subtracted from 100.</p>
   </dd>

   <dt>If the <a lt="cue computed position alignment">computed position alignment</a> is <a
   lt="WebVTT cue position end alignment">end</a></dt>
   <dd>
    <p>Let <var>maximum size</var> be the <a lt="cue computed position">computed position</a>.</p>
   </dd>

   <dt>If the <a lt="cue computed position alignment">computed position alignment</a> is <a
   lt="WebVTT cue position middle alignment">middle</a>, and the <a lt="cue computed
   position">computed position</a> is less than or equal to 50</dt>
   <dd>
    <p>Let <var>maximum size</var> be the <a lt="cue computed position">computed position</a>
    multiplied by two.</p>
   </dd>

   <dt>If the <a lt="cue computed position alignment">computed position alignment</a> is <a
   lt="WebVTT cue position middle alignment">middle</a>, and the <a lt="cue computed
   position">computed position</a> is greater than <!-- or equal to --> 50</dt>
   <dd>
    <p>Let <var>maximum size</var> be the result of subtracting <a lt="cue computed
    position">computed position</a> from 100 and then multiplying the result by two.</p>
   </dd>

  </dl>

 </li>

 <li><p>If the <a>WebVTT cue size</a> is less than <var>maximum size</var>, then let <var>size</var>
 be <a>WebVTT cue size</a>. Otherwise, let <var>size</var> be <var>maximum size</var>.</p></li>

 <li><p>If the <a>WebVTT cue writing direction</a> is <a lt="WebVTT cue horizontal writing
 direction">horizontal</a>, then let <var>width</var> be '<var>size</var>&#x2009;vw' and
 <var>height</var> be ''height/auto''. Otherwise, let <var>width</var> be ''width/auto'' and
 <var>height</var> be '<var>size</var>&#x2009;vh'. (These are CSS values used by the next section to
 set CSS properties for the rendering; ''vw'' and ''vh'' are CSS units.) [[!CSS-VALUES]]</p></li>

 <li>

  <p>Determine the value of <var>x-position</var> or <var>y-position</var> for <var>cue</var> as per
  the appropriate rules from the following list:</p>

  <dl class="switch">

   <dt>If the <a>WebVTT cue writing direction</a> is <a lt="WebVTT cue horizontal writing
   direction">horizontal</a></dt>
   <dd>
    <dl class="switch">
     <dt>If the <a lt="cue computed position alignment">computed position alignment</a> is <a
     lt="WebVTT cue position start alignment">start alignment</a></dt>
     <dd><p>Let <var>x-position</var> be the <a lt="cue computed position">computed
     position</a>.</p></dd>

     <dt>If the <a lt="cue computed position alignment">computed position alignment</a> is <a
     lt="WebVTT cue position middle alignment">middle alignment</a></dt>
     <dd><p>Let <var>x-position</var> be the <a lt="cue computed position">computed position</a>
     minus half of <var>size</var>.</p></dd>

     <dt>If the <a lt="cue computed position alignment">computed position alignment</a> is <a
     lt="WebVTT cue position end alignment">end alignment</a></dt>
     <dd><p>Let <var>x-position</var> be the <a lt="cue computed position">computed position</a>
     minus <var>size</var>.</p></dd>
    </dl>
   </dd>

   <dt>If the <a>WebVTT cue writing direction</a> is <a lt="WebVTT cue vertical growing left writing
   direction">vertical growing left</a> or <a lt="WebVTT cue vertical growing right writing
   direction">vertical growing right</a></dt>
   <dd>
    <dl class="switch">
     <dt>If the <a lt="cue computed position alignment">computed position alignment</a> is <a
     lt="WebVTT cue position start alignment">start alignment</a></dt>
     <dd><p>Let <var>y-position</var> be the <a lt="cue computed position">computed
     position</a>.</p></dd>

     <dt>If the <a lt="cue computed position alignment">computed position alignment</a> is <a
     lt="WebVTT cue position middle alignment">middle alignment</a></dt>
     <dd><p>Let <var>y-position</var> be the <a lt="cue computed position">computed position</a>
     minus half of <var>size</var>.</p></dd>

     <dt>If the <a lt="cue computed position alignment">computed position alignment</a> is <a
     lt="WebVTT cue position end alignment">end alignment</a></dt>
     <dd><p>Let <var>y-position</var> be the <a lt="cue computed position">computed position</a>
     minus <var>size</var>.</p></dd>
    </dl>
   </dd>

  </dl>

 </li>

 <li>

  <p>Determine the value of whichever of <var>x-position</var> or <var>y-position</var> is not yet
  calculated for <var>cue</var> as per the appropriate rules from the following list:</p>

  <dl class="switch">

   <dt>If the <a>WebVTT cue snap-to-lines flag</a> is not set</dt>
   <dd>
    <dl class="switch">

     <dt>If the <a>WebVTT cue writing direction</a> is <a lt="WebVTT cue horizontal writing
     direction">horizontal</a></dt>
     <dd><p>Let <var>y-position</var> be the <a lt="cue computed line">computed line</a>.</p></dd>

     <dt>If the <a>WebVTT cue writing direction</a> is <a lt="WebVTT cue vertical growing left
     writing direction">vertical growing left</a> or <a lt="WebVTT cue vertical growing right
     writing direction">vertical growing right</a></dt>
     <dd><p>Let <var>x-position</var> be the <a lt="cue computed line">computed line</a>.</p></dd>

    </dl>
   </dd>

   <dt>If the <a>WebVTT cue snap-to-lines flag</a> is set</dt>
   <dd>
    <dl class="switch">

     <dt>If the <a>WebVTT cue writing direction</a> is <a lt="WebVTT cue horizontal writing
     direction">horizontal</a></dt>
     <dd><p>Let <var>y-position</var> be 0.</p></dd>

     <dt>If the <a>WebVTT cue writing direction</a> is <a lt="WebVTT cue vertical growing left
     writing direction">vertical growing left</a> or <a lt="WebVTT cue vertical growing right
     writing direction">vertical growing right</a></dt>
     <dd><p>Let <var>x-position</var> be 0.</p></dd>

    </dl>
   </dd>

  </dl>

  <p class="note">These are not final positions, they are merely temporary positions used to
  calculate box dimensions below.</p>

 </li>

 <li>

  <p>If the <a>WebVTT cue snap-to-lines flag</a> is set, then run the appropriate steps from the
  following list:</p>

  <dl class="switch">

   <dt>If the <a>WebVTT cue writing direction</a> is <a lt="WebVTT cue horizontal writing
   direction">horizontal</a></dt>
   <dd>

    <ol>

     <li><p>Let <var>edge margin</var> be a user-agent-defined horizontal length, expressed as a
     percentage of the width of the <var>video</var>'s rendering area, which will be used to define
     a margin at the left and right edges of the video into which this cue will not be placed. In
     situations with overscan, this margin should be sufficient to place the cue within the
     title-safe area. In the absence of overscan, this value should be picked for aesthetics (to
     avoid text being aligned precisely on the left or right edge of the video, which can be
     ugly).</p></li>

     <li><p>If <var>x-position</var> is less than <var>edge margin</var> and the sum of
     <var>x-position</var> and <var>size</var> is more than <var>edge margin</var>, then increase
     <var>x-position</var> by <var>edge margin</var> and decrease <var>size</var> by the same
     amount.</p></li>

     <li><p>Let <var>right margin edge</var> be 100 minus <var>edge margin</var>.</p></li>

     <li><p>If <var>x-position</var> is less than <var>right margin edge</var>, and the sum of
     <var>x-position</var> and <var>size</var> is more than <var>right margin edge</var>, then
     decrease <var>size</var> by <var>edge margin</var>.</p></li>

    </ol>

   </dd>

   <dt>If the <a>WebVTT cue writing direction</a> is <a lt="WebVTT cue vertical growing left writing
   direction">vertical growing left</a> or <a lt="WebVTT cue vertical growing right writing
   direction">vertical growing right</a></dt>
   <dd>

    <ol>

     <li><p>Let <var>edge margin</var> be a user-agent-defined vertical length, expressed as a
     percentage of the height of the <var>video</var>'s rendering area, which will be used to define
     a margin at the top and bottom edges of the video into which this cue will not be placed. In
     situations with overscan, this margin should be sufficient to place the cue within the
     title-safe area. In the absence of overscan, this value should be picked for aesthetics (to
     avoid text being aligned precisely on the top or bottom edge of the video, which can be
     ugly).</p></li>

     <li><p>If <var>y-position</var> is less than <var>edge margin</var> and the sum of
     <var>y-position</var> and <var>size</var> is more than <var>edge margin</var>, then increase
     <var>y-position</var> by <var>edge margin</var> and decrease <var>size</var> by the same
     amount.</p></li>

     <li><p>Let <var>bottom margin edge</var> be 100 minus <var>edge margin</var>.</p></li>

     <li><p>If <var>y-position</var> is less than <var>bottom margin edge</var>, and the sum of
     <var>y-position</var> and <var>size</var> is more than <var>right margin edge</var>, then
     decrease <var>size</var> by <var>edge margin</var>.</p></li>

    </ol>

   </dd>

  </dl>

 </li>

 <li><p>Let <var>left</var> be '<var>x-position</var>&#x2009;vw' and <var>top</var> be
 '<var>y-position</var>&#x2009;vh'. (These are CSS values used by the next section to set CSS
 properties for the rendering; ''vw'' and ''vh'' are CSS units.) [[!CSS-VALUES]]</p></li>

 <li>

  <p>Apply the terms of the CSS specifications to <var>nodes</var> within the following constraints,
  thus obtaining a set of CSS boxes positioned relative to an initial containing block:
  [[!CSS21]]</p>

  <ul>

   <li><p>The <i>document tree</i> is the tree of <a lt="WebVTT Node Object">WebVTT Node Objects</a>
   rooted at <var>nodes</var>.</p></li>

   <li><p>For the purposes of processing by the CSS specification, <a lt="WebVTT Internal Node
   Object">WebVTT Internal Node Objects</a> are equivalent to elements with the same
   contents.</p></li>

   <li>For the purposes of processing by the CSS specification, <a lt="WebVTT Text Object">WebVTT
   Text Objects</a> are equivalent to {{Text}} nodes.</li>

   <li>No style sheets are associated with <var>nodes</var>. (The nodes are subsequently restyled
   using style sheets after their boxes are generated, as described below.)</li>

   <li>The children of the <var>nodes</var> must be wrapped in an anonymous box whose 'display'
   property has the value ''display/inline''. This is the <dfn>WebVTT cue background box</dfn>.</li>

   <li>Runs of children of <a lt="WebVTT Ruby Object">WebVTT Ruby Objects</a> that are not <a
   lt="WebVTT Ruby Text Object">WebVTT Ruby Text Objects</a> must be wrapped in anonymous boxes
   whose 'display' property has the value ''display/ruby-base''. [[!CSS3-RUBY]]</li>

   <li>Properties on <a lt="WebVTT Node Object">WebVTT Node Objects</a> have their values set as
   defined in the next section. (That section uses some of the variables whose values were
   calculated earlier in this algorithm.)</li>

   <li>
    Text runs must be wrapped according to the CSS
    line-wrapping rules, with the following additional constraints:

    <ul>

     <li>Regardless of the value of the 'white-space' property, lines must be wrapped at the edge of
     their containing blocks, even if doing so requires splitting a word where there is no line
     breaking opportunity. (Thus, normally text wraps as needed, but if there is a particularly long
     word, it does not overflow as it normally would in CSS, it is instead forcibly wrapped at the
     box's edge.)</li>

     <li>Regardless of the value of the 'white-space' property, any line breaks inserted by the user
     agent for the purposes of line wrapping must be placed so as to minimize &Delta; across each
     run of consecutive lines between preserved newlines in the source. &Delta; for a set of lines
     is defined as the sum over each line of the absolute of the difference between the line's
     length and the mean line length of the set.</li>

    </ul>

   </li>

   <li>The viewport (and initial containing block) is <var>video</var>'s rendering area.</li>

  </ul>

  <p>Let <var>boxes</var> be the boxes generated as descendants of the initial containing block,
  along with their positions.</p>

 </li>

 <li><p>If there are no line boxes in <var>boxes</var>, skip the remainder of these substeps for
 <var>cue</var>. The cue is ignored.</p></li>

 <li>

  <p>Adjust the positions of <var>boxes</var> according to the appropriate steps from the following
  list:</p>

  <dl class="switch">

   <dt>If <var>cue</var>'s <a>WebVTT cue snap-to-lines flag</a> is set</dt>

   <dd>

    <p>Many of the steps in this algorithm vary according to the <a>WebVTT cue writing
    direction</a>. Steps labeled "<strong>Horizontal</strong>" must be followed only when the
    <a>WebVTT cue writing direction</a> is <a lt="WebVTT cue horizontal writing
    direction">horizontal</a>, steps labeled "<strong>Vertical</strong>" must be followed when the
    <a>WebVTT cue writing direction</a> is either <a lt="WebVTT cue vertical growing left writing
    direction">vertical growing left</a> or <a lt="WebVTT cue vertical growing right writing
    direction">vertical growing right</a>, steps labeled "<strong>Vertical Growing Left</strong>"
    must be followed only when the <a>WebVTT cue writing direction</a> is <a lt="WebVTT cue vertical
    growing left writing direction">vertical growing left</a>, and steps labeled "<strong>Vertical
    Growing Right</strong>" must be followed only when the <a>WebVTT cue writing direction</a> is <a
    lt="WebVTT cue vertical growing right writing direction">vertical growing right</a>.</p>

    <ol>

     <li>

      <p><strong>Horizontal</strong>: Let <var>margin</var> be a user-agent-defined vertical length
      which will be used to define a margin at the top and bottom edges of the video into which cues
      will not be placed. In situations with overscan, this margin should be sufficient to place all
      cues within the title-safe area. In the absence of overscan, this value should be picked for
      aesthetics (to avoid text being aligned precisely on the bottom edge of the video, which can
      be ugly).</p>

      <p><strong>Vertical</strong>: Let <var>margin</var> be a user-agent-defined horizontal length
      which will be used to define a margin at the left and right edges of the video into which cues
      will not be placed. In situations with overscan, this margin should be sufficient to place all
      cues within the title-safe area. In the absence of overscan, this value should be picked for
      aesthetics (to avoid text being aligned precisely on the left or right edges of the video,
      which can be ugly).</p>

     </li>

     <li>

      <p><strong>Horizontal</strong>: Let <var>full dimension</var> be the height of
      <var>video</var>'s rendering area.</p>

      <p><strong>Vertical</strong>: Let <var>full dimension</var> be the width of <var>video</var>'s
      rendering area.</p>

      <p>These dimensions must not be adjusted for overscan. (The algorithm does that
      separately.)</p>

     </li>

     <li>

      <p>Let <var>max dimension</var> be
      <var>full&nbsp;dimension</var>&nbsp;-&nbsp;(2&nbsp;&times;&nbsp; <var>margin</var>).</p>

     </li>

     <li>

      <p><strong>Horizontal</strong>: Let <var>step</var> be the height of the first line box in
      <var>boxes</var>.</p>

      <p><strong>Vertical</strong>: Let <var>step</var> be the width of the first line box in
      <var>boxes</var>.</p>

     </li>

     <li><p>If <var>step</var> is zero, then jump to the step labeled <i>done positioning</i>
     below.</p></li>

     <li><p>Let <var>line</var> be <var>cue</var>'s <a lt="cue computed line">computed
     line</a>.</p></li>

     <li><p>Round <var>line</var> to an integer by adding 0.5 and then flooring it.</p></li>

     <li><p><strong>Vertical Growing Left</strong>: Add one to <var>line</var> then negate
     it.</p></li>

     <li><p>Let <var>position</var> be the result of multiplying <var>step</var> and <var>line
     offset</var>.</p></li>

     <li><p><strong>Vertical Growing Left</strong>: Decrease <var>position</var> by the width of the
     bounding box of the boxes in <var>boxes</var>, then increase <var>position</var> by
     <var>step</var>.</p></li>

     <li>

      <p>If <var>line</var> is less than zero then increase <var>position</var> by <var>max
      dimension</var>, and negate <var>step</var>.</p>

      <p>Otherwise, increase <var>position</var> by <var>margin</var>.</p>

     </li>

     <li>

      <p><strong>Horizontal</strong>: Move all the boxes in <var>boxes</var> down by the distance
      given by <var>position</var>.</p>

      <p><strong>Vertical</strong>: Move all the boxes in <var>boxes</var> right by the distance
      given by <var>position</var>.</p>

     </li>

     <li><p>Remember the position of all the boxes in <var>boxes</var> as their <var>specified
     position</var>.</p></li>

     <li>

      <p><strong>Horizontal</strong>: Let <var>title area</var> be a box that covers all of the
      <var>video</var>'s rendering area except for a height of <var>margin</var> at the top of the
      rendering area and a height of <var>margin</var> at the bottom of the rendering area.</p>

      <p><strong>Vertical</strong>: Let <var>title area</var> be a box that covers all of the
      <var>video</var>'s rendering area except for a width of <var>margin</var> at the left of the
      rendering area and a width of <var>margin</var> at the right of the rendering area.</p>

     </li>

     <li><p><i>Step loop</i>: If none of the boxes in <var>boxes</var> would overlap any of the
     boxes in <var>output</var>, and all of the boxes in <var>boxes</var> are entirely within the
     <var>title area</var> box, then jump to the step labeled <i>done positioning</i>
     below.</p></li>

     <li><p>Let <var>current position score</var> be the percentage of the area of the bounding box
     of the boxes in <var>boxes</var> that <!--overlaps the boxes in <var>output</var> (if any) or
     that--> is outside the <var>title area</var> box.</p></li>

     <li>

      <p><strong>Horizontal</strong>: If <var>step</var> is negative and the top of the first line
      box in <var>boxes</var> is now above the top of the <var>title area</var>, or if
      <var>step</var> is positive and the bottom of the first line box in <var>boxes</var> is now
      below the bottom of the <var>title area</var>, jump to the step labeled <i>switch
      direction</i>.</p>

      <p><strong>Vertical</strong>: If <var>step</var> is negative and the left edge of the first
      line box in <var>boxes</var> is now to the left of the left edge of the <var>title area</var>,
      or if <var>step</var> is positive and the right edge of the first line box in <var>boxes</var>
      is now to the right of the right edge of the <var>title area</var>, jump to the step labeled
      <i>switch direction</i>.</p>

     </li>

     <li>

      <p><strong>Horizontal</strong>: Move all the boxes in <var>boxes</var> down by the distance
      given by <var>step</var>. (If <var>step</var> is negative, then this will actually result in
      an upwards movement of the boxes in absolute terms.)</p>

      <p><strong>Vertical</strong>: Move all the boxes in <var>boxes</var> right by the distance
      given by <var>step</var>. (If <var>step</var> is negative, then this will actually result in a
      leftwards movement of the boxes in absolute terms.)</p>

     </li>

     <li><p>Jump back to the step labeled <i>step loop</i>.</p></li>

     <li><p><i>Switch direction</i>: If <var>switched</var> is true, then remove all the boxes in
     <var>boxes</var>, and jump to the step labeled <i>done positioning</i> below.</p></li>

     <li><p>Otherwise, move all the boxes in <var>boxes</var> back to their <var>specified
     position</var> as determined in the earlier step.</p></li>

     <li><p>Negate <var>step</var>.</p></li>

     <li><p>Set <var>switched</var> to true.</p></li>

     <li><p>Jump back to the step labeled <i>step loop</i>.</p></li>

    </ol>

   </dd>

   <dt>If <var>cue</var>'s <a>WebVTT cue snap-to-lines flag</a> is not set</dt>
   <dd>

    <ol>

     <li><p>Let <var>bounding box</var> be the bounding box of the boxes in
     <var>boxes</var>.</p></li>

     <li>
      <p>Run the appropriate steps from the following list:</p>

      <dl class="switch">
       <dt>If the <a>WebVTT cue writing direction</a> is <a lt="WebVTT cue horizontal writing
       direction">horizontal</a></dt>
       <dd>
        <dl class="switch">
         <dt>If the <a>WebVTT cue line alignment</a> is <a lt="WebVTT cue position middle
         alignment">middle alignment</a></dt>
         <dd><p>Move all the boxes in <var>boxes</var> up by half of the height of <var>bounding
         box</var>.</p></dd>

         <dt>If the <a>WebVTT cue line alignment</a> is <a lt="WebVTT cue position end
         alignment">end alignment</a></dt>
         <dd><p>Move all the boxes in <var>boxes</var> up by the height of <var>bounding
         box</var>.</p></dd>
        </dl>
       </dd>

       <dt>If the <a>WebVTT cue writing direction</a> is <a lt="WebVTT cue vertical growing left
       writing direction">vertical growing left</a> or <a lt="WebVTT cue vertical growing right
       writing direction">vertical growing right</a></dt>
       <dd>
        <dl class="switch">
         <dt>If the <a>WebVTT cue line alignment</a> is <a lt="WebVTT cue position middle
         alignment">middle alignment</a></dt>
         <dd><p>Move all the boxes in <var>boxes</var> left by half of the width of <var>bounding
         box</var>.</p></dd>

         <dt>If the <a>WebVTT cue line alignment</a> is <a lt="WebVTT cue position end
         alignment">end alignment</a></dt>
         <dd><p>Move all the boxes in <var>boxes</var> left by the width of <var>bounding
         box</var>.</p></dd>
        </dl>
       </dd>
      </dl>
     </li>

     <li><p>If none of the boxes in <var>boxes</var> would overlap any of the boxes in
     <var>output</var>, and all the boxes in <var>boxes</var> are within the <var>video</var>'s
     rendering area, then jump to the step labeled <i>done positioning</i> below.</p></li>

     <li><p>If there is a position to which the boxes in <var>boxes</var> can be moved while
     maintaining the relative positions of the boxes in <var>boxes</var> to each other such that
     none of the boxes in <var>boxes</var> would overlap any of the boxes in <var>output</var>, and
     all the boxes in <var>boxes</var> would be within the <var>video</var>'s rendering area, then
     move the boxes in <var>boxes</var> to the closest such position to their current position, and
     then jump to the step labeled <i>done positioning</i> below. If there are multiple such
     positions that are equidistant from their current position, use the highest one amongst them;
     if there are several at that height, then use the leftmost one amongst them.</p></li>

     <li><p>Otherwise, jump to the step labeled <i>done positioning</i> below. (The boxes will
     unfortunately overlap.)</p></li>

    </ol>

   </dd>

  </dl>

 </li>

 <li><p><i>Done positioning</i>: Return <var>boxes</var>.</p></li>

</ol>


<h4 id=applying-css-properties>Applying CSS properties to <a lt="WebVTT Node Object">WebVTT Node
Objects</a></h4>

<p>When following the <a>rules for updating the display of WebVTT text tracks</a>, user agents must
set properties of <a lt="WebVTT Node Object">WebVTT Node Objects</a> at the CSS user agent cascade
layer as defined in this section. [[!CSS21]]</p>

<p>Initialize the (root) <a>list of WebVTT Node Objects</a> with the following CSS settings:</p>

<ul>
 <li>the 'position' property must be set to ''position/absolute''</li>
 <li>the 'unicode-bidi' property must be set to ''unicode-bidi/plaintext''</li>
 <li>the 'direction' property must be set to <var>direction</var></li>
 <li>the 'writing-mode' property must be set to <var>writing-mode</var></li>
 <li>the 'top' property must be set to <var>top</var></li>
 <li>the 'left' property must be set to <var>left</var></li>
 <li>the 'width' property must be set to <var>width</var></li>
 <li>the 'height' property must be set to <var>height</var></li>
</ul>

<p>The variables <var>direction</var>, <var>writing-mode</var>, <var>top</var>, <var>left</var>,
<var>width</var>, and <var>height</var> are the values with those names determined by the <a>rules
for updating the display of WebVTT text tracks</a> for the <a>WebVTT cue</a> from whose <a lt="text
track cue text">text</a> the <a>list of WebVTT Node Objects</a> was constructed.</p>

<p>The 'text-align' property on the (root) <a>list of WebVTT Node Objects</a> must be set to the
value in the second cell of the row of the table below whose first cell is the value of the
corresponding <a lt="text track cue">cue</a>'s <a>WebVTT cue text alignment</a>:</p>

<table class="complex data">
 <thead>
  <tr>
   <th><a>WebVTT cue text alignment</a></th>
   <th>'text-align' value</th>
  </tr>
 </thead>
 <tbody>
  <tr>
   <td><a lt="WebVTT cue start alignment">Start alignment</a></td>
   <td>''text-align/start''</td>
  </tr>
  <tr>
   <td><a lt="WebVTT cue middle alignment">Middle alignment</a></td>
   <td>''text-align/center''</td>
  </tr>
  <tr>
   <td><a lt="WebVTT cue end alignment">End alignment</a></td>
   <td>''text-align/end''</td>
  </tr>
  <tr>
   <td><a lt="WebVTT cue left alignment">Left alignment</a></td>
   <td>''text-align/left''</td>
  </tr>
  <tr>
   <td><a lt="WebVTT cue right alignment">Right alignment</a></td>
   <td>''text-align/right''</td>
  </tr>
 </tbody>
</table>

<p>The 'font' shorthand property on the (root) <a>list of WebVTT Node Objects</a> must be set to
'5vh sans-serif'. [[!CSS3-RUBY]] [[!CSS-VALUES]]</p>

<p>The 'color' property on the (root) <a>list of WebVTT Node Objects</a> must be set to
'rgba(255,255,255,1)'. [[!CSS3-COLOR]]</p>

<p>The 'background' shorthand property on the <a>WebVTT cue background box</a> must be set to
'rgba(0,0,0,0.8)'. [[!CSS3-COLOR]]</p>

<p>The 'white-space' property on the (root) <a>list of WebVTT Node Objects</a> must be set to
''white-space/pre-line''. [[!CSS21]]</p>

<p>The 'font-style' property on <a lt="WebVTT Italic Object">WebVTT Italic Objects</a> must be set
to ''font-style/italic''.</p>

<p>The 'font-weight' property on <a lt="WebVTT Bold Object">WebVTT Bold Objects</a> must be set to
''font-weight/bold''.</p>

<p>The 'text-decoration' property on <a lt="WebVTT Underline Object">WebVTT Underline Objects</a>
must be set to ''text-decoration/underline''.</p>

<p>The 'display' property on <a lt="WebVTT Ruby Object">WebVTT Ruby Objects</a> must be set to
''display/ruby''. [[!CSS3-RUBY]]</p>

<p>The 'display' property on <a lt="WebVTT Ruby Text Object">WebVTT Ruby Text Objects</a> must be
set to ''display/ruby-text''. [[!CSS3-RUBY]]</p>

<p>Every <a>WebVTT region object</a> is initialized with the following CSS settings:</p>

<ul>
 <li>the 'position' property must be set to ''position/absolute''</li>
 <li>the 'writing-mode' property must be set to ''writing-mode/horizontal-tb''</li>
 <li>the 'background' shorthand property must be set to 'rgba(0,0,0,0.8)'</li>
 <li>the 'overflow-wrap' property must be set to ''overflow-wrap/break-word''</li>
 <li>the 'font' shorthand property must be set to 'calc(5.33vh/1.3) sans-serif'</li>
 <li>the 'line-height' shorthand property must be set to '5.33vh'</li>
 <li>the 'color' property must be set to 'rgba(255,255,255,1)'</li>
 <li>the 'overflow' property must be set to ''overflow/hidden''</li>
 <li>the 'width' property must be set to <var>width</var></li>
 <li>the 'min-height' property must be set to ''0px''</li>
 <li>the 'max-height' property must be set to <var>height</var></li>
 <li>the 'left' property must be set to <var>left</var></li>
 <li>the 'top' property must be set to <var>top</var></li>
 <li>the 'display' property must be set to ''display/inline-flex''</li>
 <li>the 'flex-flow' property must be set to ''flex-flow/column''</li>
 <li>the 'justify-content' property must be set to ''justify-content/flex-end''</li>
</ul>

<p>The variables <var>width</var>, <var>height</var>, <var>top</var>, and <var>left</var> are the
values with those names determined by the <a>rules for updating the display of WebVTT text
tracks</a> for the <a>WebVTT region</a> from which the <a>WebVTT region object</a> was
constructed.</p>

<p>The children of every <a>WebVTT region object</a> are further initialized with these CSS
settings:</p>
<ul>
 <li>the 'position' property must be set to ''position/relative''</li>
 <li>the 'unicode-bidi' property must be set to ''unicode-bidi/plaintext''</li>
 <li>the 'width' property must be set to ''width/auto''</li>
 <li>the 'height' property must be set to <var>height</var></li>
 <li>the 'left' property must be set to <var>left</var></li>
 <li>the 'text-align' property must be set as described for the root <a>List of WebVTT Node
 Objects</a> not part of a region</li>
</ul>

<p>All other non-inherited properties must be set to their initial values; inherited properties on
the root <a>list of WebVTT Node Objects</a> must inherit their values from the <a>media element</a>
for which the <a>WebVTT cue</a> is being rendered, if any. If there is no <a>media element</a> (i.e.
if the <a>text track</a> is being rendered for another media playback mechanism), then inherited
properties on the root <a>list of WebVTT Node Objects</a> and the <a lt="WebVTT region
object">WebVTT region objects</a> must take their initial values.</p>

<p>If there are style sheets that apply to the <a>media element</a> or other playback mechanism,
then they must be interpreted as defined in the next section.</p>


<h4 id=css-extensions>CSS extensions</h4>

<p>When a user agent is rendering one or more <a lt="WebVTT cue">WebVTT cues</a> according to the
<a>rules for updating the display of WebVTT text tracks</a>, <a lt="WebVTT Node Object">WebVTT Node
Objects</a> in the <a>list of WebVTT Node Objects</a> used in the rendering can be matched by
certain pseudo-selectors as defined below. These selectors can begin or stop matching individual <a
lt="WebVTT Node Object">WebVTT Node Objects</a> while a <a lt="text track cue">cue</a> is being
rendered, even in between applications of the <a>rules for updating the display of WebVTT text
tracks</a> (which are only run when the set of active cues changes). User agents that support the
pseudo-element described below must dynamically update renderings accordingly. When either
'white-space' or one of the properties corresponding to the 'font' shorthand (including
'line-height') changes value, then the <a>WebVTT cue</a>'s <a>text track cue display state</a> must
be emptied and the <a>text track</a>'s <a>rules for updating the text track rendering</a> must be
immediately rerun.</p>

<p>Pseudo-elements apply to elements that are matched by selectors. For the purpose of this section,
that element is the <i>matched element</i>. The pseudo-elements defined in the following sections
affect the styling of parts of <a lt="WebVTT cue">WebVTT cues</a> that are being rendered for the
<i>matched element</i>.</p>

<p class="note">If the <i>matched element</i> is not a <a element>video</a> element, the
pseudo-elements defined below won't have any effect according to this specification.</p>

<p>A CSS user agent that implements the <a lt="text track">text tracks</a> model must implement the
''::cue'' and ''::cue(<var>selector</var>)'' pseudo-elements, and the '':past'' and '':future''
pseudo-classes.</p>


<h5 id=the-cue-pseudo-element>The ''::cue'' pseudo-element</h5>

<p>The <dfn>::cue</dfn> pseudo-element (with no argument) matches any <a>list of WebVTT Node
Objects</a> constructed for the <i>matched element</i>, with the exception that the properties
corresponding to the 'background' shorthand must be applied to the <a>WebVTT cue background box</a>
rather than the <a>list of WebVTT Node Objects</a>.</p>

<p>The following properties apply to the ''::cue'' pseudo-element with no argument; other properties
set on the pseudo-element must be ignored:</p>

<ul class="brief">
 <li>'color'</li>
 <li>'opacity'</li>
 <li>'visibility'</li>
 <li>'text-decoration'</li>
 <li>'text-shadow'</li>
 <li>the properties corresponding to the 'background' shorthand</li>
 <li>the properties corresponding to the 'outline' shorthand</li>
 <li>the properties corresponding to the 'font' shorthand, including 'line-height'</li>
 <li>'white-space'</li>
 <!-- add more... -->
 <!-- definitely not: display, float, position, top, left, right, bottom, width, height, margin-top,
 margin-bottom, margin-left, margin-right, clip, clear, content, cursor, direction, max-height,
 min-height, max-width, min-width, orphans, overflow, page-break-*, text-align, unicode-bidi,
 widows, z-index -->
</ul>

<p>The <dfn>::cue(<var>selector</var>)</dfn> pseudo-element with an argument must have an argument
that consists of a CSS selector [[!SELECTORS4]]. It matches any <a>WebVTT Internal Node Object</a>
constructed for the <i>matched element</i> that also matches the given CSS selector, with the nodes
being treated as follows:</p>

<ul>

 <li><p>The <i>document tree</i> against which the selectors are matched is the tree of <a
 lt="WebVTT Node Object">WebVTT Node Objects</a> rooted at the <a>list of WebVTT Node Objects</a>
 for the cue.</p></li>

 <li><p><a lt="WebVTT Internal Node Object">WebVTT Internal Node Objects</a> are elements in the
 tree.</p></li>

 <li><a lt="WebVTT Leaf Node Object">WebVTT Leaf Node Objects</a> cannot be matched.</li>

 <li>

  <p>For the purposes of element type selectors, the names of <a lt="WebVTT Internal Node
  Object">WebVTT Internal Node Objects</a> are as given by the following table, where objects having
  the concrete class given in a cell in the first column have the name given by the second column of
  the same row:</p>

  <table class="complex data">
   <thead>
    <tr>
     <th>Concrete class</th>
     <th>Name</th>
    </tr>
   </thead>
   <tbody>
    <tr>
     <td><a lt="WebVTT Class Object">WebVTT Class Objects</a></td>
     <td><code>c</code></td>
    </tr>
    <tr>
     <td><a lt="WebVTT Italic Object">WebVTT Italic Objects</a></td>
     <td><code>i</code></td>
    </tr>
    <tr>
     <td><a lt="WebVTT Bold Object">WebVTT Bold Objects</a></td>
     <td><code>b</code></td>
    </tr>
    <tr>
     <td><a lt="WebVTT Underline Object">WebVTT Underline Objects</a></td>
     <td><code>u</code></td>
    </tr>
    <tr>
     <td><a lt="WebVTT Ruby Object">WebVTT Ruby Objects</a></td>
     <td><code>ruby</code></td>
    </tr>
    <tr>
     <td><a lt="WebVTT Ruby Text Object">WebVTT Ruby Text Objects</a></td>
     <td><code>rt</code></td>
    </tr>
    <tr>
     <td><a lt="WebVTT Voice Object">WebVTT Voice Objects</a></td>
     <td><code>v</code></td>
    </tr>
    <tr>
     <td><a lt="WebVTT Language Object">WebVTT Language Objects</a></td>
     <td><code>lang</code></td>
    </tr>
    <tr>
     <td>Other elements (specifically, <a lt="list of WebVTT Node Objects">lists of WebVTT Node
     Objects</a>)</td>
     <td>No explicit name.</td>
    </tr>
   </tbody>
  </table>

 </li>

 <li><p>For the purposes of element type and universal selectors, <a lt="WebVTT Internal Node
 Object">WebVTT Internal Node Objects</a> are considered as being in the namespace expressed as the
 empty string.</p></li>

 <li><p>For the purposes of attribute selector matching, <a lt="WebVTT Internal Node Object">WebVTT
 Internal Node Objects</a> have no attributes, except for <a lt="WebVTT Voice Object">WebVTT Voice
 Objects</a>, which have a single attribute named "<code>voice</code>" whose value is the value of
 the <a>WebVTT Voice Object</a>, and <a lt="WebVTT Language Object">WebVTT Language Objects</a>,
 which have a single attribute named "<code>lang</code>" whose value is the object's <a lt="WebVTT
 Node Object's applicable language">applicable language</a>.</p></li>

 <li><p>For the purposes of class selector matching, <a lt="WebVTT Internal Node Object">WebVTT
 Internal Node Objects</a> have the classes described as the <a>WebVTT Node Object's applicable
 classes</a>.</p></li> <!-- ok, this isn't especially well-defined, but the Selectors spec doesn't
 really give one much to go on here. -->

 <li><p>For the purposes of the <code lt="selector-lang">:lang()</code> pseudo-class, <a lt="WebVTT
 Internal Node Object">WebVTT Internal Node Objects</a> have the language described as the <a>WebVTT
 Node Object's applicable language</a>.</p></li>

 <li><p>For the purposes of ID selector matching, <a lt="list of WebVTT Node Objects">lists of
 WebVTT Node Objects</a> have the ID given by the cue's <a>text track cue identifier</a>, if
 any.</p></li>

</ul>

<p>The following properties apply to the ''::cue()'' pseudo-element with an argument:</p>

<ul class="brief">
 <li>'color'</li>
 <li>'opacity'</li>
 <li>'visibility'</li>
 <li>'text-decoration'</li>
 <li>'text-shadow'</li>
 <li>the properties corresponding to the 'background' shorthand</li>
 <li>the properties corresponding to the 'outline' shorthand</li>
 <li>properties relating to the transition and animation features</li>
 <!-- add more... -->
 <!-- but definitely not anything that affects dimensions of boxes, e.g. the 'font' shorthand's
 properties or 'white-space'; those are listed below instead -->
</ul>

<!--v2 Would be nice to support transitions that are directional, e.g. changing text fill colour or
shadow size of the start of a segment when the segment becomes "past", and having the change
propagate towards the end of the segment so that it reaches the end of the segment when the next
segment becomes "past". -->

<p>In addition, the following properties apply to the ''::cue()'' pseudo-element with an argument
when the selector does not contain the '':past'' and '':future'' pseudo-classes:</p>

<ul class="brief">
 <li>the properties corresponding to the 'font' shorthand, including 'line-height'</li>
 <li>'white-space'</li>
 <!-- add more... -->
 <!-- definitely not: display, float, position, top, left, right, bottom, width, height, margin-top,
 margin-bottom, margin-left, margin-right, clip, clear, content, cursor, direction, max-height,
 min-height, max-width, min-width, orphans, overflow, page-break-*, text-align, unicode-bidi,
 widows, z-index -->
</ul>

<p>Properties that do not apply must be ignored.</p>

<p>As a special exception, the properties corresponding to the 'background' shorthand, when they
would have been applied to the <a>list of WebVTT Node Objects</a>, must instead be applied to the
<a>WebVTT cue background box</a>.</p>


<h5 id=the-past-and-future-pseudo-classes>The '':past'' and '':future'' pseudo-classes</h5>

<p>The '':past'' and '':future'' pseudo-classes sometimes match <a lt="WebVTT Node Object">WebVTT
Node Objects</a>. [[!SELECTORS4]]</p>

<p>The <dfn>:past</dfn> pseudo-class only matches <a lt="WebVTT Node Object">WebVTT Node Objects</a>
that are <i>in the past</i>.</p>

<p>A <a>WebVTT Node Object</a> <var>c</var> is <dfn>in the past</dfn> if, in a pre-order,
depth-first traversal of the <a>WebVTT cue</a>'s <a>list of WebVTT Node Objects</a>, there exists a
<a>WebVTT Timestamp Object</a> whose value is less than the <a>current playback position</a> of the
<a>media element</a> that is the <i>matched element</i>, entirely after the <a>WebVTT Node
Object</a> <var>c</var>.</p>

<p>The <dfn>:future</dfn> pseudo-class only matches <a lt="WebVTT Node Object">WebVTT Node
Objects</a> that are <i>in the future</i>.</p>

<p>A <a>WebVTT Node Object</a> <var>c</var> is <dfn>in the future</dfn> if, in a pre-order,
depth-first traversal of the <a>WebVTT cue</a>'s <a>list of WebVTT Node Objects</a>, there exists a
<a>WebVTT Timestamp Object</a> whose value is greater than the <a>current playback position</a> of
the <a>media element</a> that is the <i>matched element</i>, entirely before the <a>WebVTT Node
Object</a> <var>c</var>.</p>


<h5 id=the-cue-region-pseudo-element>The ''::cue-region'' pseudo-element</h5>

<p>Pseudo-elements apply to elements that are matched by selectors. For the purpose of this section,
that element is the matched element. The pseudo-element defined below affects the styling of text
track regions that are being rendered for the matched element.</p>

<p class="note">If the matched element is not a video element, the pseudo-element defined below
won't have any effect according to this specification.</p>

<p>The <dfn>::cue-region</dfn> pseudo-element (with no argument) matches any list of <a lt="WebVTT
region object">WebVTT region objects</a> constructed for the <i>matched element</i>.</p>

<p>The same properties that apply to ''::cue'' apply to the ''::cue-region'' pseudo-element with no
argument; other properties set on the pseudo-element must be ignored.</p>

<p>When a user agent is rendering one or more text track regions according to the <a>rules for
updating the display of WebVTT text tracks</a>, <a lt="WebVTT region object">WebVTT region
objects</a> used in the rendering can be matched by the above pseudo-element. User agents that
support the pseudo-element must dynamically update renderings accordingly. When either 'white-space'
or one of the properties corresponding to the 'font' shorthand (including 'line-height') changes
value, then the text track cue display state of all the <a lt="WebVTT cue">WebVTT cues</a> in the
region must be emptied and the text track's rules for updating the text track rendering must be
immediately rerun.</p>

<p>A CSS user agent that implements the text tracks model must implement the ''::cue-region''
pseudo-element.</p>


<h2 id=api>API</h2>


<h3 id=the-vttcue-interface>The {{VTTCue}} interface</h3>

<p>The following interface is used to expose WebVTT cues in the DOM API:</p>

<pre class="idl">
enum AutoKeyword { "auto" };
enum DirectionSetting { "" /* horizontal */, "rl", "lr" };
enum LineAlignSetting { "start", "middle", "end" };
enum PositionAlignSetting { "start", "middle", "end", "auto" };
enum AlignSetting { "start", "middle", "end", "left", "right" };
[Constructor(double startTime, double endTime, DOMString text)]
interface VTTCue : TextTrackCue {
  attribute VTTRegion? region;
  attribute DirectionSetting vertical;
  attribute boolean snapToLines;
  attribute (double or AutoKeyword) line;
  attribute LineAlignSetting lineAlign;
  attribute (double or AutoKeyword) position;
  attribute PositionAlignSetting positionAlign;
  attribute double size;
  attribute AlignSetting align;
  attribute DOMString text;
  DocumentFragment getCueAsHTML();
};
</pre>

<dl class="note">

 <dt><var>cue</var> = new <a constructor lt="VTTCue()">VTTCue</a>( <var>startTime</var>,
 <var>endTime</var>, <var>text</var> )</dt>
 <dd>
  <p>Returns a new {{VTTCue}} object, for use with the {{TextTrack/addCue()}} method.</p>
  <p>The <var>startTime</var> argument sets the <a>text track cue start time</a>.</p>
  <p>The <var>endTime</var> argument sets the <a>text track cue end time</a>.</p>
  <p>The <var>text</var> argument sets the <a>text track cue text</a>.</p>
 </dd>

 <dt><var>cue</var> . {{VTTCue/region}}</dt>
 <dd>
  <p>Returns the {{VTTRegion}} object to which this cue belongs, if any, or null otherwise.</p>
  <p>Can be set.</p>
 </dd>

 <dt><var>cue</var> . {{VTTCue/vertical}} [ = <var>value</var> ]</dt>
 <dd>
  <p>Returns a string representing the <a>WebVTT cue writing direction</a>, as follows:</p>
  <dl class="switch">
   <dt>If it is <a lt="WebVTT cue horizontal writing direction">horizontal</a></dt>
   <dd><p>The empty string.</p></dd>
   <dt>If it is <a lt="WebVTT cue vertical growing left writing direction">vertical growing
   left</a></dt>
   <dd><p>The string "<code>rl</code>".</p></dd>
   <dt>If it is <a lt="WebVTT cue vertical growing right writing direction">vertical growing
   right</a></dt>
   <dd><p>The string "<code>lr</code>".</p></dd>
  </dl>
  <p>Can be set.</p>
 </dd>

 <dt><var>cue</var> . {{VTTCue/snapToLines}} [ = <var>value</var> ]</dt>
 <dd>
  <p>Returns true if the <a>WebVTT cue snap-to-lines flag</a> is set, false otherwise.</p>
  <p>Can be set.</p>
 </dd>

 <dt><var>cue</var> . {{VTTCue/line}} [ = <var>value</var> ]</dt>
 <dd>
  <p>Returns the <a>WebVTT cue line</a>. In the case of the value being <a lt="WebVTT cue line
  automatic">auto</a>, the string "<code>auto</code>" is returned.</p>
  <p>Can be set.</p>
 </dd>

 <dt><var>cue</var> . {{VTTCue/lineAlign}} [ = <var>value</var> ]</dt>
 <dd>
  <p>Returns a string representing the <a>WebVTT cue line alignment</a>, as follows:</p>
  <dl class="switch">
   <dt>If it is <a lt="WebVTT cue line start alignment">start alignment</a></dt>
   <dd><p>The string "<code>start</code>".</p></dd>
   <dt>If it is <a lt="WebVTT cue line middle alignment">middle alignment</a></dt>
   <dd><p>The string "<code>middle</code>".</p></dd>
   <dt>If it is <a lt="WebVTT cue line end alignment">end alignment</a></dt>
   <dd><p>The string "<code>end</code>".</p></dd>
  </dl>
  <p>Can be set.</p>
 </dd>

 <dt><var>cue</var> . {{VTTCue/position}} [ = <var>value</var> ]</dt>
 <dd>
  <p>Returns the <a>WebVTT cue position</a>. In the case of the value being <a lt="WebVTT cue
  automatic position">auto</a>, the string "<code>auto</code>" is returned.</p>
  <p>Can be set.</p>
 </dd>

 <dt><var>cue</var> . {{VTTCue/positionAlign}} [ = <var>value</var> ]</dt>
 <dd>
  <p>Returns a string representing the <a>WebVTT cue position alignment</a>, as follows:</p>
  <dl class="switch">
   <dt>If it is <a lt="WebVTT cue position start alignment">start alignment</a></dt>
   <dd><p>The string "<code>start</code>".</p></dd>
   <dt>If it is <a lt="WebVTT cue position middle alignment">middle alignment</a></dt>
   <dd><p>The string "<code>middle</code>".</p></dd>
   <dt>If it is <a lt="WebVTT cue position end alignment">end alignment</a></dt>
   <dd><p>The string "<code>end</code>".</p></dd>
   <dt>If it is <a lt="WebVTT cue position automatic alignment">automatic alignment</a></dt>
   <dd><p>The string "<code>auto</code>".</p></dd>
  </dl>
  <p>Can be set.</p>
 </dd>

 <dt><var>cue</var> . {{VTTCue/size}} [ = <var>value</var> ]</dt>
 <dd>
  <p>Returns the <a>WebVTT cue size</a>.</p>
  <p>Can be set.</p>
 </dd>

 <dt><var>cue</var> . {{VTTCue/align}} [ = <var>value</var> ]</dt>
 <dd>
  <p>Returns a string representing the <a>WebVTT cue text alignment</a>, as follows:</p>
  <dl class="switch">
   <dt>If it is <a lt="WebVTT cue start alignment">start alignment</a></dt>
   <dd><p>The string "<code>start</code>".</p></dd>
   <dt>If it is <a lt="WebVTT cue middle alignment">middle alignment</a></dt>
   <dd><p>The string "<code>middle</code>".</p></dd>
   <dt>If it is <a lt="WebVTT cue end alignment">end alignment</a></dt>
   <dd><p>The string "<code>end</code>".</p></dd>
   <dt>If it is <a lt="WebVTT cue left alignment">left alignment</a></dt>
   <dd><p>The string "<code>left</code>".</p></dd>
   <dt>If it is <a lt="WebVTT cue right alignment">right alignment</a></dt>
   <dd><p>The string "<code>right</code>".</p></dd>
  </dl>
  <p>Can be set.</p>
 </dd>

 <dt><var>cue</var> . {{VTTCue/text}} [ = <var>value</var> ]</dt>
 <dd>
  <p>Returns the <a>text track cue text</a> in raw unparsed form.</p>
  <p>Can be set.</p>
 </dd>

 <dt><var>fragment</var> = <var>cue</var> . <a method lt="getCueAsHTML()">getCueAsHTML</a>()</dt>
 <dd>
  <p>Returns the <a>text track cue text</a> as a {{DocumentFragment}} of <a spec=html>HTML
  elements</a> and other DOM nodes.</p>
 </dd>

</dl>

<p>The <dfn constructor for=VTTCue lt="VTTCue()">VTTCue(<var>startTime</var>, <var>endTime</var>,
<var>text</var>)</dfn> constructor, when invoked, must run the following steps:</p>

<ol>

 <li><p>Create a new <a>WebVTT cue</a>. Let <var>cue</var> be that <a>WebVTT cue</a>.</p></li>

 <li><p>Let <var>cue</var>'s <a>text track cue start time</a> be the value of the
 <var>startTime</var> argument, interpreted as a time in seconds.</p></li>

 <li><p>Let <var>cue</var>'s <a>text track cue end time</a> be the value of the <var>endTime</var>
 argument, interpreted as a time in seconds.</p></li>

 <li><p>Let <var>cue</var>'s <a>text track cue text</a> be the value of the <var>text</var>
 argument, and let the <a>rules for extracting the chapter title</a> be the <a>WebVTT rules for
 extracting the chapter title</a>.</p></li>

 <!-- default settings -->

 <li><p>Let <var>cue</var>'s <a>text track cue identifier</a> be the empty string.</p></li>

 <li><p>Let <var>cue</var>'s <a>text track cue pause-on-exit flag</a> be false.</p></li>

 <li><p>Let <var>cue</var>'s <a>WebVTT cue region</a> be null.</p></li>

 <li><p>Let <var>cue</var>'s <a>WebVTT cue writing direction</a> be <a lt="WebVTT cue horizontal
 writing direction">horizontal</a>.</p></li>

 <li><p>Let <var>cue</var>'s <a>WebVTT cue snap-to-lines flag</a> be true.</p></li>

 <li><p>Let <var>cue</var>'s <a>WebVTT cue line</a> be <a lt="WebVTT cue line
 automatic">auto</a>.</p></li>

 <li><p>Let <var>cue</var>'s <a>WebVTT cue line alignment</a> be <a lt="WebVTT cue line start
 alignment">start alignment</a>.</p></li>

 <li><p>Let <var>cue</var>'s <a>WebVTT cue position</a> be <a lt="WebVTT cue automatic
 position">auto</a>.</p></li>

 <li><p>Let <var>cue</var>'s <a>WebVTT cue position alignment</a> be <a lt="WebVTT cue position
 automatic alignment">auto</a>.</p></li>

 <li><p>Let <var>cue</var>'s <a>WebVTT cue size</a> be 100.</p></li>

 <li><p>Let <var>cue</var>'s <a>WebVTT cue text alignment</a> be <a lt="WebVTT cue middle
 alignment">middle alignment</a>.</p></li>

 <li><p>Return the {{VTTCue}} object representing <var>cue</var>.</p></li>

</ol>

<p>The <dfn attribute for=VTTCue>region</dfn> attribute, on getting, must return the {{VTTRegion}}
object representing the <a>WebVTT cue region</a> of the <a>WebVTT cue</a> that the {{VTTCue}} object
represents, if any; or null otherwise. On setting, the <a>WebVTT cue region</a> must be set to the
new value.</p>

<p>The <dfn attribute for=VTTCue>vertical</dfn> attribute, on getting, must return the string from
the second cell of the row in the table below whose first cell is the <a>WebVTT cue writing
direction</a> of the <a>WebVTT cue</a> that the {{VTTCue}} object represents:</p>

<table class="complex data">
 <thead>
  <tr>
   <th><a>WebVTT cue writing direction</a></th>
   <th>{{VTTCue/vertical}} value</th>
  </tr>
 </thead>
 <tbody>
  <tr>
   <td><a lt="WebVTT cue horizontal writing direction">Horizontal</a></td>
   <td>"<code lt=""></code>" (the empty string)</td>
  </tr>
  <tr>
   <td><a lt="WebVTT cue vertical growing left writing direction">Vertical growing left</a></td>
   <td>"<code lt="">rl</code>"</td>
  </tr>
  <tr>
   <td><a lt="WebVTT cue vertical growing right writing direction">Vertical growing right</a></td>
   <td>"<code lt="">lr</code>"</td>
  </tr>
 </tbody>
</table>

<p>On setting, the <a>WebVTT cue writing direction</a> must be set to the value given in the first
cell of the row in the table above whose second cell is a <a>case-sensitive</a> match for the new
value.</p>

<p>The <dfn attribute for=VTTCue>snapToLines</dfn> attribute, on getting, must return true if the
<a>WebVTT cue snap-to-lines flag</a> of the <a>WebVTT cue</a> that the {{VTTCue}} object represents
is set; or false otherwise. On setting, the <a>WebVTT cue snap-to-lines flag</a> must be set if the
new value is true, and must be unset otherwise.</p>

<p>The <dfn attribute for=VTTCue>line</dfn> attribute, on getting, must return the <a>WebVTT cue
line</a> of the <a>WebVTT cue</a> that the {{VTTCue}} object represents. The special value <a
lt="WebVTT cue line automatic">auto</a> must be represented as the string "<code>auto</code>". On
setting, the <a>WebVTT cue line</a> must be set to the new value; if the new value is the string
"<code lt="">auto</code>", then it must be interpreted as the special value <a lt="WebVTT cue line
automatic">auto</a>.</p>

<p>The <dfn attribute for=VTTCue>lineAlign</dfn> attribute, on getting, must return the string from
the second cell of the row in the table below whose first cell is the <a>WebVTT cue line
alignment</a> of the <a>WebVTT cue</a> that the {{VTTCue}} object represents:</p>

<table class="complex data">
 <thead>
  <tr>
   <th><a>WebVTT cue line alignment</a></th>
   <th>{{VTTCue/lineAlign}} value</th>
  </tr>
 </thead>
 <tbody>
  <tr>
   <td><a lt="WebVTT cue line start alignment">Start alignment</a></td>
   <td>"<code lt="">start</code>"</td>
  </tr>
  <tr>
   <td><a lt="WebVTT cue line middle alignment">Middle alignment</a></td>
   <td>"<code lt="">middle</code>"</td>
  </tr>
  <tr>
   <td><a lt="WebVTT cue line end alignment">End alignment</a></td>
   <td>"<code lt="">end</code>"</td>
  </tr>
 </tbody>
</table>

<p>On setting, the <a>WebVTT cue line alignment</a> must be set to the value given in the first cell
of the row in the table above whose second cell is a <a>case-sensitive</a> match for the new
value.</p>

<p>The <dfn attribute for=VTTCue>position</dfn> attribute, on getting, must return the <a>WebVTT cue
position</a> of the <a>WebVTT cue</a> that the {{VTTCue}} object represents. The special value <a
lt="WebVTT cue automatic position">auto</a> must be represented as the string "<code
lt="">auto</code>". On setting, if the new value is negative or greater than 100, then an
{{IndexSizeError}} exception must be thrown. Otherwise, the <a>WebVTT cue position</a> must be set
to the new value; if the new value is the string "<code lt="">auto</code>", then it must be
interpreted as the special value <a lt="WebVTT cue automatic position">auto</a>.</p>

<p>The <dfn attribute for=VTTCue>positionAlign</dfn> attribute, on getting, must return the string
from the second cell of the row in the table below whose first cell is the <a>WebVTT cue position
alignment</a> of the <a>WebVTT cue</a> that the {{VTTCue}} object represents:</p>

<table class="complex data">
 <thead>
  <tr>
   <th><a>WebVTT cue position alignment</a></th>
   <th>{{VTTCue/positionAlign}} value</th>
  </tr>
 </thead>
 <tbody>
  <tr>
   <td><a lt="WebVTT cue position start alignment">Start alignment</a></td>
   <td>"<code lt="">start</code>"</td>
  </tr>
  <tr>
   <td><a lt="WebVTT cue position middle alignment">Middle alignment</a></td>
   <td>"<code lt="">middle</code>"</td>
  </tr>
  <tr>
   <td><a lt="WebVTT cue position end alignment">End alignment</a></td>
   <td>"<code lt="">end</code>"</td>
  </tr>
  <tr>
   <td><a lt="WebVTT cue position automatic alignment">Automatic alignment</a></td>
   <td>"<code lt="">auto</code>"</td>
  </tr>
 </tbody>
</table>

<p>On setting, the <a>WebVTT cue position alignment</a> must be set to the value given in the first
cell of the row in the table above whose second cell is a <a>case-sensitive</a> match for the new
value.</p>

<p>The <dfn attribute for=VTTCue>size</dfn> attribute, on getting, must return the <a>WebVTT cue
size</a> of the <a>WebVTT cue</a> that the {{VTTCue}} object represents. On setting, if the new
value is negative or greater than 100, then an {{IndexSizeError}} exception must be thrown.
Otherwise, the <a>WebVTT cue size</a> must be set to the new value.</p>

<p>The <dfn attribute for=VTTCue>align</dfn> attribute, on getting, must return the string from the
second cell of the row in the table below whose first cell is the <a>WebVTT cue text alignment</a>
of the <a>WebVTT cue</a> that the {{VTTCue}} object represents:</p>

<table class="complex data">
 <thead>
  <tr>
   <th><a>WebVTT cue text alignment</a></th>
   <th>{{VTTCue/align}} value</th>
  </tr>
 </thead>
 <tbody>
  <tr>
   <td><a lt="WebVTT cue start alignment">Start alignment</a></td>
   <td>"<code lt="">start</code>"</td>
  </tr>
  <tr>
   <td><a lt="WebVTT cue middle alignment">Middle alignment</a></td>
   <td>"<code lt="">middle</code>"</td>
  </tr>
  <tr>
   <td><a lt="WebVTT cue end alignment">End alignment</a></td>
   <td>"<code lt="">end</code>"</td>
  </tr>
  <tr>
   <td><a lt="WebVTT cue left alignment">Left alignment</a></td>
   <td>"<code lt="">left</code>"</td>
  </tr>
  <tr>
   <td><a lt="WebVTT cue right alignment">Right alignment</a></td>
   <td>"<code lt="">right</code>"</td>
  </tr>
 </tbody>
</table>

<p>On setting, the <a>WebVTT cue text alignment</a> must be set to the value given in the first cell
of the row in the table above whose second cell is a <a>case-sensitive</a> match for the new
value.</p>

<p>The <dfn attribute for=VTTCue>text</dfn> attribute, on getting, must return the raw <a>text track
cue text</a> of the <a>WebVTT cue</a> that the {{VTTCue}} object represents. On setting, the <a>text
track cue text</a> must be set to the new value.</p>

<p>The <dfn method for=VTTCue>getCueAsHTML()</dfn> method must convert the <a>text track cue
text</a> to a {{DocumentFragment}} for the <a spec=html>responsible document</a> specified by the <a
spec=html>entry settings object</a> by applying the <a>WebVTT cue text DOM construction rules</a> to
the result of applying the <a>WebVTT cue text parsing rules</a> to the <a>text track cue
text</a>.</p>


<h3 id=the-vttregion-interface>The {{VTTRegion}} interface</h3>

<p>The following interface is used to expose WebVTT regions in the DOM API:</p>

<pre class="idl">
enum ScrollSetting { "" /* none */, "up" };
[Constructor]
interface VTTRegion {
  attribute double width;
  attribute long lines;
  attribute double regionAnchorX;
  attribute double regionAnchorY;
  attribute double viewportAnchorX;
  attribute double viewportAnchorY;
  attribute ScrollSetting scroll;
};
</pre>

<dl class="note">

 <dt><var>region</var> = new <a constructor lt="VTTRegion()">VTTRegion</a>()</dt>
 <dd>
  <p>Returns a new {{VTTRegion}} object.</p>
 </dd>

 <dt><var>region</var> . {{VTTRegion/width}}</dt>
 <dd>
  <p>Returns the WebVTT region width as a percentage of the video width. Can be set. Throws an
  {{IndexSizeError}} if the new value is not in the range 0..100.</p>
 </dd>

 <dt><var>region</var> . {{VTTRegion/lines}}</dt>
 <dd>
  <p>Returns the text track region height as a number of lines. Can be set.</p>
 </dd>

 <dt><var>region</var> . {{VTTRegion/regionAnchorX}}</dt>
 <dd>
  <p>Returns the WebVTT region anchor X offset as a percentage of the region width. Can be set.
  Throws an {{IndexSizeError}} if the new value is not in the range 0..100.</p>
 </dd>

 <dt><var>region</var> . {{VTTRegion/regionAnchorX}}</dt>
 <dd>
  <p>Returns the WebVTT region anchor Y offset as a percentage of the region height. Can be set.
  Throws an {{IndexSizeError}} if the new value is not in the range 0..100.</p>
 </dd>

 <dt><var>region</var> . {{VTTRegion/viewportAnchorX}}</dt>
 <dd>
  <p>Returns the WebVTT region viewport anchor X offset as a percentage of the video width. Can be
  set. Throws an {{IndexSizeError}} if the new value is not in the range 0..100.</p>
 </dd>

 <dt><var>region</var> . {{VTTRegion/viewportAnchorY}}</dt>
 <dd>
  <p>Returns the WebVTT region viewport anchor Y offset as a percentage of the video height. Can be
  set. Throws an {{IndexSizeError}} if the new value is not in the range 0..100.</p>
 </dd>

 <dt><var>region</var> . {{VTTRegion/scroll}}</dt>
 <dd>
  <p>Returns a string representing the <a>WebVTT region scroll</a> as follows:</p>
  <dl class="switch">
   <dt>If it is unset</dt>
   <dd><p>The empty string.</p></dd>
   <dt>If it is up</dt>
   <dd><p>The string "<code lt="">up</code>".</p></dd>
  </dl>
  <p>Can be set.</p>
 </dd>
</dl>

<p>The <dfn constructor for=VTTRegion>VTTRegion()</dfn> constructor, when invoked, must run the
following steps:</p>

<ol>
 <li><p>Create a new <a>WebVTT region</a>. Let <var>region</var> be that <a>WebVTT
 region</a>.</p></li>

 <!-- default settings -->
 <li><p>Let <var>region</var>'s <a>WebVTT region identifier</a> be the empty string.</p></li>

 <li><p>Let <var>region</var>'s <a>WebVTT region width</a> be 100.</p></li>

 <li><p>Let <var>region</var>'s <a>WebVTT region lines</a> be 3.</p></li>

 <li><p>Let <var>region</var>'s <a lt="WebVTT region anchor">text track region regionAnchorX</a> be
 0.</p></li>

 <li><p>Let <var>region</var>'s <a lt="WebVTT region anchor">text track region regionAnchorY</a> be
 100.</p></li>

 <li><p>Let <var>region</var>'s <a lt="WebVTT region viewport anchor">text track region
 viewportAnchorX</a> be 0.</p></li>

 <li><p>Let <var>region</var>'s <a lt="WebVTT region viewport anchor">text track region
 viewportAnchorY</a> be 100.</p></li>

 <li><p>Let <var>region</var>'s <a>WebVTT region scroll</a> be the empty string.</p></li>

 <li><p>Return the {{VTTRegion}} object representing <var>region</var>.</p></li>
</ol>

<p>The <dfn attribute for=VTTRegion>width</dfn> attribute, on getting, must return the <a>WebVTT
region width</a> of the <a>WebVTT region</a> that the {{VTTRegion}} object represents, in percent of
video width. On setting, the <a>WebVTT region width</a> must be set to the new value, interpreted as
a percentage.</p>

<p>The <dfn attribute for=VTTRegion>lines</dfn> attribute, on getting, must return the <a>WebVTT
region lines</a> of the <a>WebVTT region</a> that the {{VTTRegion}} object represents, as number of
lines. On setting, the <a>WebVTT region lines</a> must be set to the new value, interpreted as a
number of lines.</p>

<p>The <dfn attribute for=VTTRegion>regionAnchorX</dfn> attribute, on getting, must return the
<a>WebVTT region anchor</a> X offset of the <a>WebVTT region</a> that the {{VTTRegion}} object
represents, in percent of region width. On setting, the <a>WebVTT region anchor</a> X distance must
be set to the new value, interpreted as a percentage.</p>

<p>The <dfn attribute for=VTTRegion>regionAnchorY</dfn> attribute, on getting, must return the
<a>WebVTT region anchor</a> Y offset of the <a>WebVTT region</a> that the {{VTTRegion}} object
represents, in percent of region height. On setting, the <a>WebVTT region anchor</a> Y distance must
be set to the new value, interpreted as a percentage.</p>

<p>The <dfn attribute for=VTTRegion>viewportAnchorX</dfn> attribute, on getting, must return the
<a>WebVTT region viewport anchor</a> X offset of the <a>WebVTT region</a> that the {{VTTRegion}}
object represents, in percent of video width. On setting, the <a>WebVTT region viewport anchor</a> X
distance must be set to the new value, interpreted as a percentage.</p>

<p>The <dfn attribute for=VTTRegion>viewportAnchorY</dfn> attribute, on getting, must return the
<a>WebVTT region viewport anchor</a> Y offset of the <a>WebVTT region</a> that the {{VTTRegion}}
object represents, in percent of video height. On setting, the <a>WebVTT region viewport anchor</a>
Y distance must be set to the new value, interpreted as a percentage.</p>

<p>The <dfn attribute for=VTTRegion>scroll</dfn> attribute, on getting, must return the string from
the second cell of the row in the table below whose first cell is the <a>WebVTT region scroll</a>
setting of the <a>WebVTT region</a> that the {{VTTRegion}} object represents:</p>
<table class="complex data">
 <thead>
  <tr>
   <th><a>WebVTT region scroll</a></th>
   <th>{{VTTRegion/scroll}} value</th>
  </tr>
 </thead>
 <tbody>
  <tr>
   <td><a lt="WebVTT region scroll none">None</a></td>
   <td>"<code lt=""></code>" (the empty string)</td>
  </tr>
  <tr>
   <td><a lt="WebVTT region scroll up">Up</a></td>
   <td>"<code lt="">up</code>"</td>
  </tr>
 </tbody>
</table>

<p>On setting, the <a>WebVTT region scroll</a> must be set to the value given on the first cell of
the row in the table above whose second cell is a <a>case-sensitive</a> match for the new value.</p>


<h2 id=iana>IANA considerations</h2>
<!-- http://www.w3.org/2002/06/registering-mediatype.html -->


<h3 id=iana-text-vtt><dfn><code>text/vtt</code></dfn></h3>

<p>This registration is for community review and will be submitted to the IESG for review, approval,
and registration with IANA.</p>

<!-- To: ietf-types@iana.org Subject: Registration of media type text/vtt -->

<dl>
 <dt>Type name:</dt>
 <dd>text</dd>
 <dt>Subtype name:</dt>
 <dd>vtt</dd>
 <dt>Required parameters:</dt>
 <dd>No parameters</dd>
 <dt>Optional parameters:</dt>
 <dd>No parameters</dd>
 <dt>Encoding considerations:</dt>
 <dd>8bit (always UTF-8)</dd>
 <dt>Security considerations:</dt>
 <dd>
  <p>Text track files themselves pose no immediate risk unless sensitive information is included
  within the data. Implementations, however, are required to follow specific rules when processing
  text tracks, to ensure that certain origin-based restrictions are honored. Failure to correctly
  implement these rules can result in information leakage, cross-site scripting attacks, and the
  like.</p>
 </dd>
 <dt>Interoperability considerations:</dt>
 <dd>
  <p>Rules for processing both conforming and non-conforming content are defined in this
  specification.</p>
 </dd>
 <dt>Published specification:</dt>
 <dd>
  This document is the relevant specification.
 </dd>
 <dt>Applications that use this media type:</dt>
 <dd>
  Web browsers and other video players.
 </dd>
 <dt>Additional information:</dt>
 <dd>
  <dl>
   <dt>Magic number(s):</dt>
   <dd>
    <p>WebVTT files all begin with one of the following byte sequences (where "EOF" means the end of
    the file):</p>
    <ul class="brief">
     <li> EF BB BF 57 45 42 56 54 54 0A </li>
     <li> EF BB BF 57 45 42 56 54 54 0D </li>
     <li> EF BB BF 57 45 42 56 54 54 20 </li>
     <li> EF BB BF 57 45 42 56 54 54 09 </li>
     <li> EF BB BF 57 45 42 56 54 54 EOF </li>
     <li> 57 45 42 56 54 54 0A </li>
     <li> 57 45 42 56 54 54 0D </li>
     <li> 57 45 42 56 54 54 20 </li>
     <li> 57 45 42 56 54 54 09 </li>
     <li> 57 45 42 56 54 54 EOF </li>
    </ul>
    <p class="note">(An optional UTF-8 BOM, the ASCII string "<code lt="">WEBVTT</code>", and
    finally a space, tab, line break, or the end of the file.)</p>
   </dd>
   <dt>File extension(s):</dt>
   <dd>"<code lt="">vtt</code>"</dd>
   <dt>Macintosh file type code(s):</dt>
   <dd>No specific Macintosh file type codes are recommended for this type.</dd>
  </dl>
 </dd>
 <dt>Person &amp; email address to contact for further information:</dt>
 <dd>Silvia Pfeiffer &lt;silviapfeiffer1@gmail.com></dd>
 <dt>Intended usage:</dt>
 <dd>Common</dd>
 <dt>Restrictions on usage:</dt>
 <dd>No restrictions apply.</dd>
 <dt>Authors:</dt>
 <dd>Silvia Pfeiffer &lt;silviapfeiffer1@gmail.com>, Simon Pieters &lt;simonp@opera.com>, Philip
 J&auml;genstedt &lt;philipj@opera.com>, Ian Hickson &lt;ian@hixie.ch></dd>
 <dt>Change controller:</dt>
 <dd>W3C</dd>
</dl>

<p>Fragment identifiers have no meaning with <code>text/vtt</code> resources.</p>


<h2 class="no-num" id=acknowledgements>Acknowledgements</h2>

<p>Thanks to the SubRip community, including in particular Zuggy and ai4spam, for their work on the
SubRip software program whose SRT file format was used as the basis for the WebVTT text track file
format.</p>

<p>Thanks to Ian Hickson and many others for their work on the HTML standard, where WebVTT was
originally specified. [[!HTML]]</p>

<p>
 Further thanks to:
 Glenn Adams,
 Victor C&#259;rbune,
 Eric Carlson,
 Anna Cavender,
 Cyril Concolato,
 Rick Eyre,
 fantasai,
 John Foliot,
 Lawrence Forooghian,
 Ralph Giles,
 Loretta Guarino Reid,
 Kyle Huey,
 Richard Ishida,
 Anne van Kesteren,
 Glenn Maynard,
 Ronny Mennerich,
 Ms2ger,
 Frank Olivier,
 Giuseppe Pascale,
 Addison Phillips;
 Caitlin Potter,
 Brian Quass,
 David Singer,
 Andreas Tai.
</p>