Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Audiobooks and WPUB #78

Closed
llemeurfr opened this issue Oct 3, 2017 · 15 comments
Closed

Audiobooks and WPUB #78

llemeurfr opened this issue Oct 3, 2017 · 15 comments

Comments

@llemeurfr
Copy link
Contributor

It is important to integrate audiobooks (i.e audio only, no text) in the work of this WG, something that has been unsuccessfully tried with EPUB 3.

An audiobook, as currently produced, is made of a sequence of audio files (often mp3) with a specific naming convention. mp3 files can be enriched by some metadata (ID3 tags; https://en.wikipedia.org/wiki/ID3) and in proprietary cases (e.g. Apple M4B format) some metadata (a title, author and genre in the M4B case) can be added to the whole set.

The Daisy Consortium has released guidelines on how to structure an EPUB 3 audiobook in http://www.daisy.org/ties/navigable-audio-only-epub3-guidelines, but it seems that only few a11y orgs are producing such publications, and this practice was not adopted by mainstream publishers.

This issue can help finding which metadata would be useful for audiobooks available as Web Publications and EPUB 4 files.

@HadrienGardeur
Copy link

Aside purely from metadata, there are also a number of other questions that we need to raise:

  • can we use audio files in our list of primary resources for a WP? This would be the most logical thing to do for an audiobook
  • for audiobooks where a single audio track is used, can we rely on navigation (table of contents) to provide chapter level metadata?
  • can we rely on media fragments for such navigation ?

For reference, here's how this could look like using a syntax inspired by the Readium effort:

Multiple tracks

"primary": [
  {"href": "chapter1.mp3", "type": "audio/mpeg", "title": "Chapter 1 - The Beginning"},
  {"href": "chapter2.mp3", "type": "audio/mpeg", "title": "Chapter 2 - Continued"},
  {"href": "chapter3.mp3", "type": "audio/mpeg", "title": "Chapter 3 - The End"}
]

Single track plus navigation

"primary": [
  {"href": "audiobook.mp3", "type": "audio/mpeg"}
],
"navigation": [
  {"href": "audiobook.mp3#t=0,587", "type": "audio/mpeg", "title": "Chapter 1 - The Beginning"},
  {"href": "audiobook.mp3#t=588,1274", "type": "audio/mpeg", "title": "Chapter 2 - Continued"},
  {"href": "audiobook.mp3#t=1275", "type": "audio/mpeg", "title": "Chapter 3 - The End"}
]

@llemeurfr
Copy link
Contributor Author

+1 for the support of audio files as primary resources in a WP.
The json snippet proposed by @HadrienGardeur is up to the point: easy to create, easy to process, with a structure that is compatible with text-first publications, and reuses W3C standards = media fragments.

@danielweck
Copy link
Member

danielweck commented Oct 3, 2017

The navigation requirements for DAISY talking books and EPUB3 Media Overlays (full-text full-audio and audio-only) are pretty much the same: coarse or fine, flat or hierarchical (depending on the authors needs) headings structure to match chaptering / sectioning in the main content. Special list of landmarks, for quick access to tables, figures, print page breaks, etc.
Advanced features include:
Escapability, to identify complex structures that readers / listeners can jump out of during the reading / playback experience. Skippability, to identify optional fragments of content that users can skip (e.g. footnotes, page breaks, etc.) during the default "linear" reading order.

@danielweck
Copy link
Member

@HadrienGardeur
Copy link

Any update post-TPAC on audiobooks? cc @iherman

@iherman
Copy link
Member

iherman commented Nov 15, 2017

Not specifically on audio books, but we had a session on media overlays with further steps planned.

@HadrienGardeur
Copy link

We need to at least decide if pointing directly to audio files will be OK in the context of a WP.

@iherman
Copy link
Member

iherman commented Nov 15, 2017

In some sense, the whole discussion on the entry page came out of this issue (and the related issue raised by Samuel on mangas/BDs). I am not sure it was documented, so I give my own interpretation only: it should be possible to have a WP that includes audio or images files only for the 'content' (that is the authors' choice) but, mainly for "starting up" a WP it is important to have some entry point that today's browsers also understand.

@HadrienGardeur
Copy link

So, for audiobooks and comics, that "entry point" could serve some sort of viewer as well?

@iherman
Copy link
Member

iherman commented Nov 15, 2017

Could. Or could trigger an implementation that is the general viewer. The details are still fuzzy, of course.

@HadrienGardeur
Copy link

OK, but the door is open to the idea of having audio or images for the "content" as long as the entry point remains HTML, that's good to know.

@TzviyaSiegman
Copy link
Contributor

assigned to Wendy Reid

@clapierre
Copy link

I am working with a group building EPUBs for countries with limited TTS support for their native language and the books they are creating are children's audio books with MediaOverlays but the images only have the ability for alt text which is textual only and therefor unable to be read to the child. We need to also support media overlays for an alt/extended description of the images.

@danielweck
Copy link
Member

@clapierre I think your comment might be even more relevant in this issue:
#134

@TzviyaSiegman
Copy link
Contributor

This issue is addressed by subsequent issues.

@iherman iherman closed this as completed Sep 24, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

8 participants