Creation of code to load LibriVox and format for the python_speech_features package #46

kdavis-mozilla · 2016-10-05T12:59:31Z

The code for the TED corpus is in the fork issue2. One should take this code as a starting point.

reuben · 2016-10-06T23:14:55Z

LibriVox doesn't have properly aligned transcriptions. Is figuring out a solution for that within the scope of this issue?

reuben · 2016-10-06T23:18:04Z

Another alternative would be using existing corpuses (corpi?) extracted from LibriVox like LibriSpeech: http://www.openslr.org/12/

kdavis-mozilla · 2016-10-07T02:55:05Z

Have you looked at the TED code in issue 2?

reuben · 2016-10-07T03:02:17Z

Yep. I started writing a bunch of code for downloading and formatting the LibriVox data directly, from the Internet Archive, but after reading the LibriSpeech paper I learned that proper alignment and segmentation is a very large effort and we should probably just use that corpus directly, so I'm gonna do that.

kdavis-mozilla · 2016-10-07T10:46:59Z

Before you go off on a wild goose chase, please define what you mean by "proper alignment".

kdavis-mozilla · 2016-10-07T10:51:21Z

Also did you read and understand the Deep Speech paper?

The Deep Speech paper and our code under master uses the CTC algorithm which does not require "alignment" in the sense used for HMM STT engines.

kdavis-mozilla · 2016-10-07T10:55:46Z

Using LibriSpeech directly is fine, it's actually what I expected form the start, but do not spend time trying to "align" the corpus in the sense used for HMM STT engines. CTC does not require such "alignment".

reuben · 2016-10-07T11:33:04Z

Also did you read and understand the Deep Speech paper?

Not as well as I thought I had, evidently! Either that or I'm just abusing the jargon.

I was under the impression that the transcriptions need to have a minimal resemblance to the audio, which the raw LibriVox data, by default, doesn't have. That's as far as my definition of "alignment" went: skipping the initial audio disclaimers, skipping the license header on the Project Gutenberg files, etc.

In any case, we've ended up on the same page, albeit in my case that included a few bumps along the way :P

…, #46, #47, and #48

…, #46, #47, and #48 Merge of pull requests #49, #50, and #52. Fixes issues #2, #4, #11, #12, #46, #47, and #48

…issues mozilla#2, mozilla#4, mozilla#11, mozilla#12, mozilla#46, mozilla#47, and mozilla#48

lock · 2019-01-04T02:58:17Z

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

kdavis-mozilla added this to the Integration of LibriVox Corpus into DeepSpeech milestone Oct 5, 2016

kdavis-mozilla assigned reuben Oct 5, 2016

kdavis-mozilla mentioned this issue Oct 5, 2016

Fixed issue #2 and issue #4 #50

Closed

kdavis-mozilla mentioned this issue Oct 9, 2016

Fix #48; separation of training and validation #49

Merged

kdavis-mozilla added a commit that referenced this issue Oct 13, 2016

Merge of pull requests #49, #50, and #52. Fixes issues #2, #4, #11, #12…

a3abc9d

…, #46, #47, and #48

kdavis-mozilla added a commit that referenced this issue Oct 13, 2016

Merge of pull requests #49, #50, and #52. Fixes issues #2, #4, #11, #12…

84c030a

…, #46, #47, and #48 Merge of pull requests #49, #50, and #52. Fixes issues #2, #4, #11, #12, #46, #47, and #48

kdavis-mozilla closed this as completed Oct 13, 2016

andrenatal pushed a commit to andrenatal/DeepSpeech that referenced this issue Oct 19, 2016

Merge of pull requests mozilla#49, mozilla#50, and mozilla#52. Fixes …

ff820dc

…issues mozilla#2, mozilla#4, mozilla#11, mozilla#12, mozilla#46, mozilla#47, and mozilla#48

lock bot locked and limited conversation to collaborators Jan 4, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Creation of code to load LibriVox and format for the python_speech_features package #46

Creation of code to load LibriVox and format for the python_speech_features package #46

kdavis-mozilla commented Oct 5, 2016

reuben commented Oct 6, 2016

reuben commented Oct 6, 2016

kdavis-mozilla commented Oct 7, 2016

reuben commented Oct 7, 2016

kdavis-mozilla commented Oct 7, 2016

kdavis-mozilla commented Oct 7, 2016

kdavis-mozilla commented Oct 7, 2016

reuben commented Oct 7, 2016

lock bot commented Jan 4, 2019

Creation of code to load LibriVox and format for the python_speech_features package #46

Creation of code to load LibriVox and format for the python_speech_features package #46

Comments

kdavis-mozilla commented Oct 5, 2016

reuben commented Oct 6, 2016

reuben commented Oct 6, 2016

kdavis-mozilla commented Oct 7, 2016

reuben commented Oct 7, 2016

kdavis-mozilla commented Oct 7, 2016

kdavis-mozilla commented Oct 7, 2016

kdavis-mozilla commented Oct 7, 2016

reuben commented Oct 7, 2016

lock bot commented Jan 4, 2019