Audio corpora prep should allow for WAV files and mix multichannels #13

tshastry · 2018-10-15T18:24:37Z

The current audio corpora prep seems to only work on SPH files. In addition, the current description says this:

 Note that filenames with hyphens will be sanitized to underscores and that audio files will be forced to single channel, 16 kHz, signed PCM format. If two channels are present, only the first will be used.

Many corpora come in WAV files instead of SPH files, and many also have two unmixed channels that need to be mixed to properly account for all audio.

The text was updated successfully, but these errors were encountered:

This PR enables audio corpus objects to accept SPH, WAV, and MP3 files from directories. It still expects file names to match between audio files and transcript STM files. Further, this PR mixes stereo channels down to mono instead of discarding extra channels. Fixes #13

tshastry assigned mgoldey Oct 15, 2018

mgoldey mentioned this issue Oct 15, 2018

generalize audio corpus prep #15

Merged

mgoldey closed this as completed in #15 Oct 17, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Audio corpora prep should allow for WAV files and mix multichannels #13

Audio corpora prep should allow for WAV files and mix multichannels #13

tshastry commented Oct 15, 2018

Audio corpora prep should allow for WAV files and mix multichannels #13

Audio corpora prep should allow for WAV files and mix multichannels #13

Comments

tshastry commented Oct 15, 2018