scipy.io.wavfile should be able to read 24 bit signed wave (Trac #1405) #1930

scipy-gitbot · 2013-04-25T17:35:09Z

Original ticket http://projects.scipy.org/scipy/ticket/1405 on 2011-03-10 by trac user tuxicoman, assigned to unknown.

So far the scipy.io.wavfile module cannot read 24 bit wave files whereas it's quite a popular format.

After looking to source code, it seems that because there i no numpy.int24 type; the code fails on 24bit wave file. The function crashes on following line :

data = numpy.fromfile(fid, dtype=dtype, count=size//bytes)

with dtype = "<i3" :-(

Because 24bit is stored on 3 bytes, scipy.io.wavfile needs additional code to slice and convert filebytes into an 32bit type array for example. Unfortunately, even after googling, I'm not enough talented in python/numpy/wavfile to code this. So i file this ticket for someone else to code it. Thanks.

The text was updated successfully, but these errors were encountered:

scipy-gitbot · 2013-04-25T17:35:10Z

@stefanv wrote on 2011-03-10

Could you also attach a (very short) .wav file of the required format? That would help with debugging. Thanks!

scipy-gitbot · 2013-04-25T17:35:15Z

Attachment added by trac user tuxicoman on 2011-03-10: 24test.wav

scipy-gitbot · 2013-04-25T17:35:15Z

trac user tuxicoman wrote on 2011-03-10

Sure, this is one i've done with audacity.
16bit and 32bit versions of this file (created through audacity also) work.

scipy-gitbot · 2013-04-25T17:35:15Z

Milestone changed to Unscheduled by @WarrenWeckesser on 2011-12-29

WarrenWeckesser · 2013-11-14T05:08:01Z

The gist https://gist.github.com/WarrenWeckesser/7461696 contains a couple functions for generating wav files with a 3 byte sample width. Something like this could be used for unit tests if support for 24 bit files is ever added.

WarrenWeckesser · 2013-11-14T05:20:44Z

For anyone who needs to read 24 bit files: I created a gist containing the function readwav that can read (uncompressed) 24 bit wav files: https://gist.github.com/WarrenWeckesser/7461781

Update: The gist now includes a function called writewav24 for creating 24 bit wav files.

Update 2: I moved the gist to a regular github repository at https://github.com/WarrenWeckesser/wavio

matthew-brett · 2014-07-31T18:04:42Z

Warren - OK to add this to scipy?

WarrenWeckesser · 2014-07-31T20:16:43Z

@matthew-brett: Sure, that would be great.

damnsavage · 2015-06-17T14:02:56Z

Can we add this code from Warren? This is essentially what is required

data = numpy.fromfile(fid, dtype='u1', count=size) # first read byte per byte
a = numpy.empty((len(data)/3, 4), dtype=`u1`)
a[:, :3] = data.reshape((-1, 3))
a[:, 3:] = (a[:, 3 - 1:3] >> 7) * 255
data = a.view('<i4').reshape(a.shape[:-1])

24bit wav format is very common, unfortunately it's not possible to attach a sample to this post...

rgommers · 2015-06-23T21:11:25Z

@Snotzer if that fixes the issue (I didn't check), then I guess the answer is yes. PR welcome I'd say. Would need to include a small .wav file for a unit test, similar to the ones that are already there now in scipy/io/tests/data/.

gozzilli · 2015-08-11T17:36:44Z

Including a 24bit wav file similar to the one in scipy/io/tests/data
test-44100-le-1ch-3bytes.wav

Input File     : 'test-44100-le-1ch-3bytes.wav'
Channels       : 1
Sample Rate    : 44100
Precision      : 24-bit
Duration       : 00:00:00.10 = 4418 samples = 7.51361 CDDA sectors
File Size      : 13.3k
Bit Rate       : 1.06M
Sample Encoding: 24-bit Signed Integer PCM

endolith · 2016-03-24T00:41:43Z

3-bytes per sample is only possible for the case of mmap=False, correct? Since numpy has no 3-byte data type?

Note that WAV files can have any number of bits per sample. 1-8 bits are interpreted as unsigned, 9 or more are signed. Adobe Audition can output 20-bit files, for instance, starting like so:

52 49 46 46 2A 43 00 00 57 41 56 45 66 6D 74 20 
10 00 00 00 01 00 01 00 80 BB 00 00 80 32 02 00 
03 00 14 00 64 61 74 61
      ^^^^^

      0x0014 = 20-bit

It is still stored as 3 bytes per sample, though.

I suggested in #5990 that it be possible to convert all different formats into floating point normalized from -1 to +1, the way scikits.audiolab does, for when you just want to process the signal and don't care what format it was stored in. Also if not normalized, it should have an option to output the bit depth.

rgommers · 2016-12-12T07:45:28Z

gh-6849 has a link to an implementation. The author doesn't want to make the time to do a PR though, so if someone wants to have a look at that that'd be great.

patrickmmartin · 2017-09-24T11:51:30Z

I encountered this for myself - my DAWS software records in 24-bit WAV files so it would be convenient for me just to reading

patrickmmartin/AxesXplained#7

jeremycochoy · 2019-08-05T16:56:51Z

Is the support of 24-bits wav available in a recent release of scipy, or is it still work in progress since 2014?

endolith · 2019-08-06T01:09:43Z

@jeremycochoy It's still a work in progress, in PR #6852. Can you use PySoundFile?

WarrenWeckesser · 2019-08-06T01:41:19Z

FYI: The gist that I created back in 2013 has since been converted to a package called wavio that is available on PyPI: https://pypi.org/project/wavio/

giacaglia · 2019-12-09T22:54:08Z

Another way of going around this is to change the input's bit depth using a tool like ffmpeg. The following should help:
fmpeg -i input.wav -sample_fmt s16 output.wav

WarrenWeckesser mentioned this issue Nov 6, 2013

wavfile doesn't work with 24-bit audio files #3043

Closed

WarrenWeckesser mentioned this issue Jul 31, 2014

Scipy fails to read wav file #3846

Closed

WarrenWeckesser mentioned this issue Dec 14, 2014

MAINT: io: Give an informative error when attempting to read a 24 bit wav file. #4266

Merged

argriffing mentioned this issue Jul 29, 2015

Feature request: Support for arbitrary byte length integers numpy/numpy#6130

Closed

rgommers mentioned this issue Dec 12, 2016

A few useful additions to wavfile.py (24 bit read/write support, cue markers support, loop makers support, etc.) #6849

Open

bradleycolquitt mentioned this issue Dec 1, 2017

support for 24-bit wav files vocalpy/hybrid-vocal-classifier#44

Open

This was referenced May 7, 2020

README needs warning about default write() scaling behavior WarrenWeckesser/wavio#12

Closed

scipy.io.wavfile should be able to convert to/from float arrays #12059

Open

endolith mentioned this issue Jun 1, 2020

ENH: Read arbitrary bit depth (including 24-bit) WAVs #12287

Merged

larsoner closed this as completed in #12287 Aug 26, 2020

tylerjereddy added this to the 1.6.0 milestone Sep 1, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

scipy.io.wavfile should be able to read 24 bit signed wave (Trac #1405) #1930

scipy.io.wavfile should be able to read 24 bit signed wave (Trac #1405) #1930

scipy-gitbot commented Apr 25, 2013

scipy-gitbot commented Apr 25, 2013

scipy-gitbot commented Apr 25, 2013

scipy-gitbot commented Apr 25, 2013

scipy-gitbot commented Apr 25, 2013

WarrenWeckesser commented Nov 14, 2013

WarrenWeckesser commented Nov 14, 2013

matthew-brett commented Jul 31, 2014

WarrenWeckesser commented Jul 31, 2014

damnsavage commented Jun 17, 2015

rgommers commented Jun 23, 2015

gozzilli commented Aug 11, 2015

endolith commented Mar 24, 2016

rgommers commented Dec 12, 2016

patrickmmartin commented Sep 24, 2017

jeremycochoy commented Aug 5, 2019

endolith commented Aug 6, 2019 •

edited

WarrenWeckesser commented Aug 6, 2019

giacaglia commented Dec 9, 2019 •

edited

scipy.io.wavfile should be able to read 24 bit signed wave (Trac #1405) #1930

scipy.io.wavfile should be able to read 24 bit signed wave (Trac #1405) #1930

Comments

scipy-gitbot commented Apr 25, 2013

scipy-gitbot commented Apr 25, 2013

scipy-gitbot commented Apr 25, 2013

scipy-gitbot commented Apr 25, 2013

scipy-gitbot commented Apr 25, 2013

WarrenWeckesser commented Nov 14, 2013

WarrenWeckesser commented Nov 14, 2013

matthew-brett commented Jul 31, 2014

WarrenWeckesser commented Jul 31, 2014

damnsavage commented Jun 17, 2015

rgommers commented Jun 23, 2015

gozzilli commented Aug 11, 2015

endolith commented Mar 24, 2016

rgommers commented Dec 12, 2016

patrickmmartin commented Sep 24, 2017

jeremycochoy commented Aug 5, 2019

endolith commented Aug 6, 2019 • edited

WarrenWeckesser commented Aug 6, 2019

giacaglia commented Dec 9, 2019 • edited

endolith commented Aug 6, 2019 •

edited

giacaglia commented Dec 9, 2019 •

edited