
Read wavfiles of size > 4GB #8529

Closed · wants to merge 6 commits

Conversation

TimFelixBeyer
Contributor

Adds the ability to open a .wav file in the 'RF64' format, which is used for files with a size > 4 GB. I'm not sure how to write a unit test for this, since it would require uploading a huge .wav file.
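For context, an RF64 file keeps the usual RIFF/WAVE layout but sets the 32-bit size field to 0xFFFFFFFF and stores the real 64-bit sizes in a 'ds64' chunk directly after the 'WAVE' identifier. A minimal sketch of reading those sizes (the helper name and error handling are illustrative, not the code in this PR):

```python
import struct

def _read_rf64_sizes(fid):
    # Hypothetical helper, not scipy.io.wavfile code: recover the 64-bit
    # sizes of an RF64 WAV file from its 'ds64' chunk (EBU Tech 3306).
    chunk_id = fid.read(4)          # b'RF64' instead of b'RIFF'
    fid.read(4)                     # 32-bit size field, set to 0xFFFFFFFF
    fid.read(4)                     # b'WAVE'
    ds64_id = fid.read(4)           # b'ds64' must follow immediately
    if chunk_id != b'RF64' or ds64_id != b'ds64':
        raise ValueError("not an RF64 file")
    ds64_size = struct.unpack('<I', fid.read(4))[0]
    ds64 = fid.read(ds64_size)
    riff_size, data_size, sample_count = struct.unpack('<QQQ', ds64[:24])
    return riff_size, data_size, sample_count
```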

@larsoner
Member

larsoner commented Mar 7, 2018

Would it be a lot of extra work to add the equivalent writer? If not, then a round-trip test could work.

@pv
Member

pv commented Mar 7, 2018 via email

@TimFelixBeyer
Contributor Author

I've added the corresponding writer. However, I'm still not sure how to go about implementing the test: the format only really makes sense for data > 4 GB, so the writer should only use it when the data is too large for a plain RIFF container.

@rgommers added the enhancement (a new feature or improvement) and scipy.io labels on Mar 17, 2018
@larsoner
Member

> the format only really makes sense for data > 4 GB, so the writer should only use it when the data is too large for a plain RIFF container.

Although this might be true in practice, can you opt in to using the RF64 format even for files < 4 GB (even if just for the purpose of testing round-trip IO)?

@TimFelixBeyer
Contributor Author

According to the official spec, the standard file-size fields should be used if the file is smaller than 4 GB: https://tech.ebu.ch/docs/tech/tech3306-2009.pdf
It'd be possible to force the RF64 format for a smaller file, but then a large part of the changes (writing/reading the new 64-bit size fields) wouldn't be tested.
Alternatively, one could disregard the spec and write the file size into the 64-bit fields even if it's < 4 GB. Let me know which of these options is best and I'll try to implement it.
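As an illustration of the switch the spec implies (the helper name is hypothetical, not this PR's implementation), the writer would stay with plain RIFF whenever the total size still fits in the 32-bit field:

```python
_RIFF_MAX = 0xFFFFFFFF  # largest value a 32-bit RIFF size field can hold

def _container_for(total_nbytes):
    # Hypothetical sketch of the spec-compliant choice (EBU Tech 3306):
    # plain RIFF while the size fits in 32 bits, RF64 only beyond that.
    return b'RF64' if total_nbytes > _RIFF_MAX else b'RIFF'
```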

@larsoner
Member

> According to the official spec, the standard file-size fields should be used if the file is smaller than 4 GB

This makes sense, and seems like reasonable behavior for SciPy to follow.

> Alternatively, one could disregard the spec and write the file size into the 64-bit fields even if it's < 4 GB.

This is what I propose to do for testing purposes only, yes, especially if there is some way to do it that does not expose the option to the user (since it's non-standard). Maybe a private function that does the heavy lifting could be used for this purpose.
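A sketch of what such a private entry point could look like (the name, chunk-layout details, and the force_rf64 knob are assumptions for illustration, not this PR's code):

```python
import struct

def _write_header(fid, total_size, data_size, force_rf64=False):
    # Hypothetical private helper: the public writer would always call it
    # with force_rf64=False; tests could call it directly with
    # force_rf64=True to exercise the 64-bit path on a tiny file.
    if force_rf64 or total_size > 0xFFFFFFFF:
        fid.write(b'RF64')
        fid.write(struct.pack('<I', 0xFFFFFFFF))   # 32-bit field becomes a sentinel
        fid.write(b'WAVE')
        fid.write(b'ds64')
        fid.write(struct.pack('<I', 28))           # ds64 chunk size
        # riffSize, dataSize, sampleCount (left 0 in this sketch), table length 0
        fid.write(struct.pack('<QQQI', total_size, data_size, 0, 0))
    else:
        fid.write(b'RIFF')
        fid.write(struct.pack('<I', total_size))
        fid.write(b'WAVE')
```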

@777arc

777arc commented Dec 2, 2023

This would be a valuable addition!!

@lucascolley
Member

> > Alternatively, one could disregard the spec and write the file size into the 64-bit fields even if it's < 4 GB.
>
> This is what I propose to do for testing purposes only, yes, especially if there is some way to do it that does not expose the option to the user (since it's non-standard). Maybe a private function that does the heavy lifting could be used for this purpose.

@TimFelixBeyer are you interested in returning to do this? Sounds like it's all that is needed to test this properly and get this in. Judging by the recent comment, there is still some demand for this feature.

@lucascolley added the needs-work (items that are pending response from the author) label on Jan 16, 2024
@TimFelixBeyer
Contributor Author

Sure, I’m currently busy but can take another look in about 3 weeks if that’s ok.

@TimFelixBeyer
Contributor Author

@lucascolley I'd like some input regarding testing.
In addition to testing with small modified RF64 files, I have a working test on my local machine that performs a round-trip write of a large (> 4 GB) array, but it takes quite a while and requires a large amount of memory to complete.
However, I think a round-trip test would be valuable in principle.
Are there any guidelines regarding how long test cases are allowed to take?

@lucascolley
Member

We have a marker for extremely slow tests, which can be applied with pytest.mark.xslow, to avoid running them on every CI job. How about you add the test with that marker, and then we can see how long it takes?
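Something along these lines, perhaps (a sketch only: it assumes the reader/writer from this PR and the xslow marker mentioned above, and the array size is chosen just large enough to push the writer past the 32-bit limit):

```python
import numpy as np
import pytest

from scipy.io import wavfile


@pytest.mark.xslow
def test_rf64_roundtrip(tmp_path):
    # Write > 4 GiB of samples so the writer must take the RF64 path,
    # then read the file back (memory-mapped) and compare a slice.
    fname = str(tmp_path / "big.wav")
    n = 2**31 + 2**20                      # int16 samples -> a bit over 4 GiB
    data = np.zeros(n, dtype=np.int16)
    data[:1000] = np.arange(1000, dtype=np.int16)
    wavfile.write(fname, 44100, data)
    rate, read_back = wavfile.read(fname, mmap=True)
    assert rate == 44100
    np.testing.assert_array_equal(read_back[:1000], data[:1000])
```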

@TimFelixBeyer
Contributor Author

Opened #20079 to avoid having to rebase, etc. @lucascolley, feel free to check out the new PR.
