Use directly in memory, instead of via files, possible? #6

carlthome · 2016-06-27T16:00:36Z

Would it be possible to input audio as ndarray:s into pysox directly as well as files from disk? Why I'm asking is because I'm using librosa for onset detection, and thus already have the audio loaded, but would still like to apply some audio effects and stuff afterwards.

Like maybe the constructors could do a type check, and build() could return the destination audio or something?

y = librosa.load(path)[0]
tfm = sox.Transformer(y)
# Do stuff...
y = tfm.build()

I realize this adds extra complexity to pysox (like having librosa as a dependency or similar) which is intended to be a clean wrapper around SoX, so I get if it's deemed out of scope. Just asking!

rabitt · 2016-08-06T16:06:07Z

I thought about this, and I decided it's out of scope for this library. I'd suggest simply writing the audio back to disk and then using pysox on the file. I realize this isn't ideal, but I think to do this in pysox without saving the audio first requires a lot of command line magic.

That said, if you find a clean way to do it, I'm happy to look at the PR.

justinsalamon · 2016-10-05T20:37:18Z

Not that this solves the issue of taking an i/o hit, but in the meanwhile a safe way to concatenate operations is to use tempfile:

with tempfile.NamedTemporaryFile(suffix='.wav') as tmp:

    tfm = sox.Transformer()
    tfm.trim(0, 5)
    tfm.build(infile, tmp.name)

    cbn = sox.Combiner()
    cbn.build([tmp.name, tmp.name], outfile, 'concatenate')

The downside is that since sox requires a filename, you need to use a NamedTemporaryFile and that means the system will think the file already exists when you call build(), which means you'll always get an overwrite warning (which perhaps could be disabled by the addition of an optional flag @rabitt ?)

Right now the workaround is by changing the logger level:

logger = logging.getLogger()
logger.setLevel('CRITICAL')

with tempfile.NamedTemporaryFile(suffix='.wav') as tmp:

    tfm = sox.Transformer()
    tfm.trim(0, 5)
    tfm.build(infile, tmp.name)

    cbn = sox.Combiner()
    cbn.build([tmp.name, tmp.name], outfile, 'concatenate')

logger.setLevel('WARNING')

Or if you just want to suppress all logging you can do logger.disabled = True

carlthome · 2016-10-06T15:16:58Z

@justinsalamon, @rabitt, I've published a lightweight SoX effects wrapper (reverb, phaser, delay, etc.) that pipes NumPy ndarrays over stdin/stdout instead of creating extra I/O load with tempfiles. Maybe you can use bits of it in pysox: https://github.com/carlthome/python-audio-effects

rabitt · 2016-10-07T17:31:56Z

@justinsalamon can you open a separate issue about optionally disabling the overwrite warning?

rabitt · 2016-10-07T17:32:38Z

@carlthome Looks awesome! I plan to include this feature when I bump to v1.3

justinsalamon · 2016-10-11T16:36:23Z

@justinsalamon can you open a separate issue about optionally disabling the overwrite warning?

#25

smolendawid · 2018-01-17T12:55:10Z

I must admit that it would be very useful. I need this transformations on the fly, holding everything in RAM, so saving to a disk and then again loading to python is inefficient.

hadware · 2018-01-27T23:27:55Z

I'll try porting @carlthome 's work to pysox, It'd be nice if you guys could give me some pointers on how you'd wish it to be implemented. I'd personally see it as another method on the transformer and combine class to output a numpy ndarray, such as build_ndarray.

To input ndarray soundfiles, i guess this could be just some slight modifications on the args of the build method.

carlthome · 2018-02-04T14:41:54Z

Sound great! I'd love to get rid of my package entirely and have pysox be the only SoX Python wrapper if possible.

Consolidate:

lostanlen · 2018-08-02T15:57:19Z

@carlthome I do not think that solving this issue would require having all of librosa as a dependency. Since it's purely a matter of I/O, pysoundfile is sufficient for the most common lossless formats: WAV / FLAC / OGG

https://github.com/bastibe/SoundFile

rabitt · 2018-08-10T16:07:48Z

@hadware -

I'll try porting @carlthome 's work to pysox, It'd be nice if you guys could give me some pointers on how you'd wish it to be implemented. I'd personally see it as another method on the transformer and combine class to output a numpy ndarray, such as build_ndarray.

I agree that what makes the most sense is to have a separate function build_array that mirrors build but inputs and outputs an ndarray. I prefer this to overloading the current build function.

To input ndarray soundfiles, i guess this could be just some slight modifications on the args of the build method.

Yes, as far as I can tell it's doable by changing the input and output filename to the - character and piping the audio data to stdin. Some relevant info in the SoX documentation in the section on "Special Filenames".

In terms of capturing the audio data from stdout, I think everything is already in place, but if you run into issues, this is where the problem is likely to be.

Let me know if you need any help!

pseeth · 2020-05-15T22:29:35Z

I'm going to give addressing this issue a shot, mostly by trying to port @carlthome's pysndfx fix into this library. Do people have the bandwidth for a PR for this soon (hopefully)? Looking over the comments here, I believe such a change would require adding numpy as a dependency.

rabitt · 2020-05-21T13:12:56Z

This is done in #102 , thanks @pseeth !
I've opened #106 to discuss how the API should look moving forward before pushing to a full release.

danihodovic · 2022-09-27T14:00:06Z

Is it possible to use the file_info API with in memory files?

rabitt closed this as completed Aug 6, 2016

rabitt reopened this Aug 17, 2016

rabitt added this to the 1.3 milestone Aug 19, 2016

rabitt self-assigned this Aug 19, 2016

rabitt added the enhancement label Aug 19, 2016

justinsalamon mentioned this issue Oct 11, 2016

Disable warnings when overwriting output file #25

Closed

rabitt modified the milestones: 1.4, 1.3 May 29, 2017

lostanlen mentioned this issue Jul 4, 2019

Converting wav to vox #90

Closed

epicycles mentioned this issue Feb 13, 2020

Unexpected warnings with dumping sources alongside mix justinsalamon/scaper#75

Open

pseeth mentioned this issue May 16, 2020

Use Transformer in-memory with stdin/stdout #102

Merged

rabitt closed this as completed May 21, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use directly in memory, instead of via files, possible? #6

Use directly in memory, instead of via files, possible? #6

carlthome commented Jun 27, 2016 •

edited

rabitt commented Aug 6, 2016 •

edited

justinsalamon commented Oct 5, 2016

carlthome commented Oct 6, 2016

rabitt commented Oct 7, 2016

rabitt commented Oct 7, 2016

justinsalamon commented Oct 11, 2016

smolendawid commented Jan 17, 2018

hadware commented Jan 27, 2018

carlthome commented Feb 4, 2018

lostanlen commented Aug 2, 2018 •

edited

rabitt commented Aug 10, 2018

pseeth commented May 15, 2020

rabitt commented May 21, 2020 •

edited

danihodovic commented Sep 27, 2022

Use directly in memory, instead of via files, possible? #6

Use directly in memory, instead of via files, possible? #6

Comments

carlthome commented Jun 27, 2016 • edited

rabitt commented Aug 6, 2016 • edited

justinsalamon commented Oct 5, 2016

carlthome commented Oct 6, 2016

rabitt commented Oct 7, 2016

rabitt commented Oct 7, 2016

justinsalamon commented Oct 11, 2016

smolendawid commented Jan 17, 2018

hadware commented Jan 27, 2018

carlthome commented Feb 4, 2018

lostanlen commented Aug 2, 2018 • edited

rabitt commented Aug 10, 2018

pseeth commented May 15, 2020

rabitt commented May 21, 2020 • edited

danihodovic commented Sep 27, 2022

carlthome commented Jun 27, 2016 •

edited

rabitt commented Aug 6, 2016 •

edited

lostanlen commented Aug 2, 2018 •

edited

rabitt commented May 21, 2020 •

edited