ffmpeg.decode_audio cannot be run in parallel #5804

jheymann85 · 2016-11-23T10:19:26Z

To decode the input, decode_audio writes the content to a temporary file. The file name is generated by the function GetTempFilename in tensorflow/contrib/ffmpeg/default/ffmpeg_lib.cc, using the template %tmp_dir/tmp_file_%PID.%EXT.

When multiple decoders run in parallel within one process, this causes nondeterministic behaviour: they all share the same PID, so they all write to, and afterwards delete, the same file.

A possible solution would be to use the thread ID instead of the process ID, i.e.:

#include <sys/syscall.h>
#define gettid() syscall(SYS_gettid)
...
return io::JoinPath(dir, StrCat("tmp_file_", gettid(), ".", extension));

The first two lines are necessary because glibc does not provide a wrapper for the gettid system call.

This solution works for me (on Linux). However, I'm not sure whether it works on all supported platforms. If that's fine, I can make a pull request.
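
On the portability question, one option might be the standard C++11 thread id in place of the Linux-only gettid() syscall. The following is only a minimal sketch under assumptions: ThreadTempFilename is a hypothetical helper (not a TensorFlow function), and plain stream concatenation stands in for io::JoinPath and StrCat. The standard only guarantees that distinct thread ids print differently, not that the printed text is a plain number.

```cpp
#include <cassert>
#include <sstream>
#include <string>
#include <thread>

// Hypothetical helper, not part of TensorFlow: builds a temp file name that
// embeds the calling thread's id instead of the process id.
std::string ThreadTempFilename(const std::string& dir,
                               const std::string& extension) {
  assert(!extension.empty());  // the template always expects an extension
  std::ostringstream name;
  // operator<< on std::thread::id yields an unspecified text form that is
  // identical for equal ids and different for different ids.
  name << dir << "/tmp_file_" << std::this_thread::get_id() << "."
       << extension;
  return name.str();
}
```

Two threads of the same process calling ThreadTempFilename("/tmp", "wav") at the same time would get different names. Separate processes could still produce the same name, since thread ids are only unique within a process.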

prb12 · 2016-11-23T16:58:50Z

@fredbertsch You are listed as owner for this contrib dir... could you please take a look?

jonasrauber · 2017-01-27T17:28:38Z

I also have this problem. Will you fix this?

gunan · 2017-06-16T20:57:58Z

@fredbertsch Any updates here?
Is this still a problem?

rryan · 2017-10-23T19:42:21Z

Just keeping folks in the loop: we have a candidate fix internally that @fredbertsch authored. Hopefully it will show up in master over the next day or two!

fredbertsch · 2017-10-24T15:40:40Z

A test and fix were added internally, and they should propagate here soon.

Bug is at: tensorflow#5804
Fix is to add a unique identifier to each temp file name. The id is unique to the process. Multiple processes could still have a conflict, though even there the odds do go down somewhat with this fix.
PiperOrigin-RevId: 173261202
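
The internal change itself is not shown in this thread. As an illustration only, the idea of adding a per-process unique identifier to each temp file name could be sketched as below. UniqueTempFilename and its atomic counter are hypothetical, keeping the PID in the name is an assumption carried over from the original template, and getpid() limits this sketch to POSIX systems.

```cpp
#include <atomic>
#include <cassert>
#include <cstdint>
#include <string>
#include <unistd.h>  // getpid(); POSIX only

// Hypothetical helper, not the actual TensorFlow fix: appends a counter that
// is incremented atomically, so every call within one process gets its own
// file name even when many threads call it concurrently.
std::string UniqueTempFilename(const std::string& dir,
                               const std::string& extension) {
  static std::atomic<std::uint64_t> counter{0};
  assert(!extension.empty());  // the template always expects an extension
  // fetch_add returns the previous value; no two calls see the same one.
  const std::uint64_t id = counter.fetch_add(1, std::memory_order_relaxed);
  return dir + "/tmp_file_" + std::to_string(getpid()) + "_" +
         std::to_string(id) + "." + extension;
}
```

Unlike a thread-id based name, the counter also keeps names distinct when one thread creates several temp files in sequence.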

tensorflowbutler · 2017-12-22T07:40:29Z

It has been 14 days with no activity and this issue has an assignee. Please update the label and/or status accordingly.

prb12 assigned fredbertsch Nov 23, 2016

woodshop mentioned this issue Jul 30, 2017

tf.contrib.ffmpeg.decode_audio causes kernel crash w/ multi-threading #10196

Closed

gunan closed this as completed Dec 22, 2017
