tensorflow: 'module' object has no attribute 'argv' #81

c-spencer · 2017-06-03T15:41:07Z

Caused when trying to import tensorflow (v1.2). Most likely caused by PySys_SetArgv not having been called?

Java 8, Python 2.7, Jep 3.6.3, running from within an IntelliJ IDEA scala project.

Full error:

scala> 
import jep.Jep
val jep = new Jep(false)
jep.runScript("src/main/python/test.py")

scala> jep: jep.Jep = jep.Jep@4c531172

scala> jep.JepException: <type 'exceptions.AttributeError'>: 'module' object has no attribute 'argv'
  at /System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/argparse.__init__(argparse.py:1586)
  at /Users/chris/tensorflow/lib/python2.7/site-packages/tensorflow/python/platform/flags.<module>(flags.py:25)
  at /Users/chris/tensorflow/lib/python2.7/site-packages/tensorflow/python/platform/app.<module>(app.py:23)
  at /Users/chris/tensorflow/lib/python2.7/site-packages/tensorflow/python/__init__.<module>(__init__.py:98)
  at /Users/chris/tensorflow/lib/python2.7/site-packages/tensorflow/__init__.<module>(__init__.py:24)
  at src/main/python/test.<module>(test.py:3)

test.py:

import tensorflow as tf

The text was updated successfully, but these errors were encountered:

bsteffensmeier · 2017-06-04T02:48:01Z

I think it would be good to make it so the args can be set in PyConfig

ndjensen · 2017-06-04T23:43:19Z

Can you set Python's sys.argv before calling runScript? That may be a workaround.

PyConfig is for pre-init parameters that apply to the entire Python interpreter. argv is potentially applicable to a single script, especially when called through Jep.runScript(String). We should overload the method Jep.runScript(String) to take more arguments.

c-spencer · 2017-06-05T07:20:48Z

Setting sys.argv before (or at the top of) runScript gets around this, thanks. Passing args through runScript sounds like a good solution.

akrauchanka · 2017-06-09T14:27:30Z

I've recently faced the same issue with shared module feature. In this case this workaround can't be applied as adding shared modules happens in Jep constructor.

Could it be qualified as issue in this case?

ndjensen · 2017-06-09T16:04:37Z

@akrauchanka, can you explain your use case in more detail?

akrauchanka · 2017-06-12T15:19:51Z

@ndjensen, sure. I've added tensorflow module to shared modules list, but with Jep constructor call I've got the same exception as mentioned in issue title. Workaround you've provided hadn't worked, because exception has been thrown on constructor call. So I've downloaded sources, made changes to C code to pass empty string to embedded interpreter as parameter during creation. Then I rebuild JEP from sources and tried it - works good.
Looks like hack, agree, but there is no options to pass params to embedded interpreter on creation to prevent modules, that depends on sys.argv parameter to work properly on sharing.

ndjensen · 2017-06-12T18:50:40Z

It sounds like for flexibility we need PyConfig to have a default argv of empty string "" so it can be set globally. That code would be CPython mostly using

Then we also need to overload Jep.runScript() to take a list of arguments and we'd manipulate sys.argv with Python code.

Update: Went with Jep.setSharedModulesArgv() since it's not quite the same as PyConfig. Skipping Jep.runScript() for this ticket.

ndjensen · 2017-06-13T14:23:37Z

Does anybody know why tensorflow requires sys.argv? If tensorflow would like to work well in an embedded environment, it shouldn't be so reliant on sys.argv. That said, the entire concept of shared modules was born from a lack of external libraries working well within embedded environments. Therefore, we will strive to make shared modules work as well as possible.

ndjensen · 2017-06-21T17:21:51Z

For this ticket we want to add to PyConfig a variable argv, probably a String[], and then in the CPython where PyConfig is used (search for pyembed_preinit) use PySys_SetArgvEx. We'll split off Jep.runScript() changes to a separate task.

Target branch dev_3.7. If anyone wants to submit a pull request it will get done faster, otherwise I will eventually get to it.

eastcirclek · 2017-06-26T08:28:43Z

I really hope this issue is solved.
I'm using Jep to do inference using Keras on top of Theano and TensorFlow inside Apache Flink.
As I usually have multiple sub-interpreters by different threads in a single process, I have to use dev_3.7 to avoid race condition in shared modules as reported in #69.

To have tensorflow 1.2 in shared module, I make a file named 'tf_init.py' under my working directory which looks below.
import sys
sys.argv = ['pdm']
import tensorflow.python

Then I initialize Jep using new JepConfig().addSharedModules("tf_init", "numpy", "scipy", "h5py", "tensorflow")

Before doing any Keras/TF-related stuff, I do jep.eval("import tf_init") so that sys.argv is set in the top interpreter.
The reason I add import tensorflow.python is to make sure that tensorflow is loaded by the top interpreter.
Before I upgrade to tensorflow 1.2, I just need the following two lines in tf_init.py:
import sys
sys.argv = ['pdm']

For the top interpreter to see "tf_init.py" I set PYTHONPATH to my working directory. I had to do this because I don't think Jep provides a means to set an include path for the top interpreter, which is irrelevant to this issue but I hope someone to figure it out as well as this issue.

fixes ninia#81

ndjensen · 2017-07-10T17:23:06Z

Ok, two things:

@c-spencer, @akrauchanka, or @eastcirclek, can one of you open a tensorflow issue that is basically, "tensorflow doesn't work well in embedded environments due to reliance on sys.argv". Also link to this Jep ticket, and on this Jep ticket add a link to the tensorflow ticket. I am not comfortable opening the tensorflow ticket due to my lack of familiarity with tensorflow. But once open, we can all add comments with more information to the ticket.
I added a method setSharedModulesArgv(String...) on my fork of Jep dev_3.7 to attempt to fix the issue. I wrote a unit test for it but have not tested it with tensorflow. @akrauchanka and @eastcirclek, can you test it out?

A simple test could be something like:

Jep.setSharedModulesArgv("");
Jep jep = new Jep();
jep.eval("import tensorflow");

If it works I will merge it into the main repository's dev_3.7.

eastcirclek · 2017-07-11T06:45:58Z

I tested your code of branch dev_3.7.
Thanks to Jep.setSharedModulesArgv(), I can safely remove the two lines from the file I explained above #81 (comment):

#import sys
#sys.argv = ['pdm']
import tensorflow.python

eastcirclek · 2017-07-11T07:22:04Z

@ndjensen

I don't think this is specific to tensorflow; it can be a problem for other programs which call the argparse module.

val jep = new Jep()
jep.eval("import argparse")
jep.eval("parser = argparse.ArgumentParser()")

Upon executing argparse.ArgumentParser(), I got the error:
Exception in thread "main" jep.JepException: <class 'AttributeError'>: module 'sys' has no attribute 'argv'

The simplest way to avoid this error seems to declare another class in tensorflow/python/platform/flags.py that doesn't depend on argparse.ArgumentParser.

ndjensen · 2017-07-11T13:24:04Z

@eastcirclek, thanks for testing. I've merged the code from my fork to the main jep dev_3.7.

Ok, I agree we don't need a tensorflow ticket based on your investigation. I have concerns about the complexity we're adding to Jep to support quirks of various CPython extensions (the entire shared modules concept was added to work around issues with numpy). But since it helps the Jep community we'll keep doing our best.

eastcirclek · 2017-07-13T00:34:55Z

@ndjensen

I'm sure of the importance of having shared module in Jep. Python libraries like Scipy, H5py, Theano, and TensorFlow to name a few are not a pure Python library, so without Jep they cannot be used inside JVM-based data processing engines in which multiple sub-interpreters should be created and destroyed as user jobs are created and finished. TensorFlow supports its Java API and I tested it. However, Tensorflow Java API somehow shows worse performance than TensorFlow Python API; so I stick to use Jep+TensorFlow Python API.

So there's no doubt about shared module to me 💯

ndjensen added the improvement label Jun 5, 2017

ndjensen added a commit to ndjensen/jep that referenced this issue Jul 7, 2017

enable setting sys.argv on the main/top interpreter

c823960

fixes ninia#81

ndjensen added a commit to ndjensen/jep that referenced this issue Jul 7, 2017

enable setting sys.argv on the main/top interpreter

fdc1a59

fixes ninia#81

ndjensen added a commit to ndjensen/jep that referenced this issue Jul 7, 2017

enable setting sys.argv on the main/top interpreter

ff8d7ee

fixes ninia#81

ndjensen added a commit to ndjensen/jep that referenced this issue Jul 7, 2017

enable setting sys.argv on the main/top interpreter

be5dabf

fixes ninia#81

ndjensen added a commit to ndjensen/jep that referenced this issue Jul 7, 2017

enable setting sys.argv on the main/top interpreter

5a86053

fixes ninia#81

ndjensen added a commit to ndjensen/jep that referenced this issue Jul 7, 2017

enable setting sys.argv on the main/top interpreter

37344de

fixes ninia#81

ndjensen added a commit to ndjensen/jep that referenced this issue Jul 7, 2017

enable setting sys.argv on the main/top interpreter

345c987

fixes ninia#81

ndjensen added a commit to ndjensen/jep that referenced this issue Jul 7, 2017

enable setting sys.argv on the main/top interpreter

0a82199

fixes ninia#81

ndjensen changed the title ~~'module' object has no attribute 'argv'~~ tensorflow: 'module' object has no attribute 'argv' Jul 10, 2017

ndjensen mentioned this issue Jul 11, 2017

enable setting sys.argv on the main/top interpreter #85

Merged

akshaynayak mentioned this issue Jul 17, 2017

jep.JepException: <class 'AttributeError'>: module 'sys' has no attribute 'argv' sushant-hiray/scala-python-example#4

Closed

ndjensen closed this as completed in 4088190 Aug 2, 2017

ndjensen mentioned this issue May 3, 2019

matplotlib.pyplot.plot() not working with Jep (python 3.7.3) #187

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tensorflow: 'module' object has no attribute 'argv' #81

tensorflow: 'module' object has no attribute 'argv' #81

c-spencer commented Jun 3, 2017 •

edited

bsteffensmeier commented Jun 4, 2017

ndjensen commented Jun 4, 2017

c-spencer commented Jun 5, 2017

akrauchanka commented Jun 9, 2017 •

edited

ndjensen commented Jun 9, 2017

akrauchanka commented Jun 12, 2017 •

edited

ndjensen commented Jun 12, 2017 •

edited

ndjensen commented Jun 13, 2017 •

edited

ndjensen commented Jun 21, 2017

eastcirclek commented Jun 26, 2017 •

edited

ndjensen commented Jul 10, 2017

eastcirclek commented Jul 11, 2017

eastcirclek commented Jul 11, 2017 •

edited

ndjensen commented Jul 11, 2017

eastcirclek commented Jul 13, 2017 •

edited

tensorflow: 'module' object has no attribute 'argv' #81

tensorflow: 'module' object has no attribute 'argv' #81

Comments

c-spencer commented Jun 3, 2017 • edited

bsteffensmeier commented Jun 4, 2017

ndjensen commented Jun 4, 2017

c-spencer commented Jun 5, 2017

akrauchanka commented Jun 9, 2017 • edited

ndjensen commented Jun 9, 2017

akrauchanka commented Jun 12, 2017 • edited

ndjensen commented Jun 12, 2017 • edited

ndjensen commented Jun 13, 2017 • edited

ndjensen commented Jun 21, 2017

eastcirclek commented Jun 26, 2017 • edited

ndjensen commented Jul 10, 2017

eastcirclek commented Jul 11, 2017

eastcirclek commented Jul 11, 2017 • edited

ndjensen commented Jul 11, 2017

eastcirclek commented Jul 13, 2017 • edited

c-spencer commented Jun 3, 2017 •

edited

akrauchanka commented Jun 9, 2017 •

edited

akrauchanka commented Jun 12, 2017 •

edited

ndjensen commented Jun 12, 2017 •

edited

ndjensen commented Jun 13, 2017 •

edited

eastcirclek commented Jun 26, 2017 •

edited

eastcirclek commented Jul 11, 2017 •

edited

eastcirclek commented Jul 13, 2017 •

edited