Values of '[Not Supported]' are not handled properly. #2

tasptz · 2017-10-05T09:02:43Z

Values of '[Not Supported]' are not handled properly.

In [1]: import GPUtil

In [2]: g = GPUtil.getGPUs()
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-2-871afb3451f3> in <module>()
----> 1 g = GPUtil.getGPUs()

~\AppData\Local\Continuum\Anaconda3\envs\tensorflow\lib\site-packages\GPUtil\__init__.py in getGPUs()
     80                 deviceIds[g] = int(vals[i])
     81             elif (i == 1):
---> 82                 gpuUtil[g] = float(vals[i])/100
     83             elif (i == 2):
     84                 memTotal[g] = int(vals[i])

ValueError: could not convert string to float: '[Not Supported]'

The text was updated successfully, but these errors were encountered:

anderskm · 2017-10-05T11:22:46Z

Thank you for opening the issue.

The issue can be handled by wrapping it in a try-catch statement and setting it to some fixed value, if the typecasting fails.
This, however, opens the questions, what the most appropriate value would be. Any number between 0 (no load) and 1 (full load) doesn't really make sense, as we do not know anything about the load. Then NaN comes to mind, but that messes with the sorting functions, as NaN is an unordered "number".
Despite this, I think the best option is setting it to NaN, if the typecasting fails and then writing a custom sorting function. However, I am open for suggestions.

anderskm · 2017-10-09T11:48:43Z

@tasptz I've committed an update to GPUtil, which should resolve the issue.
The code now tries to typecast the load (and memory) to a float, but if it fails, it sets it to nan.
When getting available GPUs (getAvailable, getFirstAvailable or getAvailability) the optional input "includeNan" can be set to True (default: False) include GPUs with a nan load or memory.
I have tested it by manually inserting nan values. Would you mind testing it using your setup?

anderskm · 2017-10-26T11:51:08Z

@tasptz Have you had a chance to test if the updated version works for you?

dizcza · 2017-10-31T19:29:29Z

@anderskm I also had the same problem and I confirm current master (commit a492d3b) fixes it
yet you broke py2-3 compatibility in v1.2.3...master#diff-6d20cf947cfd76895c515f6b1c48b0a0R145
python3 list's options do NOT contain comparator thus current master code does not work with py3.
please, consider adding unit tests to prevent such regression

anderskm · 2017-11-01T08:53:31Z

@dizcza Yay, and doh! :-)

Thank you for confirming that it fixes the initial problem, but also broke the compatibility. I was not aware of the removal of the cmp parameters from py2 to py3. I'll see if I can find a good solution within the next few days. I'm open for suggestions of how to best fix it ;-)

I completely agree with the unit testing. It should be done, and it has been on my to-do list, but I have not had time to set it up properly yet. So far it has mainly been done manually, which is far from ideal, as the recent update shows.

anderskm · 2017-11-01T11:46:21Z

@dizcza I believe, I have found a solution, which is compatible with both py2 and py3.
The solution uses the key option in list.sort(), which should work in both py2 and py3.
E.g.:
GPUs.sort(key=lambda x: np.Inf if np.isnan(x.id) else x.id, reverse=False)

It also seems like a nicer solution than the custom compare function.

Will test it properly before committing ;-)

anderskm · 2017-11-01T14:34:12Z

Had a chance to test it, and it seems to work in both py2 and py3.
Tested it manually by setting the load of odd GPU id's to "Not supported" and sorting according to load.
Got same expected results in both py2 and py3 environments (using anaconda) on the same machine.
I have pushed a new version (f1aa347).

anderskm added the bug label Oct 9, 2017

anderskm self-assigned this Oct 9, 2017

anderskm closed this as completed Nov 8, 2017

ChrisPalmerNZ mentioned this issue Mar 22, 2018

ValueError #5

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Values of '[Not Supported]' are not handled properly. #2

Values of '[Not Supported]' are not handled properly. #2

tasptz commented Oct 5, 2017

anderskm commented Oct 5, 2017

anderskm commented Oct 9, 2017

anderskm commented Oct 26, 2017

dizcza commented Oct 31, 2017

anderskm commented Nov 1, 2017

anderskm commented Nov 1, 2017

anderskm commented Nov 1, 2017

Values of '[Not Supported]' are not handled properly. #2

Values of '[Not Supported]' are not handled properly. #2

Comments

tasptz commented Oct 5, 2017

anderskm commented Oct 5, 2017

anderskm commented Oct 9, 2017

anderskm commented Oct 26, 2017

dizcza commented Oct 31, 2017

anderskm commented Nov 1, 2017

anderskm commented Nov 1, 2017

anderskm commented Nov 1, 2017