
FlairEmbeddings function gives ValueError in python 3.6 (latest Nvidia Pytorch Docker container) #1744

Closed
tylerlekang opened this issue Jul 6, 2020 · 7 comments · Fixed by #1745
Labels
bug Something isn't working

Comments

@tylerlekang

Describe the bug
Simply running the code FlairEmbeddings('news-forward') gives a ValueError in Python 3.6.10 (Conda), the Python environment included in the most recent PyTorch Docker container from Nvidia. (https://docs.nvidia.com/deeplearning/frameworks/pytorch-release-notes/rel_20-06.html#rel_20-06)

Here is the error message:

Traceback (most recent call last):
  File "fineTune_langModel.py", line 10, in <module>
    language_model = FlairEmbeddings('news-forward').lm
  File "/opt/conda/lib/python3.6/site-packages/flair/embeddings/token.py", line 578, in __init__
    self.lm: LanguageModel = LanguageModel.load_language_model(model)
  File "/opt/conda/lib/python3.6/site-packages/flair/models/language_model.py", line 202, in load_language_model
    dropout=state["dropout"],
  File "/opt/conda/lib/python3.6/site-packages/flair/models/language_model.py", line 63, in __init__
    self.to(flair.device)
  File "/opt/conda/lib/python3.6/site-packages/torch/nn/modules/module.py", line 465, in to
    return self._apply(convert)
  File "/opt/conda/lib/python3.6/site-packages/flair/models/language_model.py", line 404, in _apply
    for info in torch.__version__.replace("+",".").split('.') if info.isdigit())
ValueError: not enough values to unpack (expected at least 3, got 2)
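
The failure can be reproduced in isolation: the generator expression in language_model.py keeps only purely numeric version components, so a version string like 1.6.0a0+9907a3e yields fewer than the three integers the unpacking expects (a minimal sketch of the same logic, without importing torch or flair):

```python
version = "1.6.0a0+9907a3e"  # torch.__version__ reported by the Nvidia container
parts = [int(info) for info in version.replace("+", ".").split(".") if info.isdigit()]
print(parts)  # [1, 6] -- "0a0" and "9907a3e" fail isdigit() and are dropped

try:
    major, minor, build, *_ = (int(p) for p in parts)
except ValueError as e:
    print(e)  # not enough values to unpack (expected at least 3, got 2)
```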

To Reproduce
Run this code in the container (after pip installing flair):
from flair.embeddings import FlairEmbeddings
FlairEmbeddings('news-forward')

Expected behavior
The call should succeed with no errors (in particular, the .lm is intended to be passed to the LanguageModelTrainer).

Environment (please complete the following information):

  • Docker container running Linux (all details of Ubuntu, Python, PyTorch, etc. versions are in the Nvidia link above)
  • Flair version is 0.5

Additional context
Running the code in Python 3.7.7 (Conda) gives no problems.

@tylerlekang tylerlekang added the bug Something isn't working label Jul 6, 2020
@alanakbik
Collaborator

@tylerlekang thanks for reporting this. Could you print the torch version you get with torch.__version__? Also, can you try updating to Flair 0.5.1?

@tylerlekang
Author

@alanakbik torch.__version__ reports 1.6.0a0+9907a3e, which matches what is shown in https://docs.nvidia.com/deeplearning/frameworks/pytorch-release-notes/rel_20-06.html#rel_20-06

Used pip install --upgrade flair to upgrade to 0.5.1. The same error persists:

>>> flair.__version__
'0.5.1'
>>>
>>> from flair.embeddings import FlairEmbeddings
>>> FlairEmbeddings('news-forward')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/opt/conda/lib/python3.6/site-packages/flair/embeddings/token.py", line 586, in __init__
    self.lm: LanguageModel = LanguageModel.load_language_model(model)
  File "/opt/conda/lib/python3.6/site-packages/flair/models/language_model.py", line 202, in load_language_model
    dropout=state["dropout"],
  File "/opt/conda/lib/python3.6/site-packages/flair/models/language_model.py", line 63, in __init__
    self.to(flair.device)
  File "/opt/conda/lib/python3.6/site-packages/torch/nn/modules/module.py", line 465, in to
    return self._apply(convert)
  File "/opt/conda/lib/python3.6/site-packages/flair/models/language_model.py", line 404, in _apply
    for info in torch.__version__.replace("+",".").split('.') if info.isdigit())
ValueError: not enough values to unpack (expected at least 3, got 2)

Did you test with this container? It seems like a common and important container to verify, as it is Nvidia's official container optimized for PyTorch applications.

Thank you very much for your support! :)

@tylerlekang
Author

tylerlekang commented Jul 7, 2020

@alanakbik In the models/language_model.py code, the first line of _apply is (starts at line 402):

major, minor, build, *_ = (int(info)
                                for info in torch.__version__.replace("+",".").split('.') if info.isdigit())

If I simply run torch.__version__.replace("+",".").split('.') in the container (Python 3.6.10), it returns ['1', '6', '0a0', '9907a3e']. Then, if I run for i in (int(info) for info in torch.__version__.replace("+",".").split('.') if info.isdigit()): print(i), it prints:

1
6

However, on my local machine running vanilla Python 3.7.7, the torch version is just 1.5.0, so this is likely the problem.

I have no idea why Nvidia chose a PyTorch version with letters in the version number, but they did make this choice, and this container is supposed to be an easy solution for highly optimized GPU runs on their hardware.

Do you have any workaround ideas?
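
One possible workaround would be to parse the version leniently, taking the leading digit runs instead of discarding any component that is not purely numeric. A sketch only; parse_torch_version is a hypothetical helper, not part of Flair:

```python
import re

def parse_torch_version(version):
    # Hypothetical lenient parser: take the first three digit runs,
    # padding with zeros if fewer than three are found.
    numbers = re.findall(r"\d+", version)[:3]
    numbers += ["0"] * (3 - len(numbers))
    return tuple(int(n) for n in numbers)

print(parse_torch_version("1.6.0a0+9907a3e"))  # (1, 6, 0)
print(parse_torch_version("1.5.0"))            # (1, 5, 0)
```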

@tylerlekang
Author

@alanakbik could I just hardcode the major, minor, and build numbers in my local version of language_model.py if there is no workaround?

major = 1
minor = 6
build = 0

It seems the code just checks that major.minor is >= 1.4? But I don't want to break any other parts of the code.

@alanakbik
Collaborator

Yes, I guess you could just overwrite torch.__version__ as a quick fix by calling this before your script:

import torch

torch.__version__ = '1.5.0'

Meanwhile, I will put in a PR to fix the error.
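
With the version string overwritten, the generator expression in language_model.py unpacks cleanly again (simulated below without importing torch or flair; '1.5.0' is just a stand-in string above 1.4.0):

```python
fake_version = "1.5.0"  # what torch.__version__ reports after the overwrite
major, minor, build, *_ = (int(info)
                           for info in fake_version.replace("+", ".").split(".")
                           if info.isdigit())
print(major, minor, build)  # 1 5 0
```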

@tylerlekang
Author

@alanakbik just wanting to triple-confirm: that shouldn't cause any problems with the rest of the FlairEmbeddings or LanguageModelTrainer code? Thank you!

@alanakbik
Collaborator

It shouldn't cause any problems on the flair side. We only use the string to determine whether an old version of torch (< 1.4.0) is being used, so changing it to any other string above 1.4.0 won't change anything.
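
In other words, the parsed version only feeds a comparison along these lines (an illustration of the described check; the function name is hypothetical, not Flair's actual code):

```python
def is_old_torch(major, minor, build):
    # Hypothetical sketch: anything below torch 1.4.0 counts as "old"
    return (major, minor, build) < (1, 4, 0)

print(is_old_torch(1, 5, 0))  # False -- the overwritten '1.5.0' passes the check
print(is_old_torch(1, 3, 1))  # True
```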
