Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cannot download models (biobert) #57

Open
si-rob opened this issue Mar 20, 2020 · 3 comments
Open

Cannot download models (biobert) #57

si-rob opened this issue Mar 20, 2020 · 3 comments

Comments

@si-rob
Copy link

si-rob commented Mar 20, 2020

trying to run nboost with Tensorflow using Biobert model. Getting the following stack trace.

:resolve_model:[__i:res: 43]:Extracting "/usr/local/lib/python3.6/dist-packages/nboost/.cache/biobert-base-uncased-msmarco" from /usr/local/lib/python3.6/dist-packages/nboost/.cache/biobert-base-uncased-msmarco.tar.gz                                                                                                                                                                                                                
 Traceback (most recent call last):                                                                                                                                                                                    
   File "/usr/local/bin/nboost", line 8, in <module>                                                                                                                                                                   
     sys.exit(main())                                                                                                                                                                                                  
   File "/usr/local/lib/python3.6/dist-packages/nboost/__main__.py", line 10, in main                                                                                                                                  
     proxy = Proxy(**vars(args))                                                                                                                                                                                       
   File "/usr/local/lib/python3.6/dist-packages/nboost/proxy.py", line 56, in __init__                                                                                                                                 
     **cli_args)  # type: RerankModelPlugin                                                                                                                                                                            
   File "/usr/local/lib/python3.6/dist-packages/nboost/plugins/models/__init__.py", line 44, in resolve_model                                                                                                          
     extract_tar_gz(binary_path, data_dir)                                                                                                                                                                             
   File "/usr/local/lib/python3.6/dist-packages/nboost/helpers.py", line 96, in extract_tar_gz                                                                                                                         
     tar = tarfile.open(fileobj=fileobj)                                                                                                                                                                               
   File "/usr/lib/python3.6/tarfile.py", line 1576, in open                                                                                                                                                            
     raise ReadError("file could not be opened successfully")                                                                                                                                                          
 tarfile.ReadError: file could not be opened successfully

Looks like there's an issue with the hosted file: https://storage.googleapis.com/koursaros/biobert-base-uncased-msmarco.tar.gz I'm getting:

<Code>UserProjectAccountProblem</Code>
<Message>User project billing account not in good standing.</Message>
<Details>
The billing account for the owning project is disabled in state closed
</Details>
</Error>```
@jassem123
Copy link

Encountering the same issue , I guess this means we need to introduce custom pretrained models to nboost

@MartinXPN
Copy link

MartinXPN commented Mar 31, 2020

Is there any place where these models can be accessed for a quick fix?

One very tricky workaround that worked for me was:

  • manually download the model from huggingface repo: https://huggingface.co/nboost/pt-tinybert-msmarco#
  • when running docker/docker-compose:
    • use image: koursaros/nboost:0.3.3-pt
    • bind volume: path/to/manually/downloaded/model/on/host to /opt/conda/lib/python3.6/site-packages/nboost/.cache/nboost/pt-tinybert-msmarco
    • set --model "PtBertRerankModelPlugin"
    • set --model_dir "nboost/pt-tinybert-msmarco"

@SagarPalyal
Copy link
Contributor

SagarPalyal commented Jul 3, 2020

Hi MartinXPN,

Would you please explain how did you download the model manually? when I go to provided repo url, I don't see any download option.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants