-
Notifications
You must be signed in to change notification settings - Fork 124
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Finetuning script broken? #420
Comments
I believe that triton flash attention will not work on P100s. Could you try uninstalling |
Thank you for your quick response! Unfortunately I do not have a |
Apologies, I think I got the package wrong, and it's actually |
Did also not work for me unfortunately. However, I just switched to pretrain hf-bert, that works fine. Thank you for your help! |
Hey,
as finetuning after the import to transformers is not possible, I tried the finetuning script that you provide.
I tried to run the function 'test_classification_script()' from 'tests/test_classification.py' as a first step to test your finetuning framework.
To do so, I used a linux server with ubuntu and with 4 x NVIDIA Tesla P100 (16 GB).
For the setup, I followed all the steps that you recommend here, i.e.:
I have installed the cuda release 117, as the following output suggests:
'nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2022 NVIDIA Corporation
Built on Wed_Jun__8_16:49:14_PDT_2022
Cuda compilation tools, release 11.7, V11.7.99
Build cuda_11.7.r11.7/compiler.31442593_0'
To test your finetuning script, I simply did the following in the console:
Here is the complete output:
Note that I replaced in the output above the paths with my personal information by (...).
Also note that the commands
composer sequence_classification.py yamls/test/sequence_classification.yaml
composer sequence_classification.py yamls/test/sequence_classification.yaml model.name=mosaic_bert
yield the same error message.
Did I something wrong or is this an error in the code? I would be incredibly grateful for any guidance as I urgently need to fine-tune my model, but unfortunately, I'm currently facing the mentioned challenges that are preventing me from doing so.
Thank you very much!
The text was updated successfully, but these errors were encountered: