-
Notifications
You must be signed in to change notification settings - Fork 53
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Missing apps-train-files json file? #4
Comments
Could you please provide more details on what you're trying to do? I couldn't understand from your description. |
I couldn't find the file this line requires in the dataset (https://github.com/hendrycks/apps/blob/main/train/tune_apps_gpt.py#L159). But I think it is just a json file with list of training folders. |
I'll update you later tonight. I need to redownload the apps dataset as I thought we included it in there. |
Not a big issue because the json file can be inferred from the APPSBaseDataset.py file. Btw, I wonder how many gpus need to fine-tune the gpt-neo on apps. I saw the batch size per replica is only 2. |
I added the instructions here: https://github.com/hendrycks/apps/blob/main/train/README.md As for how many GPUs I believe it is listed in the paper. I can't remember the numbers off hand. |
Hi,
Thank you for releasing this amazing codebase! I found that the appsdata need to take apps-train-files json file as an input but I couldn't find anything in the provided apps dataset. I wonder if I am missing somewhere.
Thanks!
The text was updated successfully, but these errors were encountered: