Adds a concrete finetuning example from a custom dataset #156
This is going to be useful! Everything looks clear and merge-ready, except that we need to address the fact that arc_easy is under the CC-BY-SA license, which requires attribution. We can't put the attribution in the JSONL file, since JSONL doesn't support comments. I think we are fine adding it to the finetune_example readme. The requirements are
Where "appropriate credit" means "you must provide the name of the creator and attribution parties, a copyright notice, a license notice, a disclaimer notice, and a link to the material." So I'd add to the readme (correcting what I said about the split, if we used a different one)
I am not sure why they gave it a commercial license but then added a noncommercial disclaimer.
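To illustrate the point above about JSONL not supporting comments, here is a minimal sketch showing that a line-by-line JSON parser rejects a leading `#` attribution line. The `parse_jsonl` helper and the `prompt`/`response` field names are hypothetical and chosen for illustration only; they are not taken from this PR's dataset or loader.

```python
import json


def parse_jsonl(text):
    """Parse JSONL text: every non-empty line must be a valid JSON value."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]


# Hypothetical records; the field names are assumptions, not the PR's schema.
clean = (
    '{"prompt": "Q1", "response": "A1"}\n'
    '{"prompt": "Q2", "response": "A2"}\n'
)

# The same data with an attribution comment prepended as the first line.
with_comment = "# License: CC-BY-SA\n" + clean

records = parse_jsonl(clean)

try:
    parse_jsonl(with_comment)
    comment_ok = True
except json.JSONDecodeError:
    # The comment line is not valid JSON, so parsing fails here.
    comment_ok = False
```

Since the comment line breaks parsing, keeping the attribution in the readme, as suggested above, leaves the data file loadable as-is.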
🤩
* Add default to download script and adjust yamls Co-authored-by: dblalock <davis@mosaicml.com>
Adds a finetune_example directory in the train directory, which includes: