New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add TSSB-3M dataset #2693
add TSSB-3M dataset #2693
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Had a few questions, awesome job nevertheless! This will make a great addition to the instruciton datasets.
…y generate synonymous instructions 3. filter invalid commit mesage
❌ pre-commit failed. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The .tsv
data file will need removing before merge
|
I would advocate having it in a HF repo and downloading it as part of the script. But I guess it is reasonably small so maybe not a big problem |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
add the TSSM-3M code bugs dataset
issue: #1395