Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Could you please offer a script for gpt2 to generate the training data? #1

Open
Luobots opened this issue Nov 2, 2022 · 1 comment

Comments

@Luobots
Copy link

Luobots commented Nov 2, 2022

I made a gpt2 script to generate the training data following your paper, but it did badly in qnli task, different to CTRL, the filtering rule is not suitable for gpt2, but I can't handle the rule myself.
Thank you.

@yumeng5
Copy link
Owner

yumeng5 commented Nov 3, 2022

Hi,

We have only used GPT-2 for generation on MNLI and SST-2 tasks, but the code adaptation for other tasks should be straightforward. You do need to change a few places like removing the control codes used in CTRL and how the termination of a generated sequence is determined -- this can be done by using certain punctuation marks based on the generation patterns of GPT-2 on the target task.

Best,
Yu

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants