-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Training data , Data Format #2
Comments
Hi @saramoeini20,
I hope this answers your questions. |
So for training only M2 format is needed for GEC or in your case it is that way? |
And also you used base models output only or used their models too? |
Yes, only the M2 format is needed. ESC only requires the outputs, you don't need the models' weights. |
Thank you. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Hello,
I have two questions,
I saw dev-text folder and it had more than one .txt file. I know about source/target file ( it has incorect/correct pair , right?) but what about other files that has same sentences with different order/grammar?
From what I have found, for training phase we need source/target file (incorrect/correct sentence) but what about M2 format? Is it for just enhancing the model or other things?
The text was updated successfully, but these errors were encountered: