Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is there any option for fine-tuning mt5 models instead of training from scratch? #6

Closed
farshadfiruzi opened this issue Jul 26, 2021 · 2 comments

Comments

@farshadfiruzi
Copy link

Hi,
Thanks for the amazing simpleT5 package. I use the following script to train a mt5 model for summarization task.

from simplet5 import SimpleT5

model = SimpleT5()

model.from_pretrained(model_type="mt5", model_name="google/mt5-small")

model.train(train_df=train_df, eval_df=test_df, source_max_token_len=256, target_max_token_len=64, batch_size=8, max_epochs=3, use_gpu=True, outputdir = "outputs", early_stopping_patience_epochs = 0, )

When I run this code, training start from scratch. My question is that is there any flag to fine-tune the mt5 model instead of training from scratch?

@farshadfiruzi farshadfiruzi changed the title Is there any option for fine-tuning mt5 models instead of train from scratch? Is there any option for fine-tuning mt5 models instead of training from scratch? Jul 26, 2021
@Shivanandroy
Copy link
Owner

model.train() finetunes the pre-trained model on your dataset. It doesn't train the model from scratch.

@farshadfiruzi
Copy link
Author

Thank you so much.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants