Skip to content
This repository has been archived by the owner on Nov 3, 2023. It is now read-only.

Allow flan-t5 models in ParlAI with fp16 improvment #4875

Merged
merged 2 commits into from Nov 11, 2022
Merged

Conversation

dexterju27
Copy link
Contributor

Patch description
The current setting does allow flan-t5 model to run under parlai hugging face agent seamlessly. Adding Flan-t5 options in the agent.

Testing steps

parlai train_model -t convai2  -dt train --batchsize 1  --fp16 True  --gradient-clip 1.0 --label-truncate 256 --text-truncate 512 --log-every-n-secs 30 --lr-scheduler reduceonplateau --max-train-time 169344.0 --model hugging_face/t5 --save-after-valid True --skip-generation True --optimizer adam -lr 1e-05 -veps 0.15 -vmm min -vmt ppl -vp 15 -tblog True --t5-model-arch google/flan-t5-large  --t5-model-parallel True --dict-tokenizer gpt2

@dexterju27 dexterju27 changed the title Allow flan-t5 models in ParlAI Allow flan-t5 models in ParlAI with fp16 improvment Nov 11, 2022
Copy link
Contributor

@klshuster klshuster left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks!

@dexterju27 dexterju27 merged commit c4d7b08 into main Nov 11, 2022
@dexterju27 dexterju27 deleted the allow-flan-t5 branch November 11, 2022 18:41
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants