-
Notifications
You must be signed in to change notification settings - Fork 58
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
unable to train mt5 from t5x using mixtures ValueError: Dataset is missing an expected feature during input_validation validation: 'inputs' #310
Comments
Hi @StephennFernandes , your target_to_key preprocessor doesn't look correct: |
@StephennFernandes Hi mate, have you done with data sampling for pre-training mT5? I am still stuck in pre-processing data for that (using Flax). |
@tarudesu the following is how i create seqio
|
Hey there,
I am currently pretraining mt5 model on 23 different languages. but when i create a mixture and set the mixture name in t5x .gin config file for training on the mixture i get the following error.
ValueError: Dataset is missing an expected feature during input_validation validation: 'inputs'
However when i individually ran the independent tasks by setting them in the gin file everything works fine.
the following is how my task.py file looks like.
upon running the mt5 model with the mixture name in the .gin file i get the following error:
The text was updated successfully, but these errors were encountered: