
Pretrain from scratch #30

Closed
agemagician opened this issue Dec 29, 2019 · 1 comment
Comments

@agemagician

Hi,

I am trying to pretrain from scratch, using the fine-tuning Colab example as a base for my code.
Everything runs fine except for the training part.

I have changed the training call to:

TRAIN_STEPS = 25000  #@param {type: "integer"}

model.train(
    mixture_or_task_name="ss3",
    steps=TRAIN_STEPS
)

It shows me the following error:
Required bindings for make_layer_stack not provided in config: ['layers']

I would assume this is a problem because I didn't define the model configuration.

Could you please let us know how to adjust the fine-tuning Colab example to pretrain from scratch?

@craffel
Collaborator

craffel commented Dec 31, 2019

When one calls model.finetune, as is done in the Colab, the operative gin config file that resides in the pretrained model directory is automatically loaded:
https://github.com/google-research/text-to-text-transfer-transformer/blob/master/t5/models/mtf_model.py#L308
Parsing this config file provides values for things like make_layer_stack. When using model.train, there is no pretrained model directory, so there is no operative config to load. One option is to decide on a model size you want to use and load the operative config for that model, e.g. for the Base model:

    import gin
    with gin.unlock_config():
      gin.parse_config_file("gs://t5-data/pretrained_models/base/operative_config.gin")

Note that you still may need to provide or modify other gin parameters.
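As one way to do that, extra bindings can be parsed after the operative config file, since later bindings override earlier ones. A sketch of such a fragment is below; the binding names are assumptions for illustration only, and the real names should be taken from the operative config you actually loaded:

```
# Hypothetical extra bindings, parsed after operative_config.gin.
# These parameter names are illustrative assumptions -- check the
# loaded operative config for the actual binding names it defines.
run.train_steps = 25000
run.batch_size = ("tokens_per_batch", 65536)
```

A fragment like this could be passed to gin.parse_config() inside the same gin.unlock_config() block, immediately after the gin.parse_config_file() call above.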
