Skip to content

gpt-oss Training Fails (20B) #84

@linkspreed

Description

@linkspreed

Hello,

we try to train the GPT OSS 20B with a Colab T4 GPU and receive this error:

AttributeError                            Traceback (most recent call last)
[/tmp/ipython-input-3842036555.py](https://localhost:8080/#) in <cell line: 0>()
      1 from trl import SFTConfig, SFTTrainer
      2 from transformers import DataCollatorForSeq2Seq
----> 3 trainer = SFTTrainer(
      4     model = model,
      5     tokenizer = tokenizer,

4 frames
[/usr/local/lib/python3.11/dist-packages/torch/nn/modules/module.py](https://localhost:8080/#) in __delattr__(self, name)
   2075             del self._modules[name]
   2076         else:
-> 2077             super().__delattr__(name)
   2078 
   2079     def _register_state_dict_hook(self, hook):

AttributeError: 'PeftModel' object has no attribute '_flag_for_generation'

Do you have any solution for that?
Thank You very much!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions