[Feature request] Change default text generation model from gpt2 to distilgpt2 #45

chelouche9 · 2023-03-22T09:55:40Z

Currently, the default model for text generation is gpt2. It doesn't perform well, and I could not configure it to work correctly.

I used distilgpt2 and it works great out of the box. I want to create a PR and change it. @xenova What do you think?

xenova · 2023-03-22T14:12:42Z

The only reason I use gpt2 as the default is that HF uses it as its default:

I think default performance might improve once I add more generation parameters (no repeat n grams, etc.)

xenova · 2023-04-06T17:18:17Z

> I think default performance might improve once I add more generation parameters (no repeat n grams, etc.)

We recently added the repetition_penalty and no_repeat_ngram_size generation parameters, by the way :)
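For readers unfamiliar with these two parameters, here is a minimal pure-Python sketch of what they usually do during decoding. It is an illustration under stated assumptions, not this library's implementation: the helper names (`banned_next_tokens`, `apply_repetition_penalty`, `pick_next_token`) are hypothetical, logits are modelled as a plain dict from token id to score, and the penalty follows the common divide-if-positive, multiply-if-negative rule.

```python
from typing import Dict, List, Set


def banned_next_tokens(generated: List[int], no_repeat_ngram_size: int) -> Set[int]:
    """Tokens that would complete an n-gram already present in `generated`."""
    n = no_repeat_ngram_size
    if n <= 0 or len(generated) < n:
        return set()
    # The last n-1 tokens are the prefix of the n-gram about to be completed.
    prefix = tuple(generated[len(generated) - (n - 1):]) if n > 1 else ()
    banned: Set[int] = set()
    for i in range(len(generated) - n + 1):
        ngram = generated[i:i + n]
        if tuple(ngram[:-1]) == prefix:
            banned.add(ngram[-1])
    return banned


def apply_repetition_penalty(
    logits: Dict[int, float], generated: List[int], penalty: float
) -> Dict[int, float]:
    """Lower the score of every token that has already been generated."""
    adjusted = dict(logits)
    for token in set(generated):
        if token in adjusted:
            score = adjusted[token]
            # With penalty > 1, both branches push the score down.
            adjusted[token] = score / penalty if score > 0 else score * penalty
    return adjusted


def pick_next_token(
    logits: Dict[int, float],
    generated: List[int],
    repetition_penalty: float = 1.0,
    no_repeat_ngram_size: int = 0,
) -> int:
    """Greedy choice after applying both constraints (hypothetical helper)."""
    scores = apply_repetition_penalty(logits, generated, repetition_penalty)
    for token in banned_next_tokens(generated, no_repeat_ngram_size):
        if token in scores:
            scores[token] = float("-inf")
    return max(scores, key=scores.get)
```

With `generated = [5, 6, 7, 5, 6]` and `no_repeat_ngram_size=3`, token 7 is banned because it would repeat the trigram 5, 6, 7. This is the kind of looping output that small models like gpt2 tend to produce under plain greedy decoding.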

chelouche9 added the enhancement (New feature or request) label Mar 22, 2023

chelouche9 closed this as completed Apr 6, 2023
