
Add support for GPT-neo and GPT-J #2

Closed
chijames opened this issue Feb 20, 2022 · 1 comment
Labels
enhancement New feature or request

Comments

@chijames

Hi,

Thanks for the great project! Do you plan to support GPT-neo and GPT-J down the road?

@LiuYuemei111 LiuYuemei111 added the enhancement New feature or request label Feb 21, 2022
@ShengdingHu
Collaborator

Thanks for your comment! The project is basically compatible with any PyTorch-based model.
To apply OpenDelta to a new model that we haven't tested, please follow the instructions in the docs. In short:

  1. Visualize and understand the parameter/module structure of the pre-trained model using Visualization.
  2. Specify the modified modules according to your knowledge of the delta methods. For example, if applying LoRA, any linear layer can be specified; if applying adapters, any submodule whose output is a hidden state can be specified.
  3. Freeze the irrelevant parameters.
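The three steps above can be sketched in plain PyTorch (this is an illustration of the idea, not OpenDelta's actual API; the toy `Block` model and the `LoraLinear` wrapper are hypothetical stand-ins for a real pre-trained model and a real delta method):

```python
import torch
import torch.nn as nn


# Hypothetical stand-in for one block of a pre-trained model.
class Block(nn.Module):
    def __init__(self, d=16):
        super().__init__()
        self.attn = nn.Linear(d, d)
        self.ffn = nn.Linear(d, d)

    def forward(self, x):
        return self.ffn(torch.relu(self.attn(x)))


model = Block()

# Step 1: inspect the module structure to find candidate layers
# (OpenDelta's Visualization plays this role for real models).
linear_names = [n for n, m in model.named_modules() if isinstance(m, nn.Linear)]


class LoraLinear(nn.Module):
    """Wraps a linear layer with a trainable low-rank (LoRA-style) delta."""

    def __init__(self, base: nn.Linear, r: int = 4):
        super().__init__()
        self.base = base
        # B starts at zero so the wrapped layer initially matches the base.
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))

    def forward(self, x):
        return self.base(x) + x @ self.A.t() @ self.B.t()


# Step 2: specify the modified modules -- for LoRA, any linear layer works.
for name in linear_names:
    setattr(model, name, LoraLinear(getattr(model, name)))

# Step 3: freeze everything except the delta parameters.
for n, p in model.named_parameters():
    p.requires_grad = n.split(".")[-1] in ("A", "B")
```

Only the low-rank `A`/`B` parameters remain trainable; the original weights are frozen, which is the pattern the three steps describe.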

However, as a kind reminder, prefix-tuning is not supported because of the complicated processing of the attention mask it requires.

If you still have any other questions, feel free to let us know.


3 participants