You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for your comment, the project is basically compatible with any PyTorch-based models.
To apply OpenDelta to a new model that we haven't tested, please follow the instructions in the docs, basically,
visualize and understand the parameter/modular structure of the pre-trained model using Visualization
specify the modified modules according to your knowledge about the delta methods, e.g., if applying Lora, any linear can be specified, if applying adapters, any submodules that have hidden state as the output can be specified.
freeze the irrelevant parameters.
However, as a kind reminder, prefix-tuning is not supported due to its compicated nature to process the attention mask.
If you still have any other questions, feel free to let us know.
Hi,
Thanks for the great project! Do you plan to support GPT-neo and GPT-J down the road?
The text was updated successfully, but these errors were encountered: