-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Multi-task batching #30
Comments
Hi @einarbmag, sorry for the long delay. |
Hi @einarbmag, here is one possible implementation we can use for a batch containing examples from multiple tasks: Assume
We can obtain the required IA^3 vectors for each example by using Then, we modify the input activations, which have the shape |
Hi I have to use the mixed task batch so I'll do it if I need to,.. |
Hi @dodoyeon sorry we did not, but Muqeeth's sketch above can provide a starting point! |
In the paper, you mention that IA^3 is compatible with multi-task batching, a requirement to be comparable to ICL. Unfortunately, the current implementation of Huggingface PEFT does not support this, and it would apparently be a big refactoring to do so huggingface/peft#759.
Do you know of an implementation or example that shows how to do this?
The text was updated successfully, but these errors were encountered: