You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello! Thanks for your idea and codes, and I am applying the code to my model. There are two questions for me now:
The paper says "greedily search" over the parameters and have a "simple compose strategy" when composing reductions across layers。Does this mean that search the best rate in different later MLP layers and then simply compose them?
Can I use a single command line to realize composing reductions across layers? Or i need to repeat doing intervention on a single layer for a few times to compose reductions?
Thank you!
The text was updated successfully, but these errors were encountered:
There might be a dependence on whether a reduction at a certain layer is helpful, given there is a reduction made in another layer. Therefore, we cannot just compose reductions after individually searching for the best rate per layer, while holding other layers fixed.
Added a script that walks through our procedure for this.
We wanted to check if the benefits of performing laser across layers are additive or not, so we employed a simple strategy. The current strategy we use is as follows:
1. Initialize a vector that represents different amounts of reduction across each of the different layers
2. Edit the model with Laser starting from the final layer (restricted over a set $\rho$ values)
3. Validate over the validation set of the dataset
4. use the signal from step 2 to reduce or increase the amount of reduction
5. repeat until convergence
6. return the vector of reductions
I have also added a script that walks through this procedure under the scripts directory.
One thing to note: This search procedure is probably not the most efficient and only performs a sparse search over possible $\rho$ values and only over the encoder MLP layers ('MLP_FC_IN') layers. A more thorough search might result in additional increased improvements!
Hello! Thanks for your idea and codes, and I am applying the code to my model. There are two questions for me now:
Thank you!
The text was updated successfully, but these errors were encountered: