You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I've also read another paper along these lines too (but can't find it atm) that suggests the layers marked by the yellow areas at the bottom:
are just doing some kind of "averaging", and IIRC the paper suggests replacing these penultimate layers with some other much cheaper operation (IIRC, it was a training time modification rather than post-training though).
Not all Layers of LLMs are Necessary during Inference (v2)
The text was updated successfully, but these errors were encountered: