Completely refactor injection code #16

tomaarsen · 2023-10-12T12:32:44Z

Hello!

Pull Request overview

Completely refactor injection code

Details

The injection now happens at the end of the regular from_pretrained call, which also makes it possible on the AutoModel... classes. This was not possible before, and it was the main motivation for this refactor. As a result, architectures that require trust_remote_code=True, such as Qwen, can also benefit from attention_sinks.

This also removes a fair amount of code duplication.
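
To illustrate the idea, here is a minimal, self-contained sketch of the "inject at the end of from_pretrained" pattern. It is not the code from this PR: RemoteModel, AutoLoader, InjectSinksMixin and SinkAutoLoader are hypothetical stand-ins that use no real library, and the attention_sink_size keyword is assumed here purely for illustration. The point is that the regular loader runs first, and the patch is applied to whatever object it returns, so the wrapper does not need to know the concrete architecture in advance.

```python
class RemoteModel:
    """Hypothetical stand-in for an architecture only known at load time."""

    def __init__(self, name):
        self.name = name
        self.attention = "dense"


class AutoLoader:
    """Hypothetical stand-in for an Auto-style class with a regular loader."""

    @classmethod
    def from_pretrained(cls, name, **kwargs):
        # The concrete class is decided by the loader, not by the caller.
        return RemoteModel(name)


class InjectSinksMixin:
    """Hypothetical mixin: run the regular loader, then patch its result."""

    @classmethod
    def from_pretrained(cls, name, **kwargs):
        # Pop our own option so the regular loader never sees it.
        sink_size = kwargs.pop("attention_sink_size", 4)
        model = super().from_pretrained(name, **kwargs)
        # Injection is the last step, applied to the returned object.
        cls._inject(model, sink_size)
        return model

    @staticmethod
    def _inject(model, sink_size):
        # Placeholder for the real patching of the attention layers.
        model.attention = f"sinks(size={sink_size})"


class SinkAutoLoader(InjectSinksMixin, AutoLoader):
    """Mixin first in the MRO, so its from_pretrained wraps the regular one."""


model = SinkAutoLoader.from_pretrained("demo", attention_sink_size=4)
print(type(model).__name__, model.attention)
```

Because the mixin only touches the returned object, the same wrapper works whether the loader builds a built-in architecture or one that comes from remote code.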

Tom Aarsen

Completely refactor injection code

fc58461

tomaarsen merged commit e0ab568 into main Oct 12, 2023

tomaarsen deleted the refactor/injection branch October 12, 2023 12:33

EGjoni mentioned this pull request Oct 13, 2023

Trying a minimal example with LlamaForCasualLM, sadly it fails #1

Closed
