This repository has been archived by the owner on Jun 24, 2024. It is now read-only.

Implement SuperHOT/interpolated RoPE support #378

Closed
philpax opened this issue Jul 17, 2023 · 1 comment · Fixed by #389
Assignees: LLukas22
Labels: issue:enhancement (New feature or request), model:llama (LLaMA model)

Comments

@philpax
Collaborator

philpax commented Jul 17, 2023

Another llama.cpp feature that seems to have shrunk the paper-to-implementation pipeline to less than one week!

This allows for a much longer context (assuming you have the (V)RAM for it).

We can probably close out #77 if this is done.

@philpax philpax added issue:enhancement New feature or request model:llama LLaMA model labels Jul 17, 2023
@LLukas22
Contributor

To do this, we only need a new rope_scaling model parameter. Or am I missing something?
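For context, linear RoPE interpolation of the kind this issue asks for boils down to scaling the position index before computing the rotary angles, so that positions beyond the trained context length map back into the trained range. The sketch below is illustrative only (the function names are not the crate's API), assuming the standard RoPE frequency schedule with base 10000:

```python
import math

def rope_frequencies(dim, base=10000.0):
    # Inverse frequency for each rotated pair of dimensions.
    return [base ** (-2 * i / dim) for i in range(dim // 2)]

def rope_angles(pos, dim, scale=1.0, base=10000.0):
    # With linear interpolation, the position index is multiplied by
    # `scale` (trained_ctx / target_ctx), compressing the extended
    # context into the position range the model was trained on.
    return [pos * scale * f for f in rope_frequencies(dim, base)]

# E.g. a model trained on a 2048-token context, extended to 8192:
scale = 2048 / 8192  # 0.25
# Position 8191 then behaves like position 8191 * 0.25 = 2047.75
# at the original scale, staying inside the trained range.
angles = rope_angles(8191, dim=128, scale=scale)
```

So a single `rope_scaling` factor applied at the point where the rotary angles are computed is indeed all the math requires; the rest is plumbing the parameter through model loading.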

@LLukas22 LLukas22 self-assigned this Jul 26, 2023