Q3 ROADMAP #30
Open · 8 of 21 tasks
robertgshaw2-neuralmagic opened this issue Jul 22, 2024 · 4 comments
Labels: roadmap (Items planned to be worked on)

robertgshaw2-neuralmagic (Sponsor, Collaborator) commented Jul 22, 2024

SUMMARY:

  • Avoid full pass through the model for quantization modifier
  • Data free oneshot
  • Runtime of GPTQ with large models (how do we handle a 70B model?)
  • Runtime of GPTQ with act order
  • Remove “naive-quantized”
  • Native integration of compressed-tensors into AutoModelForCausalLM (deprecate SparseAutoModel); see the sketch after this list
  • run-compressed in AutoModelForCausalLM
  • KV cache quantization
  • Best practices documentation
  • Improve runtime of GPTQ with sequential_update=True on multi-GPU setups
  • Support AWQ + other low bit methods
  • Support MoE models end-to-end through vllm
  • Support XXForConditionalLM and embedding models
  • Support Vision Language models
  • Documented examples of non-llama language models (gemma, phi, qwen, mixtral, deepseek, ... request others?)
  • Documented sparse transfer learning with TRL
  • Simplify SmoothQuant for non-llama models
  • Improve loading time of pack-quantized models in SparseAutoModel
  • Understand why the BoS token has an impact
  • Integration with PEFT
  • 2:4 Sparsity
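
For the compressed-tensors / AutoModelForCausalLM item above, here is a minimal sketch of the intended user flow, assuming the oneshot/GPTQModifier entry points as documented around this time; the model ID, output directory, and quantization scheme are illustrative examples, not part of this roadmap, and loading through plain AutoModelForCausalLM is the goal state rather than current behavior:

```python
# Illustrative sketch only: quantize with llm-compressor's oneshot API, then
# reload the compressed-tensors checkpoint directly through transformers'
# AutoModelForCausalLM (today this path goes through SparseAutoModel; the
# roadmap item is to make the vanilla transformers entry point work natively).
from llmcompressor.modifiers.quantization import GPTQModifier
from llmcompressor.transformers import oneshot
from transformers import AutoModelForCausalLM

MODEL_ID = "meta-llama/Meta-Llama-3-8B-Instruct"  # hypothetical example model
OUTPUT_DIR = "Meta-Llama-3-8B-Instruct-W4A16"     # hypothetical output path

# One-shot GPTQ quantization; scheme/targets/dataset are example values.
oneshot(
    model=MODEL_ID,
    dataset="open_platypus",
    recipe=GPTQModifier(scheme="W4A16", targets="Linear", ignore=["lm_head"]),
    output_dir=OUTPUT_DIR,
    max_seq_length=2048,
    num_calibration_samples=512,
)

# Goal state: load the compressed-tensors checkpoint with vanilla transformers,
# no SparseAutoModel wrapper required.
model = AutoModelForCausalLM.from_pretrained(
    OUTPUT_DIR,
    device_map="auto",
    torch_dtype="auto",
)
```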
robertgshaw2-neuralmagic added the roadmap (Items planned to be worked on) label and removed the enhancement (New feature or request) label Jul 22, 2024
robertgshaw2-neuralmagic changed the title from Near Term Items to Near Term Roadmap Aug 11, 2024
robertgshaw2-neuralmagic changed the title from Near Term Roadmap to Roadmap Aug 11, 2024
robertgshaw2-neuralmagic changed the title from Roadmap to ROADMAP Aug 11, 2024
robertgshaw2-neuralmagic changed the title from ROADMAP to Q3 ROADMAP Aug 11, 2024
halexan commented Sep 2, 2024

Looking forward to
Documented examples of non-llama language models (gemma, phi, qwen, mixtral, deepseek, ... request others?)

CharlesRiggins commented Sep 3, 2024

This paper can be helpful for "understanding why the BoS token has an impact".

IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact

It is found that such huge outliers usually occur at the [BOS] token and some other uninformative initial tokens (e.g., "." or ",") at particular channels, regardless of the rest of the input sequence. We thus name these tokens pivot tokens given their dominating values in the activation. The attention scores are concentrated more on these pivot tokens than on the rest, a.k.a. attention sinks (Xiao et al., 2024).

fengyang95 commented

May I ask when AWQ will be supported?

robertgshaw2-neuralmagic (Sponsor, Collaborator, Author) commented

"May I ask when AWQ will be supported?"

We are actively working on this now; ideally it will land in a week or so.
