How to enable vllm #536
Feature request
How to enable vllm
Motivation
How to enable vllm
Your contribution
How to enable vllm
Comments
Use 0.9 and a supported model.
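For readers looking for a concrete starting point, here is a minimal sketch of querying a text-generation-inference 0.9 server over its REST API once it is running with a supported (flash) model. The URL, prompt, and generation parameters below are placeholder assumptions for illustration, not values from this thread:

```python
# Hypothetical example: querying a running text-generation-inference 0.9
# server. Assumes the server was launched with a flash-attention-supported
# model; the host and port are placeholders for a local deployment.
import requests

TGI_URL = "http://127.0.0.1:8080/generate"  # assumed endpoint

payload = {
    "inputs": "What is PagedAttention?",
    "parameters": {"max_new_tokens": 64},
}

resp = requests.post(TGI_URL, json=payload, timeout=60)
resp.raise_for_status()
print(resp.json()["generated_text"])
```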
@OlivierDehaene Hi, I don't know if you are staff at Hugging Face or not,
Please reread your initial "Feature request" and tell me you made the effort to actually express a feature request intelligibly. There are many ways you could have phrased it, starting with a request to improve the docs because you couldn't find what you wanted. What is the actual question you had, where did you look for an answer, and what did you find instead? Also, fill in the template properly instead of repeating the same thing over and over. Our effort in replying scales with your effort.
There's no doc for it because, well, it's not necessary: if you use flash models, you just get it. It's also NOT vllm. It's a custom variant of PagedAttention, which is what makes vllm faster. We do reuse a slightly modified version of their low-level kernel.
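For intuition only, here is a toy sketch of the paged KV-cache bookkeeping that PagedAttention-style kernels rely on: the cache is split into fixed-size blocks, and each sequence keeps a block table mapping its logical token positions to physical blocks, so memory is claimed on demand rather than reserved for the maximum length up front. All names and sizes are invented for illustration; this is not the text-generation-inference or vllm code:

```python
# Toy illustration of paged KV-cache bookkeeping (names and sizes invented;
# NOT the text-generation-inference or vllm implementation).
BLOCK_SIZE = 16  # tokens per cache block, an assumed value

class PagedKVCache:
    def __init__(self, num_blocks: int):
        self.free_blocks = list(range(num_blocks))   # pool of physical blocks
        self.block_tables = {}                       # seq_id -> [block ids]
        self.lengths = {}                            # seq_id -> token count

    def append_token(self, seq_id: int) -> None:
        """Reserve cache space for one new token of a sequence."""
        table = self.block_tables.setdefault(seq_id, [])
        length = self.lengths.get(seq_id, 0)
        if length % BLOCK_SIZE == 0:                 # current block is full
            if not self.free_blocks:
                raise MemoryError("KV cache exhausted")
            table.append(self.free_blocks.pop())     # allocate on demand
        self.lengths[seq_id] = length + 1

    def free(self, seq_id: int) -> None:
        """Return a finished sequence's blocks to the pool."""
        self.free_blocks.extend(self.block_tables.pop(seq_id, []))
        self.lengths.pop(seq_id, None)

cache = PagedKVCache(num_blocks=8)
for _ in range(20):
    cache.append_token(seq_id=0)  # 20 tokens -> 2 blocks of 16
print(cache.block_tables[0])      # [7, 6]: non-contiguous physical blocks
cache.free(0)
```

The point of the block table is that a sequence's cache no longer has to be contiguous, which is what lets a server pack many concurrent sequences into one fixed memory pool.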
@Narsil Hi, where is the custom PagedAttention kernel included?