
[DOCS] add clarification in distilabel vLLM reference to specify dtype #326

Closed

kcentric opened this issue Feb 5, 2024 · 1 comment

kcentric (Contributor) commented Feb 5, 2024

Which page or section is this issue related to?

Currently, the code snippet in the vLLM section of the guide (https://distilabel.argilla.io/latest/technical-reference/llms/#vllm) looks like this:

llm = vLLM(
    model=LLM(model="argilla/notus-7b-v1"),
    task=TextGenerationTask(),
...

Running this as-is in a Colab notebook results in "ValueError: Bfloat16 is only supported on GPUs with compute capability of at least 8.0. Your Tesla T4 GPU has compute capability 7.5." This has been discussed in the vLLM issue tracker.

Since free-tier Colab users are typically assigned a Tesla T4 GPU, anyone who copies this snippet from our docs and test-runs vLLM in a notebook will hit the same error.
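
For context, the failure can be anticipated by checking the GPU's compute capability before constructing the engine. A minimal sketch using PyTorch (preinstalled on Colab and a vLLM dependency); the dtype-selection logic here is an illustration, not part of the docs snippet:

import torch

# bfloat16 kernels require compute capability >= 8.0 (Ampere or newer);
# the Colab Tesla T4 reports (7, 5), hence the ValueError above.
major, minor = torch.cuda.get_device_capability(0)
dtype = "bfloat16" if (major, minor) >= (8, 0) else "float16"
print(f"GPU compute capability {major}.{minor} -> use dtype={dtype}")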

I'd like to change the snippet to something like this:

llm = vLLM(
    model=LLM(model="argilla/notus-7b-v1", dtype="float16"),  # if using a Tesla T4 on Colab, specify
                                                              # dtype="float16" to avoid the compute capability error
    task=TextGenerationTask(),
)

and add a brief clarification about it in the text, with a link for anyone who wants to understand it further.
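
For reference, a self-contained version of the corrected snippet could look like the following. This is a sketch: the import paths are assumed from the distilabel 0.x docs of that era, and the dtype argument is the clarification being proposed:

from distilabel.llm import vLLM                   # assumed distilabel 0.x import path
from distilabel.tasks import TextGenerationTask   # assumed distilabel 0.x import path
from vllm import LLM

llm = vLLM(
    model=LLM(
        model="argilla/notus-7b-v1",
        dtype="float16",  # the Tesla T4 (compute capability 7.5) has no bfloat16 support
    ),
    task=TextGenerationTask(),
)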

davidberenstein1957 (Member) commented

Hi @kcentric, feel free to create a PR for this.

alvarobartt closed this as not planned on May 7, 2024