Skip to content

vllm-project/vllm projects

Search results

  • Ray

    #7 updated Mar 9, 2025
    Tracks Ray issues and pull requests in vLLM
  • DeepSeek V3/R1 Template

    #5 updated Mar 7, 2025
    2025-02-25: DeepSeek V3/R1 is supported with optimized block FP8 kernels, MLA, MTP spec decode, multi-node PP, EP, and W4A16 quantization
  • #6 updated Mar 7, 2025
    A list of onboarding tasks for first-time contributors to get started with vLLM.
  • #1 updated Feb 28, 2025
    [Testing] Optimize V1 PP efficiency.
  • #2 updated Feb 28, 2025