cuda: dropping some cuda deps for 0.19.0 #3

Open
d-goldin wants to merge 10 commits into CertainLach:push-lklxouywkrnv from d-goldin:vllm-0.19-cuda-deps

Conversation

@d-goldin

From what I can tell, nvidia-cudnn-frontend was only added to pin transitive dependencies of flashinfer, which doesn't affect us in nixpkgs right now.

Cubins were added to get some AOT-compiled kernels, primarily for Blackwell. JIT should still work fine, and we can update the flashinfer package as well over time.

See:

- vllm-project/vllm#36719
- vllm-project/vllm#37233

CertainLach and others added 10 commits April 6, 2026 23:04
Co-authored-by: Luna Nova <git@lunnova.dev>
- Bumps triton to a newer version; the older one didn't
  work for me with 0.17.
- Drops quack-kernels and cuteDSL from dependencies.
  From what I can tell, those are only needed for FA4
  and would also require some NVIDIA blobs. We are at FA2
  right now, so this shouldn't remove any functionality
  that was present before.
- Adds NCCL to wrapper args, for better UX.
From what I can tell, nvidia-cudnn-frontend was only added
to pin transitive dependencies of flashinfer, which doesn't affect
us in nixpkgs right now.

Cubins were added to get some AOT-compiled kernels, primarily for
Blackwell. JIT should still work fine, and we can update
the flashinfer package as well over time.

See:

- vllm-project/vllm#36719
- vllm-project/vllm#37233
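
One way to sanity-check a build after dropping these dependencies is to confirm which optional modules are actually importable in the resulting environment. The sketch below is only illustrative: the helper `find_missing` is hypothetical, and the module names in `OPTIONAL` are assumptions about what the dropped packages expose at the top level, not names taken from this PR.

```python
import importlib.util


def find_missing(modules):
    """Return the names in `modules` that cannot be located for import.

    Only top-level module names are expected here; nothing is imported,
    the import system is merely asked whether it can find each module.
    """
    return [name for name in modules if importlib.util.find_spec(name) is None]


# Assumed top-level module names for the dropped optional packages.
# Adjust these to whatever the packages really provide.
OPTIONAL = ["cudnn", "flashinfer_cubin"]

if __name__ == "__main__":
    missing = find_missing(OPTIONAL)
    for name in OPTIONAL:
        state = "missing" if name in missing else "available"
        print(f"{name}: {state}")
```

Run inside the packaged Python environment, this reports each assumed module as available or missing, which makes it easy to confirm that the JIT path is the one being exercised.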
@CertainLach force-pushed the push-lklxouywkrnv branch 6 times, most recently from dc73532 to ba2de00 on April 24, 2026 01:42