Fix CUDA graph compilation #627

Merged
tgaddair merged 6 commits into main from fix-compile
Oct 2, 2024

Conversation

@tgaddair
Contributor

@tgaddair tgaddair commented Oct 2, 2024

CUDA graph compilation has been broken since we added FlashInfer and prefix caching support. This fixes the issues and adds some flexibility to how it works.

@tgaddair tgaddair requested a review from noah-yoshida October 2, 2024 22:16
@tgaddair tgaddair merged commit e3f7d6e into main Oct 2, 2024
@tgaddair tgaddair deleted the fix-compile branch October 2, 2024 22:32

2 participants