Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[spike] evaluate + potentially prototype out-of-tree backend registration #1557

Closed
4 tasks
Titus-von-Koeller opened this issue Mar 6, 2025 · 1 comment
Closed
4 tasks
Assignees
Labels
Milestone

Comments

@Titus-von-Koeller
Copy link
Collaborator

Titus-von-Koeller commented Mar 6, 2025

review:

Matthew:

  • "The way pytorch uses the package entry points in that blog is more or less what I saw adopted by vLLM."
  • "And, as far as I am aware, as long as our custom ops are defined first and then we load a custom backend package, the entry point for that would be perfect capable of just registering its implementations for those ops. I think even it can override a backend implementation we provide, e.g. let's say a bitsandbytes-ipex has cpu optimizations. It would just have to be loaded after ours."
  • "Regarding the out of tree topic on certain backends, this is also something I see on vLLM, just for reference. They have Ascend NPU and IBM Spyre as out-of-tree community supported plugins."
@Titus-von-Koeller
Copy link
Collaborator Author

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant