Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

epic: support tensorrt-llm batch manager #828

Closed
0xSage opened this issue Jul 4, 2024 · 1 comment
Closed

epic: support tensorrt-llm batch manager #828

0xSage opened this issue Jul 4, 2024 · 1 comment
Assignees
Labels
P1: important Important feature / fix type: feature request A new feature

Comments

@0xSage
Copy link
Contributor

0xSage commented Jul 4, 2024

our research team uses batch size ~128 to generate synthetic data.

we should dogfood cortex for this.

add BM to cortex, so we can use our own software for training purposes.
https://nvidia.github.io/TensorRT-LLM/advanced/batch-manager.html

@0xSage 0xSage added P1: important Important feature / fix type: feature request A new feature labels Jul 4, 2024
@vansangpfiev
Copy link
Contributor

Related ticket: janhq/cortex.tensorrt-llm#51

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
P1: important Important feature / fix type: feature request A new feature
Projects
Archived in project
Development

No branches or pull requests

2 participants