The current TRT-LLM materials cover the hands-on aspects of getting from a model to deployment on a Triton server.
Given that TRT-LLM focuses on performance, we could add a section that discusses its performance aspects and the various optimisations available to the end user.
aswkumar99 changed the title from "Addition of Benchmarking for TRT-LLM" to "Feature Request - Addition of Benchmarking for TRT-LLM" on Feb 19, 2024
programmah added a commit to programmah/End-to-End-LLM that referenced this issue on Feb 26, 2024