This repository was archived by the owner on Jul 4, 2025. It is now read-only.

Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA's TensorRT-LLM for GPU-accelerated inference on NVIDIA GPUs.


menloresearch/cortex.tensorrt-llm




Languages

  • C++ 99.3%
  • Python 0.5%
  • Cuda 0.2%
  • CMake 0.0%
  • Shell 0.0%
  • Smarty 0.0%