fp16

Here is 1 public repository matching this topic...

custom-build-robots / tensorrt-llm-edge-prep

Build, run, and setup scripts for the complete TensorRT-LLM pipeline on RTX A6000 Ada (SM89). Reproducible path from HuggingFace checkpoint to deployable .engine file, with FP16 baseline and FP8 quantization. Companion material to the 4-part blog series on ai-box.eu — in preparation for the NVIDIA TensorRT Edge-LLM ecosystem.

inference nvidia quantization rtx fp16 ai-agents edge-ai llm ada-architecture fp8 tensorrt-llm

Updated May 16, 2026
Shell

Improve this page

Add a description, image, and links to the fp16 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the fp16 topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fp16

Here is 1 public repository matching this topic...

custom-build-robots / tensorrt-llm-edge-prep

Improve this page

Add this topic to your repo