threadripper

Here is 1 public repository matching this topic...

a730 / MojoLlama

MojoLlama is a high-throughput inference engine for CPU, built on Modular MAX. GGUF native, MoE-optimized, with support for 50+ architectures — from Llama to Gemma 4 to hybrid SSM models. GPU acceleration via MAX engine for supported models.

ai avx2 threadripper huggingface llm llama-cpp vllm mojo-lang

Updated May 20, 2026
Python
