exllama
Here are 13 public repositories matching this topic...
A.L.I.C.E (Artificial Labile Intelligence Cybernated Existence). A REST API of A.I companion for creating more complex system
-
Updated
Dec 3, 2023 - Python
A constrained generation filter for local LLMs that makes them quote properly from a source document
-
Updated
May 14, 2024 - Python
Run gguf LLM models in Latest Version TextGen-webui
-
Updated
Jun 3, 2024 - Jupyter Notebook
A fast, lightweight, parallel inference server for Llama LLMs.
-
Updated
Jul 30, 2024 - Python
A Python script designed to streamline the process of quantizing models to exllamav2 format
-
Updated
May 17, 2024 - Python
LLM telegram bot
-
Updated
Sep 21, 2024 - Python
Improve this page
Add a description, image, and links to the exllama topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the exllama topic, visit your repo's landing page and select "manage topics."