A simple custom QLoRA implementation for fine-tuning a large language model (LLM) with basic tools such as PyTorch and bitsandbytes, completely decoupled from Hugging Face.
Updated Jan 29, 2024 (Python)
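The core idea behind such a QLoRA implementation is to freeze the base weights and train only small low-rank adapter matrices. Below is a minimal sketch of that adapter pattern in plain PyTorch; the `LoRALinear` class and its parameter names are illustrative, not taken from the repository. In a real QLoRA setup the frozen base layer would be a 4-bit quantized layer (e.g. `bnb.nn.Linear4bit` from bitsandbytes), but a standard `nn.Linear` is used here so the sketch runs on CPU.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Illustrative LoRA wrapper: frozen base layer + trainable low-rank adapters.

    In QLoRA the base layer would be a 4-bit bitsandbytes layer
    (e.g. bnb.nn.Linear4bit); nn.Linear is used here so this runs anywhere.
    """

    def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False          # freeze the (quantized) base weights
        # Low-rank decomposition: delta_W = B @ A, with rank r << in/out features
        self.lora_A = nn.Linear(base.in_features, r, bias=False)
        self.lora_B = nn.Linear(r, base.out_features, bias=False)
        nn.init.zeros_(self.lora_B.weight)   # adapter contributes zero at init
        self.scaling = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.lora_B(self.lora_A(x)) * self.scaling

# Usage: only the adapter parameters receive gradients.
layer = LoRALinear(nn.Linear(64, 64))
x = torch.randn(2, 64)
out = layer(x)  # equals the frozen base output at initialization, since B = 0
```

Because `lora_B` is zero-initialized, the wrapped layer starts out exactly equivalent to the frozen base model, and fine-tuning only updates the small `A`/`B` matrices.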
Text generation with the pre-trained NousResearch Llama-2-7b-chat-hf model, using the guanaco-llama2-1k dataset.