Skip to content
View TheBloke's full-sized avatar

Block or report TheBloke

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Tools for merging pretrained large language models.

Python 5,482 519 Updated Mar 24, 2025

Lord of LLMS

Python 287 52 Updated Mar 27, 2025

Go ahead and axolotl questions

Python 8,956 986 Updated Mar 27, 2025

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 2,038 255 Updated Mar 6, 2025

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Python 2,845 221 Updated Sep 30, 2023

Falcon LLM ggml framework with CPU and GPU support

C 246 21 Updated Jan 22, 2024

TheBloke's Dockerfiles

Shell 305 59 Updated Mar 8, 2024

A discord bot with many features which uses A1111 as backend and uses my prompt templates for beautiful generations - even with short prompts.

Python 43 16 Updated Jul 30, 2023

Python bindings for the Transformer models implemented in C/C++ using GGML library.

C 1,854 142 Updated Jan 28, 2024

LLM inference in C/C++

C++ 77,278 11,227 Updated Mar 27, 2025

Universal LLM Deployment Engine with ML Compilation

Python 20,269 1,700 Updated Mar 20, 2025

Python bindings for llama.cpp

Python 8,867 1,096 Updated Mar 24, 2025

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 43,007 5,539 Updated Mar 26, 2025

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Python 9,364 730 Updated Aug 5, 2024

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,773 512 Updated Mar 17, 2025

4 bits quantization of LLaMA using GPTQ

Python 3,045 461 Updated Jul 13, 2024

Run LLaMA (and Stanford-Alpaca) inference on Apple Silicon GPUs.

Python 585 46 Updated Mar 25, 2023

Install nVidia drivers on macOS the easy way.

Shell 1,300 102 Updated Mar 11, 2021

No longer maintained, see pinned issues

CoffeeScript 21,938 3,371 Updated Dec 27, 2024

A PEX to Papyrus Decompiler for Skyrim, Fallout 4 and Starfield

C++ 115 22 Updated Oct 17, 2023

Mod manager for various PC games (currently: Skyrim, Oblivion, Fallout 3, Fallout NV)

C++ 503 65 Updated Feb 21, 2018

Everything here is old and outdated by at least 5 years.

Papyrus 35 11 Updated Sep 8, 2021

The stack_unwinding is a small header only C++ library which supplies primitive(class unwinding_indicator) to determining when object destructor is called due to stack-unwinding or due to normal sc…

C++ 84 11 Updated Oct 22, 2015

Python parser for Paradox .txt files.

Python 37 7 Updated Mar 25, 2025

Compute Napoleon Score for Europa Universalis IV

JavaScript 1 Updated Jan 5, 2014

Battle Simulator for Europa Universalis 4

TypeScript 1 Updated Oct 23, 2014
Showing results