Skip to content
View ggerganov's full-sized avatar

Sponsors

Organizations

@ggml-org

Block or report ggerganov

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

🔮 Future ideas

23 repositories

An open-source port of Prince of Persia, based on the disassembly of the DOS version.

C 1,232 150 Updated Dec 24, 2025

A 1980s-arcade-style game written using HTML5, Canvas, and Web Audio

HTML 181 6 Updated Nov 6, 2025

Improve your vim reflexes!

Python 26 4 Updated May 22, 2018

Draw ASCII diagrams in Neovim

Lua 1,159 25 Updated Aug 16, 2024

Inference code for Llama models

Python 59,214 9,823 Updated Jan 26, 2025

Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens

Python 540 23 Updated Nov 6, 2023

Reverse engineered Linux driver for the Apple Neural Engine (ANE).

C 476 26 Updated Mar 12, 2024

LLM-based code completion engine

191 3 Updated Jan 23, 2025

Port of Meta's Encodec in C/C++

C++ 228 20 Updated Dec 4, 2024

CLIP inference in plain C/C++ with no extra dependencies

C++ 552 53 Updated Jun 19, 2025

programmable e-paper tag with RFID

C 333 29 Updated Nov 13, 2024

Self-hosted AI coding assistant

Rust 33,013 1,693 Updated Mar 2, 2026

Web browser version of StarCoder.cpp

C 46 1 Updated Jul 30, 2023

Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference in pure C/C++

C++ 5,545 547 Updated Mar 9, 2026

Experiments on speculative sampling with Llama models

Python 128 8 Updated Jun 8, 2023

Java Bindings for llama.cpp - A Port of Facebook's LLaMA model in C/C++

C++ 407 55 Updated Jun 20, 2025
Jupyter Notebook 599 27 Updated Aug 23, 2024

My develoopment fork of llama.cpp. For now working on RK3588 NPU and Tenstorrent backend

C 115 11 Updated Mar 11, 2026

Inference of Mamba and Mamba2 models in pure C

C 197 10 Updated Jan 22, 2026

Reverse engineering the rk3588 npu

C 111 8 Updated May 30, 2024

Fast and accurate AI powered file content types detection

Python 10,150 495 Updated Mar 3, 2026

Semantic emoji finder. Python/dash UI. Uses sentence transformer embeddings and duckdb

Python 19 2 Updated Sep 15, 2025
Jupyter Notebook 176 14 Updated Jun 26, 2024