Peer-to-peer distributed AI inference using 1-bit quantized models. CPU-only, 70-82% energy savings, 103+ tokens/sec. Validated on Zen 4 & Zen 5 (+35% cross-gen improvement).
Python · Updated Apr 27, 2026
Windows-native BitNet and ternary LLM inference with CPU GGUF, GPU runtime, terminal and browser chat, and release zips.
High-performance hybrid architecture for Agent Zero & BitNet b1.58. Natively optimized for Windows ARM64 (Snapdragon X Elite / Copilot+ PCs) using raw C++ inference and Docker-based agent orchestration.
Desktop chat app for Microsoft's 1-bit BitNet LLMs. Windows-native, CPU-only, zero dependencies.
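The repositories above all build on 1.58-bit (ternary) weight quantization, in which each weight is constrained to {-1, 0, +1}. A minimal sketch of the "absmean" quantization scheme described in the BitNet b1.58 paper, with hypothetical function and variable names:

```python
# Sketch of BitNet b1.58-style ternary quantization (assumed "absmean" scheme):
# each weight is scaled by the mean absolute value of the tensor, rounded,
# and clamped to the ternary set {-1, 0, +1}.

def absmean_quantize(weights, eps=1e-8):
    """Quantize a flat list of floats to ternary values {-1, 0, +1}.

    Returns the quantized values and the scale needed to dequantize.
    """
    # Per-tensor scale: mean absolute value (eps avoids division by zero).
    scale = sum(abs(w) for w in weights) / len(weights) + eps
    # Round to nearest integer, then clamp into the ternary range.
    quantized = [max(-1, min(1, round(w / scale))) for w in weights]
    return quantized, scale

if __name__ == "__main__":
    w = [0.9, -0.02, -1.3, 0.4]
    q, s = absmean_quantize(w)
    print(q)  # ternary codes; multiply by s to approximately recover w
```

Because every weight becomes one of three values, matrix multiplication reduces to additions and subtractions, which is what makes the CPU-only inference claimed by these projects plausible.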