FreeInference

Free LLM inference for coding agents and AI-powered IDEs.

Overview

FreeInference provides free access to state-of-the-art language models for coding agents such as Cursor, Codex, Roo Code, and other AI-powered development tools.

Documentation

Visit our documentation at: https://harvardsys.github.io/free_inference/

Supported IDEs & Coding Agents

  • Cursor - AI-powered code editor
  • Codex - Terminal-based coding assistant
  • Roo Code - VS Code & JetBrains extension
  • Kilo Code - AI coding assistant
  • And any tool that supports OpenAI-compatible APIs
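Because the endpoint speaks the OpenAI chat-completions protocol, any HTTP client can talk to it directly. As an illustrative sketch (the model ID and placeholder key are examples, not guaranteed API identifiers), here is a chat request built with Python's standard library; the request is constructed but not sent, so you can inspect it before supplying a real key:

```python
import json
import urllib.request

BASE_URL = "https://freeinference.org/v1"  # FreeInference's OpenAI-compatible endpoint
API_KEY = "your-api-key-here"              # placeholder -- substitute your real key

def build_chat_request(model: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-style chat completion request."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {API_KEY}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

req = build_chat_request("glm-4.7", "Write a hello-world in Go.")
# Uncomment to actually send the request once API_KEY is set:
# with urllib.request.urlopen(req) as resp:
#     print(json.load(resp)["choices"][0]["message"]["content"])
```

The same base URL and Bearer-token header are what the IDE integrations below configure for you.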

Quick Start

Cursor Setup

  1. Open Settings (Cmd + , or Ctrl + ,)
  2. Go to API Keys section
  3. Enter your FreeInference API key
  4. Click Override OpenAI Base URL
  5. Enter: https://freeinference.org/v1
  6. Enable the toggle and start coding!

Codex Setup

  1. Create ~/.codex/config.toml:

```toml
model = "glm-4.7"
model_provider = "free_inference"

[model_providers.free_inference]
name = "FreeInference"
base_url = "https://freeinference.org/v1"
wire_api = "chat"
env_http_headers = { "X-Session-ID" = "CODEX_SESSION_ID", "Authorization" = "FREEINFERENCE_API_KEY" }
```

  2. Add to ~/.zshrc or ~/.bashrc:

```shell
export CODEX_SESSION_ID="$(date +%Y%m%d-%H%M%S)-$(uuidgen)"
export FREEINFERENCE_API_KEY="Bearer your-api-key-here"
```

  3. Reload: source ~/.zshrc
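The session ID is just a timestamp plus a UUID, which gives each Codex session a unique X-Session-ID header. A minimal sketch of generating one (the python3 fallback is our addition for systems without uuidgen, not part of the official setup):

```shell
# Build a session ID in the same shape as the export above:
# YYYYmmdd-HHMMSS-<uuid>. Falls back to python3 if uuidgen is missing.
SESSION_ID="$(date +%Y%m%d-%H%M%S)-$(uuidgen 2>/dev/null || python3 -c 'import uuid; print(uuid.uuid4())')"
echo "$SESSION_ID"
```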

Roo Code / Kilo Code Setup

  1. Install the extension in your IDE
  2. Open settings
  3. Select OpenAI Compatible as provider
  4. Configure:
    • Base URL: https://freeinference.org/v1
    • API Key: your-api-key-here
  5. Select your preferred model

Available Models

  • GLM-4.7 - 200K context, best for long context and bilingual support
  • GLM-4.7-Flash - 200K context, fast and cost-effective
  • MiniMax M2 - 196K context, best for very large codebases
  • Qwen3 Coder 30B - 32K context, specialized for code generation
  • Llama 3.3 70B - 131K context, general coding (limited capacity)
  • Llama 4 Scout - 128K context, optimized for speed (limited capacity)
  • Llama 4 Maverick - 128K context, multimodal support (limited capacity)

See the Models documentation for the complete list.
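When choosing a model, the deciding factor is often whether your prompt fits the context window. As a small illustration (the lowercase model IDs here are hypothetical, not confirmed API identifiers; the token counts come from the list above), a helper that filters models by prompt size:

```python
# Context windows (tokens) from the model list above; model IDs are
# illustrative guesses, not necessarily the exact API identifiers.
CONTEXT_WINDOWS = {
    "glm-4.7": 200_000,
    "glm-4.7-flash": 200_000,
    "minimax-m2": 196_000,
    "qwen3-coder-30b": 32_000,
    "llama-3.3-70b": 131_000,
    "llama-4-scout": 128_000,
    "llama-4-maverick": 128_000,
}

def models_fitting(prompt_tokens: int) -> list[str]:
    """Return model IDs whose context window can hold the prompt."""
    return [m for m, ctx in CONTEXT_WINDOWS.items() if ctx >= prompt_tokens]

print(models_fitting(150_000))
```

For a 150K-token codebase dump, only the 196K+ models qualify; shorter prompts leave the whole list open.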

Get API Key

  1. Visit https://freeinference.org
  2. Register for a free account
  3. Log in and create your API key
  4. Start using FreeInference with your favorite IDE!
