A lightweight, single-header C++11 Jinja2 template engine designed for LLM chat templates (HuggingFace style).
It focuses on supporting the subset of Jinja2 used by modern Large Language Models (LLMs) like Llama 3, Qwen 2.5/3, DeepSeek, and others, enabling seamless inference integration in C++ environments.
- C++11 Compatible: Ensures maximum compatibility across older compiler versions and embedded systems.
- Lightweight: Minimal dependencies (only `nlohmann/json`).
- LLM Focused: Native support for `messages`, `tools`, `add_generation_prompt`, and special tokens.
- Strictly Typed: Uses `nlohmann::json` for context management.
- Custom Function Interop: Easily inject C++ functions (e.g., `strftime_now`) into templates.
- Robust: Validated against official Python `transformers` outputs using fuzzy matching tests.
The library is a single header file: just copy `jinja.hpp` into your project's include directory (or the project root).
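For example, a plain compiler invocation might look like this (paths are illustrative; `nlohmann/json` must also be on the include path):

```bash
# Illustrative build: jinja.hpp copied into ./include
g++ -std=c++11 -Iinclude main.cpp -o main
```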
You can check the library version using standard macros:
```cpp
#include "jinja.hpp"

#if JINJA_VERSION_MAJOR >= 0
// Use jinja.cpp features
#endif
```

Tested and verified with templates from:
- Qwen 2.5 / 3 (Coder, Math, VL, Omni, Instruct, Thinking, QwQ)
- DeepSeek (V3, R1)
- Llama 3 / 3.1 / 3.2 (Instruct & Vision)
- Mistral
- Gemma
- SmolLM
- Phi
- And more...
- CMake 3.10+
- C++11 compatible compiler (GCC, Clang, MSVC)
```bash
mkdir build
cd build
cmake ..
make
```

The project includes a comprehensive test suite based on real-world model templates.
```bash
./test_main
```

Basic rendering works like this:

```cpp
#include "jinja.hpp"
#include <iostream>

int main() {
    std::string template_str = "Hello {{ name }}!";
    jinja::Template tpl(template_str);

    nlohmann::json context;
    context["name"] = "World";

    std::string result = tpl.render(context);
    std::cout << result << std::endl; // Output: Hello World!
    return 0;
}
```
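Chat templates lean heavily on Jinja2 control flow. Below is a minimal sketch of a loop over `messages`; the template string is illustrative, using the `{% for %}` / `{% if %}` constructs the supported model templates rely on:

```cpp
#include "jinja.hpp"
#include <iostream>

int main() {
    // Illustrative template: skip system messages, print the rest
    std::string template_str =
        "{% for message in messages %}"
        "{% if message['role'] != 'system' %}"
        "<|{{ message['role'] }}|>{{ message['content'] }}\n"
        "{% endif %}"
        "{% endfor %}";
    jinja::Template tpl(template_str);

    nlohmann::json context;
    context["messages"] = nlohmann::json::array({
        {{"role", "system"}, {"content", "You are helpful."}},
        {{"role", "user"}, {"content", "Hello!"}}
    });
    std::cout << tpl.render(context) << std::endl; // <|user|>Hello!
    return 0;
}
```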
To apply a model's chat template:

```cpp
#include "jinja.hpp"

// Load your tokenizer_config.json's "chat_template"
std::string chat_template_str = "...";
jinja::Template tpl(chat_template_str);

nlohmann::json messages = nlohmann::json::array({
    {{"role", "user"}, {"content", "Hello!"}}
});

// Apply template
std::string prompt = tpl.apply_chat_template(
    messages,
    true,                    // add_generation_prompt
    nlohmann::json::array()  // tools (empty here)
);
```
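For tool calling, `tools` can be populated in the JSON-schema style that HuggingFace chat templates expect. A sketch with an illustrative function definition (names and fields are examples, not part of this library's API):

```cpp
// Illustrative tool definition (JSON-schema style used by HF chat templates)
nlohmann::json tools = nlohmann::json::array({
    {
        {"type", "function"},
        {"function", {
            {"name", "get_weather"},
            {"description", "Get the current weather for a city"},
            {"parameters", {
                {"type", "object"},
                {"properties", {
                    {"city", {{"type", "string"}}}
                }},
                {"required", {"city"}}
            }}
        }}
    }
});
prompt = tpl.apply_chat_template(messages, true, tools);
```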
You can register custom C++ functions to be called from within the template.
tpl.add_function("strftime_now", [](const std::vector<nlohmann::json>& args) {
// Return current time string
return "2025-12-16";
});For detailed implementation details, see doc/implementation_details.md.
Apache License 2.0. See LICENSE file for details.