This repository has been archived by the owner on Apr 1, 2024. It is now read-only.


rs-llama-cpp

Automated Rust bindings generation for LLaMA.cpp

Description

LLaMA.cpp is under heavy development, with contributions pouring in from numerous individuals every day. Its C API is currently very low-level, and given how fast the project is evolving, keeping up with the changes and porting the examples into a higher-level API proves difficult. As a trade-off, this project prioritizes automation over flexibility by automatically generating Rust bindings for the main example of LLaMA.cpp.

Limitations

The main design goal of this project is to minimize the effort of updating LLaMA.cpp by automating as many steps as possible. However, this approach does have some limitations:

  1. The API is very high-level: it amounts to invoking the main function of LLaMA.cpp and receiving generated tokens through a callback function.
  2. Currently, the project does not expose parameters with types that are more challenging to convert to Rust, such as std::unordered_map and std::vector.
  3. Some of the parameters exposed via the Rust API are only relevant for a CLI.
  4. The generated C++ library outputs a significant amount of debug information to stderr and stdout. This is not currently configurable.
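The callback contract mentioned in point 1 can be sketched in plain Rust: the closure receives each generated token and returns a bool, true to continue generating and false to stop. The helper name and the simulated token stream below are ours, shown independently of the bindings:

```rust
/// Decide whether generation should continue after seeing `token`.
/// Mirrors the callback contract: `true` keeps generating, `false` stops
/// (here: stop at the end of the first line).
fn keep_generating(token: &str) -> bool {
    !token.ends_with('\n')
}

fn main() {
    // Simulated token stream, standing in for tokens delivered by the bindings.
    let tokens = ["Hello", ",", " world", "\n", "never reached"];
    let mut output = String::new();
    for t in tokens {
        output.push_str(t);
        if !keep_generating(t) {
            break;
        }
    }
    println!("{}", output.escape_debug());
}
```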

Usage

use rs_llama_cpp::{gpt_params_c, run_inference, str_to_mut_i8};

fn main() {
    let params = gpt_params_c {
        model: str_to_mut_i8("/path/to/model.bin"),
        prompt: str_to_mut_i8("Hello "),
        ..Default::default()
    };

    run_inference(params, |token| {
        println!("Token: {}", token);

        if token.ends_with('\n') {
            return false; // stop inference
        }

        true // continue inference
    });
}
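The `str_to_mut_i8` helper used above converts a Rust string into a mutable C string pointer for the generated bindings. A self-contained sketch of what such a conversion presumably looks like, using `CString::into_raw` (the function name and behavior of the real helper may differ; the caller is responsible for eventually freeing the pointer):

```rust
use std::ffi::{CStr, CString};
use std::os::raw::c_char;

// Hypothetical re-implementation sketch: turn a Rust &str into a
// heap-allocated, NUL-terminated *mut c_char suitable for a C API.
fn str_to_mut_c(s: &str) -> *mut c_char {
    CString::new(s)
        .expect("string must not contain interior NUL bytes")
        .into_raw()
}

fn main() {
    let ptr = str_to_mut_c("/path/to/model.bin");

    // Read the string back to confirm it round-trips.
    let back = unsafe { CStr::from_ptr(ptr) }.to_str().unwrap();
    println!("{}", back);

    // Reclaim ownership so the allocation is freed rather than leaked.
    unsafe { drop(CString::from_raw(ptr)) };
}
```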