Rust gRPC Embedding Service with FastEmbed

A high-performance gRPC embedding service built in Rust with FastEmbed and the sentence-transformers/paraphrase-MiniLM-L12-v2 model. To use a different model, change the EmbeddingModel variant passed to InitOptions::new in the Rust source under src/:

InitOptions::new(EmbeddingModel::ParaphraseMLMiniLML12V2Q)

Features

Fast & Lightweight: Built in Rust for optimal performance and low resource consumption
gRPC API: Efficient binary protocol for client-server communication
Flexible Input: Supports both single text and batch processing
ONNX Model: Uses FastEmbed with ONNX runtime for fast inference
Configurable: Environment-based configuration for easy deployment

Quick Start

1. Build from Code

Prerequisites

Rust (latest stable version)
Protocol Buffers compiler (protoc)

Create a .env file with the following values:

APP_SERVER_ADDRESS=0.0.0.0:6010

Build the project:

cargo build --release

Start the server:

cargo run --bin server

The server listens on the address given by APP_SERVER_ADDRESS (127.0.0.1:6010 by default) and downloads the embedding model on first run.
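
As a rough illustration of the environment-based configuration described above, the sketch below resolves the listen address from APP_SERVER_ADDRESS and falls back to 127.0.0.1:6010 when the variable is unset. It uses only the standard library. The helper names `resolve_address` and `address_from_env` are hypothetical and not taken from this repository, and loading the .env file into the process environment is assumed to happen elsewhere.

```rust
use std::net::{AddrParseError, SocketAddr};

/// Fallback address, taken from the default mentioned in this README.
const DEFAULT_ADDRESS: &str = "127.0.0.1:6010";

/// Parse an optional raw value into a socket address, using the default
/// when no value is given. Hypothetical helper, not the project's own code.
fn resolve_address(raw: Option<&str>) -> Result<SocketAddr, AddrParseError> {
    raw.unwrap_or(DEFAULT_ADDRESS).trim().parse()
}

/// Read APP_SERVER_ADDRESS from the process environment.
/// Assumes the .env file has already been loaded into the environment.
fn address_from_env() -> Result<SocketAddr, AddrParseError> {
    let raw = std::env::var("APP_SERVER_ADDRESS").ok();
    resolve_address(raw.as_deref())
}
```

Returning a Result rather than panicking lets the caller report a malformed address (for example a missing port) with a clear startup error.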

2. Build from Dockerfile

Create a .env file with the following values:

APP_SERVER_ADDRESS=0.0.0.0:6010

Replace <your_image_name> with your desired image name and build the image:

docker build -t <your_image_name>:latest .

3. Running from Docker Compose (Recommended)

docker compose up --build -d

API Reference

gRPC Service: `EmbeddingService`

Method: `GetEmbeddings`

Request (EmbeddingRequest):

message EmbeddingRequest {
  oneof input {
    string single_text = 1;        // Single text input
    TextBatch batch_texts = 2;     // Batch text input
  }
}

message TextBatch {
  repeated string texts = 1;
}
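
To make the `oneof input` field more concrete, here is a minimal sketch of how a handler might flatten either variant into a list of texts before embedding. The `Input` enum is a hand-written stand-in for the type that would normally be generated from the .proto file, `texts_from_input` is a hypothetical helper, and rejecting an empty batch is an assumption rather than documented behavior of this service.

```rust
/// Hand-written mirror of the `oneof input` field. The real type would be
/// generated from the .proto definition at build time.
#[derive(Debug, Clone, PartialEq)]
enum Input {
    SingleText(String),
    BatchTexts(Vec<String>),
}

/// Flatten either variant into a list of texts.
/// `None` models a request in which neither oneof field was set.
fn texts_from_input(input: Option<Input>) -> Result<Vec<String>, String> {
    match input {
        // A single text becomes a batch of one.
        Some(Input::SingleText(text)) => Ok(vec![text]),
        // A non-empty batch is passed through unchanged.
        Some(Input::BatchTexts(texts)) if !texts.is_empty() => Ok(texts),
        // Assumption: an empty batch is treated as a client error.
        Some(Input::BatchTexts(_)) => Err("batch_texts is empty".to_string()),
        None => Err("no input provided".to_string()),
    }
}
```

Treating a single text as a batch of one keeps the embedding path identical for both request shapes.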

Response (EmbeddingResponse):

message EmbeddingResponse {
  repeated Vector vectors = 1;
}

message Vector {
  repeated float values = 1;
}
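
The response shape can be sketched in the same way. The structs below are hand-written stand-ins for the generated `EmbeddingResponse` and `Vector` types, and `build_response` is a hypothetical helper. The sketch assumes the service returns one `Vector` per input text, in input order, which the message definitions suggest but do not state.

```rust
/// Stand-in for the generated `Vector` message.
#[derive(Debug, Clone, PartialEq)]
struct Vector {
    values: Vec<f32>,
}

/// Stand-in for the generated `EmbeddingResponse` message.
#[derive(Debug, Clone, PartialEq)]
struct EmbeddingResponse {
    vectors: Vec<Vector>,
}

/// Wrap raw embeddings (one inner Vec<f32> per text) into the response shape.
/// Order is preserved, so vectors[i] corresponds to the i-th input text.
fn build_response(embeddings: Vec<Vec<f32>>) -> EmbeddingResponse {
    EmbeddingResponse {
        vectors: embeddings
            .into_iter()
            .map(|values| Vector { values })
            .collect(),
    }
}
```

A client that sent `single_text` would, under this assumption, read its embedding from `vectors[0].values`.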

Performance Notes

The service loads the model once at startup for optimal performance
FastEmbed uses ONNX runtime for efficient inference
Batch processing is recommended for multiple texts to reduce overhead
Model files are cached locally after first download
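
The first and third notes above can be illustrated with a small standard-library sketch: a model that is initialized once and then shared by every call, with a whole batch embedded in a single call. `StubModel` is a purely hypothetical stand-in for the real FastEmbed model (it returns dummy vectors), and `model`, `embed_batch`, and `LOAD_COUNT` are illustrative names, not part of this repository.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::OnceLock;

/// Counts how many times the (stub) model has been loaded.
static LOAD_COUNT: AtomicUsize = AtomicUsize::new(0);

/// Process-wide model instance, initialized on first use.
static MODEL: OnceLock<StubModel> = OnceLock::new();

/// Hypothetical stand-in for the real embedding model.
struct StubModel {
    dim: usize,
}

impl StubModel {
    /// Pretend to do the expensive load (download, ONNX session setup).
    fn load() -> Self {
        LOAD_COUNT.fetch_add(1, Ordering::SeqCst);
        StubModel { dim: 4 }
    }

    /// Return one dummy vector per text, filled with the text's byte length.
    fn embed(&self, texts: &[String]) -> Vec<Vec<f32>> {
        texts
            .iter()
            .map(|t| vec![t.len() as f32; self.dim])
            .collect()
    }
}

/// Return the shared model, loading it only on the first call.
fn model() -> &'static StubModel {
    MODEL.get_or_init(StubModel::load)
}

/// Embed all texts in one call instead of one call per text.
fn embed_batch(texts: &[String]) -> Vec<Vec<f32>> {
    model().embed(texts)
}
```

With this layout, repeated calls to `embed_batch` reuse the same instance, so the load cost is paid once per process rather than once per request.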

Created by:

Rizky Indrabayu

About

Uh oh!

Releases

Packages

Languages

indrabayuu/rust-embedding-service

Folders and files

Latest commit

History

Repository files navigation

Rust gRPC Embedding Service with FastEmbed

Features

Quick Start

1. Build from Code

Prerequisites

2. Build from DockerFile

3. Running from Docker Compose (Recommended)

API Reference

gRPC Service: EmbeddingService

Method: GetEmbeddings

Performance Notes

Created by:

Rizky Indrabayu

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

gRPC Service: `EmbeddingService`

Method: `GetEmbeddings`

Packages