GoGemma

A simple Go server that accepts a token, a command, and text, and returns a response from Gemma (Google's 2B-parameter LLM). It uses Redis to cache responses and deploys easily to Fly.io.

Leverages Gemma CPP.

Developed for the RapidRead feature in GhostRemix.
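
The request flow described above (check the token, look up a cached response, otherwise ask the model and cache the result) can be sketched roughly as follows. This is an illustration, not the project's actual code: the names askRequest, cache, memCache, cacheKey, and askHandler are hypothetical, the in-memory map stands in for Redis, the generate function stands in for the call into Gemma, and keying the cache on a hash of command and text is an assumption.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"encoding/json"
	"fmt"
	"net/http"
	"net/http/httptest"
	"strings"
)

// askRequest mirrors the three fields the server is described as accepting.
type askRequest struct {
	Token   string `json:"token"`
	Command string `json:"command"`
	Text    string `json:"text"`
}

// cache is a hypothetical stand-in for the Redis client.
type cache interface {
	Get(key string) (string, bool)
	Set(key, value string)
}

// memCache is an in-memory cache used here so the sketch runs without Redis.
type memCache map[string]string

func (m memCache) Get(k string) (string, bool) { v, ok := m[k]; return v, ok }
func (m memCache) Set(k, v string)             { m[k] = v }

// cacheKey derives a fixed-length key from command and text (an assumed scheme).
func cacheKey(command, text string) string {
	sum := sha256.Sum256([]byte(command + "\x00" + text))
	return hex.EncodeToString(sum[:])
}

// askHandler validates the token, serves from cache when possible,
// and otherwise calls generate and stores the result.
func askHandler(wantToken string, c cache, generate func(command, text string) (string, error)) http.HandlerFunc {
	return func(w http.ResponseWriter, r *http.Request) {
		var req askRequest
		if err := json.NewDecoder(r.Body).Decode(&req); err != nil {
			http.Error(w, "invalid JSON body", http.StatusBadRequest)
			return
		}
		if req.Token != wantToken {
			http.Error(w, "invalid token", http.StatusUnauthorized)
			return
		}
		key := cacheKey(req.Command, req.Text)
		if cached, ok := c.Get(key); ok {
			fmt.Fprint(w, cached)
			return
		}
		out, err := generate(req.Command, req.Text)
		if err != nil {
			http.Error(w, "model error", http.StatusInternalServerError)
			return
		}
		c.Set(key, out)
		fmt.Fprint(w, out)
	}
}

func main() {
	calls := 0
	// fake replaces the real model call for demonstration purposes.
	fake := func(command, text string) (string, error) {
		calls++
		return "summary of: " + text, nil
	}
	h := askHandler("secret", memCache{}, fake)
	body := `{"token":"secret","command":"Summarize","text":"hello"}`
	// Send the same request twice; the second should be served from cache.
	for i := 0; i < 2; i++ {
		rec := httptest.NewRecorder()
		h(rec, httptest.NewRequest(http.MethodPost, "/askGemma", strings.NewReader(body)))
		fmt.Println(rec.Code, rec.Body.String())
	}
	fmt.Println("model calls:", calls)
}
```

Sending the identical request twice invokes the fake model only once, which is the behavior a response cache is meant to provide.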

Local Development

Dependencies

Go 1.22
Make
Air
Tilt

Get the code

Use the template to create your own repository.

GitHub UI

Navigate to the repository, click Use this template, and follow the instructions.

GitHub CLI

Install the GitHub CLI (gh) if you do not already have it.

# Step 1: Clone the template repository

git clone https://github.com/mikan-laboratory/go-gemma.git new-project

cd new-project

# Step 2: Create a new repository on GitHub

gh repo create username/new-project --private --source=.

# Step 3: Push the cloned contents to the new repository

git push --set-upstream origin main

Quickstart

Create .env file from .env.example.
Download Gemma from our Google Drive or Kaggle.
Create a libs directory and unpack the contents of the zip there.
Run tilt up in project root.
Test with the command below.

curl -X POST -H "Content-Type: application/json" -d '{
  "command": "Summarize this post; Reply only with the summary;",
  "token": "your_token_here",
  "text": "Your input text goes here..."
}' http://localhost:8081/askGemma
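
The same request can also be sent from Go. The sketch below assumes the local server from the Quickstart is listening on http://localhost:8081 and makes no assumption about the response format, so it simply prints the status and raw body. The names askPayload and newAskRequest are hypothetical helpers introduced for this example, and the two-minute timeout is an arbitrary choice to allow for slow CPU inference.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"io"
	"net/http"
	"time"
)

// askPayload holds the same three JSON fields used in the curl example.
type askPayload struct {
	Command string `json:"command"`
	Token   string `json:"token"`
	Text    string `json:"text"`
}

// newAskRequest builds a POST request to <baseURL>/askGemma with a JSON body.
func newAskRequest(baseURL string, p askPayload) (*http.Request, error) {
	body, err := json.Marshal(p)
	if err != nil {
		return nil, err
	}
	req, err := http.NewRequest(http.MethodPost, baseURL+"/askGemma", bytes.NewReader(body))
	if err != nil {
		return nil, err
	}
	req.Header.Set("Content-Type", "application/json")
	return req, nil
}

func main() {
	req, err := newAskRequest("http://localhost:8081", askPayload{
		Command: "Summarize this post; Reply only with the summary;",
		Token:   "your_token_here",
		Text:    "Your input text goes here...",
	})
	if err != nil {
		fmt.Println("could not build request:", err)
		return
	}

	// Generous timeout (an assumption): model inference on CPU can be slow.
	client := &http.Client{Timeout: 2 * time.Minute}
	resp, err := client.Do(req)
	if err != nil {
		fmt.Println("request failed (is the server running?):", err)
		return
	}
	defer resp.Body.Close()

	out, err := io.ReadAll(resp.Body)
	if err != nil {
		fmt.Println("could not read response:", err)
		return
	}
	fmt.Println(resp.Status)
	fmt.Println(string(out))
}
```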

Test Docker Build

Build image and run container with make all.
Clean image and container with make clean-all.

Deploy to Fly.io

Prerequisites

Create Fly.io account.
Authenticate with flyctl auth login.
Create app with flyctl launch --no-deploy.

GitHub Actions

Navigate to the newly created application in the Fly.io dashboard and get a deploy token.
Set secrets in GitHub repository settings.
Trigger a deploy manually: open the Actions tab, select Deploy, click Run workflow, and enter the name of the branch to deploy.
- To deploy automatically on every push to main, change the on section of the workflow file to a push trigger limited to the main branch (push: branches: [main]).
