
gemma-api

A FastAPI service that provides a REST API for interacting with the Gemma language model through Ollama. By customizing the system prompt, the same service can be used for a variety of natural language processing tasks.

Features

  • Generic REST API for LLM interaction
  • Configurable system prompts for different use cases
  • Docker containerization
  • Automatic model loading
  • Health check endpoint
  • Retry mechanism for reliability
  • JSON response formatting

Prerequisites

  • Docker and Docker Compose

Quick Start

  1. Clone the repository:
git clone https://github.com/JacobOmateq/gemma-api
cd gemma-api
  2. Start the service:
./start.sh

Or for a complete rebuild:

./rebuild.sh
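
Once the containers are up, you can confirm the API is reachable. A minimal smoke test in Python (a sketch, assuming the default port 8000 and the requests package installed):

import requests

# Query the health endpoint exposed by the API container.
response = requests.get("http://localhost:8000/health", timeout=5)
print(response.json())  # e.g. {"status": "healthy", "ollama": "connected"}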

Configuration

System Prompt

The default system prompt is configured in llm_settings.json. You can customize it for your specific use case:

{
    "model": "gemma3:1b",
    "system_prompt": "Your custom prompt here..."
}
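
If you prefer to script the configuration, the file can also be edited programmatically. A minimal sketch, assuming the service reads llm_settings.json at startup (restart the containers after changing it):

import json

# Load the existing settings, swap in a task-specific prompt, and write back.
with open("llm_settings.json") as f:
    settings = json.load(f)

settings["system_prompt"] = (
    "Extract event details and return them as JSON with fields: type, person, time"
)

with open("llm_settings.json", "w") as f:
    json.dump(settings, f, indent=4)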

API Endpoints

Generate Response

curl --location 'http://localhost:8000/generate' \
--header 'Content-Type: application/json' \
--data '{
    "text": "Your input text here",
    "system_prompt": "Optional custom prompt to define the task"
}'
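
The same call can be made from Python. A small reusable helper, sketched with the requests package; the payload fields mirror the curl example above:

import requests

API_URL = "http://localhost:8000/generate"

def generate(text, system_prompt=None):
    # Build the request body; system_prompt is optional, as in the curl example.
    payload = {"text": text}
    if system_prompt is not None:
        payload["system_prompt"] = system_prompt
    response = requests.post(API_URL, json=payload, timeout=60)
    response.raise_for_status()
    return response.json()

print(generate("Your input text here", "Optional custom prompt to define the task"))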

Example use cases:

  1. Information Extraction:
{
    "text": "Meeting with John tomorrow at 2 PM",
    "system_prompt": "Extract event details and return them as JSON with fields: type, person, time"
}
  2. Text Classification:
{
    "text": "I love this product, it works great!",
    "system_prompt": "Analyze the sentiment of this text and return JSON with fields: sentiment, confidence"
}
  3. Structured Data Generation:
{
    "text": "Create a character profile for a fantasy story",
    "system_prompt": "Generate a character profile and return as JSON with fields: name, race, class, abilities, background"
}

Response Format

{
    "content": {
        // JSON formatted response
        // Structure depends on the system prompt
    }
}
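
Because the structure of content depends on the system prompt, clients should parse it defensively. A sketch using the sentiment-classification example above; the sentiment and confidence fields only appear if the model follows the prompt:

import requests

payload = {
    "text": "I love this product, it works great!",
    "system_prompt": "Analyze the sentiment of this text and return JSON with fields: sentiment, confidence",
}
body = requests.post("http://localhost:8000/generate", json=payload, timeout=60).json()

# content may be null if the model output could not be parsed as JSON
# (see Common Issues below), so guard before indexing into it.
content = body.get("content")
if content is None:
    raise RuntimeError("model returned no parseable JSON")
print(content["sentiment"], content["confidence"])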

Health Check

curl http://localhost:8000/health

Response:

{
    "status": "healthy",
    "ollama": "connected"
}
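
On a fresh start the service may not be healthy immediately, since the Ollama model is pulled first. A small Python wait loop (a sketch; the timings are arbitrary):

import time
import requests

# Poll /health until Ollama reports as connected, or give up after ~2 minutes.
for _ in range(24):
    try:
        status = requests.get("http://localhost:8000/health", timeout=5).json()
        if status.get("status") == "healthy":
            print("service ready:", status)
            break
    except requests.RequestException:
        pass  # API container may still be starting
    time.sleep(5)
else:
    raise SystemExit("service did not become healthy in time")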

Development

Project Structure

gemma-api/
├── main.py            # FastAPI application
├── llm_settings.json  # Model and prompt configuration
├── start.sh           # Service startup script
├── rebuild.sh         # Rebuild script for development
├── Dockerfile         # Container definition
└── docker-compose.yml

Scripts

  • start.sh: Initial startup of the service
  • rebuild.sh: Complete rebuild of the containers (useful during development)

Rebuilding the Service

During development, use the rebuild script to apply changes:

./rebuild.sh

This will:

  1. Stop existing containers
  2. Rebuild images from scratch
  3. Start the services
  4. Pull the required Ollama model

Troubleshooting

Logs

View service logs:

docker-compose logs -f

View specific service logs:

docker-compose logs -f api  # For the API service
docker-compose logs -f ollama  # For the Ollama service

Common Issues

  1. If the service returns null responses, check:

    • Ollama model availability (see the check below)
    • JSON parsing in the response
    • System prompt formatting
  2. If the service is unhealthy:

    • Verify Ollama container is running
    • Check resource availability
    • Review API logs
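
To check model availability directly, Ollama exposes a model-listing endpoint. A sketch, assuming the compose file publishes Ollama's default port 11434 to the host (adjust the URL if yours does not):

import requests

# GET /api/tags lists the models Ollama has pulled locally.
tags = requests.get("http://localhost:11434/api/tags", timeout=5).json()
models = [m["name"] for m in tags.get("models", [])]
print(models)  # should include the model from llm_settings.json, e.g. gemma3:1b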

License

MIT
