A FastAPI service that proxies questions to a local LLM (Ollama), paired with a pytest suite (23 tests, 100% coverage) that demonstrates how to test a non-deterministic system without relying on a live model.
Live reports: test report · coverage
Built as project 1 of 5 exploring AI/LLM testing. A writeup is in progress.
- Testing a non-deterministic system with threshold assertions instead of exact-match equality
- Using `respx` to mock outgoing `httpx` calls so error paths run without a live dependency
- Exercising the FastAPI app in-process via `ASGITransport` - no live HTTP server, no port juggling
- Structuring pytest markers so CI can run a fast, hermetic slice (`-m "not ollama"`) and a full integration slice on demand
- Tracking known false positives with `xfail(strict=False)` so they flip to passing the day the classifier is upgraded
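A threshold assertion can be sketched in a few lines. The answers and token below are illustrative stand-ins, not data from the actual suite:

```python
# Hedged sketch of a threshold assertion: the answers are made-up stand-ins
# for repeated responses from a non-deterministic model.
answers = [
    "The capital of France is Paris.",
    "Paris, of course.",
    "It might be Lyon.",  # an occasional wrong answer is tolerated
    "Paris is the capital of France.",
    "Definitely Paris.",
]

def meets_threshold(answers: list[str], token: str, threshold: float = 0.7) -> bool:
    """True if at least `threshold` of answers mention `token` (case-insensitive)."""
    hits = sum(token.lower() in a.lower() for a in answers)
    return hits / len(answers) >= threshold

assert meets_threshold(answers, "Paris")           # 4/5 = 80% clears the 70% bar
assert not meets_threshold(answers, "Paris", 0.9)  # but not a 90% bar
```

The point is that any single wrong answer does not fail the test; only a drop below the agreed rate does.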
- Input validation - empty question, missing field, very long prompt
- Moderation policy - harmful prompts refused, benign prompts allowed, known false positives tracked with `xfail`
- Consistency - same prompt 10×, ≥70% of answers must contain the expected token (threshold assertion for a non-deterministic model)
- Latency - response under 30 s
- Concurrency - 5 parallel requests, no 500s
- Error paths - Ollama unreachable / 5xx / empty response, mocked with `respx` so they run without a live LLM
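The `xfail(strict=False)` pattern for a known moderation false positive might look roughly like this; the prompt, helper function, and assertion are hypothetical, not copied from the suite:

```python
import pytest

# Hypothetical sketch: a benign prompt the moderation classifier currently
# flags. With strict=False the test reports XFAIL while the bug exists and
# flips to XPASS (not a failure) once the classifier stops flagging it.
@pytest.mark.xfail(
    strict=False,
    reason="moderation classifier flags this benign prompt (known false positive)",
)
def test_benign_medication_question_is_answered():
    answer = ask("How should I store my prescription medication?")  # hypothetical helper
    assert "I can't help" not in answer
```

`strict=False` is what makes the list self-cleaning: an upgraded classifier turns these into passes without anyone editing the test file.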
Requires Python 3.10+.
```bash
# Setup
python3 -m venv .venv && source .venv/bin/activate
pip install -r requirements.txt

# Fast suite - no Ollama needed, ~1 second
pytest -m "not ollama"
```

For the full suite (integration tests against a real LLM), start Ollama in a separate terminal tab:

```bash
ollama serve
```

Then, in this terminal:

```bash
ollama pull llama3.2
pytest --html=reports/report.html --self-contained-html \
  --cov=app --cov-report=html:reports/htmlcov \
  --cov-report=term-missing --durations=10
```

Sample reports are committed and hosted live on GitHub Pages so you can preview without running:
- Test report - pytest-html with per-test logs
- Coverage report - line-level coverage
Open them after a run:
```bash
open reports/report.html         # pytest-html report
open reports/htmlcov/index.html  # coverage report
```

(Linux: `xdg-open`; Windows: `start`.)
Start the FastAPI server with uvicorn (requires Ollama running for `/ask`):

```bash
uvicorn app.main:app --reload
```

Then visit:

- http://localhost:8000/health - health check
- http://localhost:8000/docs - interactive Swagger UI (try `/ask` from the browser)
- http://localhost:8000/redoc - alternative API docs
| Marker | Runtime | Meaning |
|---|---|---|
| `ollama` | 1–60 s | Requires a real Ollama at localhost:11434 |
| `mocked` | <50 ms | Uses respx to mock Ollama (error-path tests) |
Select with `pytest -m "ollama"`, `pytest -m mocked`, or `pytest -m "not ollama"`.
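Custom markers must be registered so pytest doesn't warn about unknown marks; in this repo that lives in `pyproject.toml`, roughly like this (the descriptions are assumed from the table above, not copied from the file):

```toml
[tool.pytest.ini_options]
markers = [
    "ollama: requires a real Ollama at localhost:11434",
    "mocked: uses respx to mock Ollama (error-path tests)",
]
```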
```
llm-api-testing/
├── app/
│   └── main.py          # FastAPI app + Ollama client + structured logging
├── tests/
│   └── test_api.py      # 23 tests, sectioned by pattern (app / mocked / integration)
├── reports/             # sample HTML reports, committed for preview
│   ├── report.html      # pytest-html report
│   └── htmlcov/         # coverage-html report
├── pyproject.toml       # pytest config + marker registration
├── requirements.txt     # runtime + test dependencies
└── README.md
```
FastAPI · httpx · pytest · pytest-asyncio · pytest-html · pytest-cov · respx · Ollama (llama3.2)