GitHub - Python-Test-Engineer/eval-framework

Combining Python Full Stack test framework with Evals.

Please see https://evaluating-ai-agents.netlify.app/ for more information.

For the PyTest Full Stack Framework please see https://pytest-cookbook.com/

Alhtough this is a UV based project, you can run it as a standard pip project with the requirements.txt file and a python -m venv venv to set up a virtual environment if you want one.

for UV

Make sure you have UV installed https://docs.astral.sh/uv/getting-started/installation/

Then uv sync.

uv run uv_test.py

Tests

If you want to run all the template tests in the tests folder:

To run all tests, uv run pytest -vs --tb=no. We use --tb=no as this supresses tracback which can be quite long in purposely designed error tests.

There are 3 purposely failed tests for demo purposes.

In 100 and 110 the tests will not be picked up by PyTest by default as the tests do not start with test_.

They need an OPENAI_API_KEY set in the environment so that the test can run.

Please see https://pytest-cookbook.com/ on how the PyTest Full Stack Framework works.

It is not needed for Evaluating AI Agents as these are code files outside of PyTest.

Name		Name	Last commit message	Last commit date
Latest commit History 297 Commits
.github/templates		.github/templates
.vscode		.vscode
_archive		_archive
_images		_images
chroma_db		chroma_db
config		config
htmlcov		htmlcov
log		log
reports		reports
results		results
screenshots		screenshots
src		src
tests		tests
utils		utils
.coverage		.coverage
.gitignore		.gitignore
.python-version		.python-version
.sesskey		.sesskey
01_demo_api.ipynb		01_demo_api.ipynb
01_test_groq_openai.py		01_test_groq_openai.py
02_api.ipynb		02_api.ipynb
02_py_llm.py		02_py_llm.py
03.2_router.ipynb		03.2_router.ipynb
03_test_llm.ipynb		03_test_llm.ipynb
05_article_writer_publisher_pricing_workflow.png		05_article_writer_publisher_pricing_workflow.png
06_demo_prompt_google_crm.md		06_demo_prompt_google_crm.md
10_annotated_10_sample_csv.csv		10_annotated_10_sample_csv.csv
10_annotations.db		10_annotations.db
10_fasthtml_annotation_app.py		10_fasthtml_annotation_app.py
10_sample_csv.csv		10_sample_csv.csv
COMMANDS.md		COMMANDS.md
PYDATA_LONDON_02SEP2025.pdf		PYDATA_LONDON_02SEP2025.pdf
PYDATA_LONDON_02SEP2025.pptx		PYDATA_LONDON_02SEP2025.pptx
PYDATA_SOUTHAMPTON_16SEP2025.pdf		PYDATA_SOUTHAMPTON_16SEP2025.pdf
PYDATA_SOUTHAMPTON_16SEP2025.pptx		PYDATA_SOUTHAMPTON_16SEP2025.pptx
README.md		README.md
USEFUL_LINKS.md		USEFUL_LINKS.md
_load.py		_load.py
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
requirements.txt		requirements.txt
test_uv.py		test_uv.py
uv.lock		uv.lock
workflow.mmd		workflow.mmd

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

for UV

Tests

About

Uh oh!

Releases

Packages

Languages

Python-Test-Engineer/eval-framework

Folders and files

Latest commit

History

Repository files navigation

for UV

Tests

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages