Evaluator #548

javipacheco · 2023-11-20T08:48:54Z

This PR adds a first approach to an evaluator. The evaluator is a tool for evaluating the performance of a model on a dataset. In this case, we use deepeval as the evaluation framework for LLMs.

Follow the README to install the dependencies, then use the Gradle task to generate a website with the results like this:

raulraja

Thank you, @javipacheco, looks great!

Montagon

Just a minor comment and a suggestion. Great job, @javipacheco!

evaluator-example/README.md

evaluator/src/main/kotlin/com/xebia/funcional/xef/evaluator/TestBuilder.kt

tomascayuelas · 2023-11-22T10:57:45Z

As we discussed, here are my comments:

Installing Poetry: the instructions only cover macOS. I would link directly to the Poetry website and leave it at that:
https://python-poetry.org/docs/#installation
Install pyenv to set the system Python to 3.10.0; otherwise it will not work, because on some operating systems the Poetry installation brings in Python 3.12 by default.
Consider creating a virtualenv to avoid version problems:
virtualenv venv --python=python3.10.0
source venv/bin/activate
This solves all the versioning and OS problems.

In the pyproject.toml part:

Remove ragas.
Remove the README.md reference or add the file, because installing the project fails with this message:
If you do not want to install the current project use --no-root.
Check:
packages = [{include = "xef_evaluator"}]
Organize the files: the .py files and the .html, .css, and .js files could go inside an assets or public folder.

Review the README.md so that it covers these steps and you are done.

Nice job @javipacheco 🥳
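
The Python version constraint described in the comment above could be checked before running the install. The following is a minimal sketch, not part of the PR: the helper names `is_supported` and `describe` are hypothetical, and treating only the 3.10 series as supported is an assumption drawn from the comment (3.10.0 reported as working, 3.12 as failing).

```python
import sys

# Assumption: only the 3.10 series counts as supported, because the review
# comment reports 3.10.0 working and 3.12 failing. Other versions are untested.
SUPPORTED = (3, 10)


def is_supported(version=None):
    """Return True when the major.minor part of `version` equals SUPPORTED.

    `version` is any sequence such as sys.version_info or (3, 10, 0);
    it defaults to the running interpreter.
    """
    if version is None:
        version = sys.version_info
    return tuple(version[:2]) == SUPPORTED


def describe(version=None):
    """Build a one-line status message for the given interpreter version."""
    if version is None:
        version = sys.version_info
    found = f"{version[0]}.{version[1]}"
    if is_supported(version):
        return f"Python {found}: OK"
    expected = f"{SUPPORTED[0]}.{SUPPORTED[1]}"
    return f"Python {found}: expected {expected}; activate the 3.10 virtualenv first"
```

Calling `print(describe())` inside the activated virtualenv would show whether the interpreter that Poetry will use matches the version the comment recommends.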

javipacheco added 11 commits November 20, 2023 08:59

Evaluator

4a2c9a1

Removing example file

8883c8b

Python test creates a Json result

b7803a1

Removing pycache

e25f38e

Spotlless aply

7de7970

Static Website

0d9638b

Changes in evaluator module

0010350

Merge branch 'main' into evaluator

1a78a64

New evaluator-example module

f88e932

Merge branch 'main' into evaluator

79dcade

Removing pytest cache

3ee3385

javipacheco marked this pull request as ready for review November 21, 2023 14:23

raulraja previously approved these changes Nov 21, 2023


Creating static website for test results

3d6a499

javipacheco dismissed raulraja’s stale review via 3d6a499 November 22, 2023 09:22

README updated

635a8df

Montagon previously approved these changes Nov 22, 2023


evaluator-example/README.md (outdated)

evaluator/src/main/kotlin/com/xebia/funcional/xef/evaluator/TestBuilder.kt (outdated)

Changes in the module structure

9a21cb9

javipacheco dismissed Montagon’s stale review via 9a21cb9 November 22, 2023 11:10

README updated

bae361e

Montagon approved these changes Nov 22, 2023


javipacheco merged commit 95de0d5 into main Nov 22, 2023
6 checks passed

javipacheco deleted the evaluator branch November 22, 2023 11:29
