Evaluation validation & restart #557

Merged: 11 commits from czaloom-validate-jobs-before-compute into main on Apr 23, 2024
Conversation

@czaloom (Collaborator) commented Apr 21, 2024

Issue Description

Dataset and model validation for evaluations currently happens inside the metric computation, so a filter mismatch fails silently: the error is logged inside the job, but it is never surfaced to the caller.

Example

{"method": "compute_detection_metrics", "event": "Valor Exception: Evaluation '136'", "level": "error", "timestamp": "2024-04-19T21:24:08.293418Z", "exception": [{"exc_type": "RuntimeError", "exc_value": "Model '2fK6nXvtq9JfMnZntWiI7kLMH94_detect' does not meet filter requirements.", "syntax_error": null, "is_cause": false, "frames": [{"filename": "/src/valor_api/backend/metrics/metric_utils.py", "lineno": 292, "name": "wrapper", "line": "", "locals": {"args": "()", "kwargs": "\"{'db': <sqlalchemy.orm.session.Session object at 0x7fc75681a1d0>, 'evaluation_id\"+7", "db": "<sqlalchemy.orm.session.Session object at 0x7fc75681a1d0>", "evaluation_id": "136", "e": "'RuntimeError(\"Model \\'2fK6nXvtq9JfMnZntWiI7kLMH94_detect\\' does not meet filter re'+13", "fn": "<function compute_detection_metrics at 0x7fc759deec20>"}}, {"filename": "/src/valor_api/backend/metrics/detection.py", "lineno": 941, "name": "compute_detection_metrics", "line": "", "locals": {"db": "<sqlalchemy.orm.session.Session object at 0x7fc75681a1d0>", "evaluation_id": "136", "_": "()", "evaluation": "<valor_api.backend.models.Evaluation object at 0x7fc756818fa0>", "groundtruth_filter": "\"Filter(dataset_names=['2V2Z9CNQHCjuYu0R0XLCzFDlfD1_Object_Detection'], dataset_m\"+374", "prediction_filter": "\"Filter(dataset_names=['2V2Z9CNQHCjuYu0R0XLCzFDlfD1_Object_Detection'], dataset_m\"+408", "parameters": "\"EvaluationParameters(task_type=<TaskType.OBJECT_DETECTION: 'object-detection'>, \"+251", "datasets": "\"[(34, '2V2Z9CNQHCjuYu0R0XLCzFDlfD1_Object_Detection', {'task': 'Object Detection\"+376", "model": "None"}}]}]}

Expected Behavior

If a dataset or model has no data conforming to the evaluation filter, the job should finish with status Done and return no metrics.
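
A minimal sketch of that expected flow, assuming a simplified status enum and hypothetical helpers (`query_matching_datasets`, `query_matching_model`, `compute_metrics`) rather than the real valor_api internals:

```python
from enum import Enum


class EvaluationStatus(str, Enum):
    PENDING = "pending"
    RUNNING = "running"
    DONE = "done"
    FAILED = "failed"


def query_matching_datasets(db, evaluation):
    # Hypothetical: return datasets with data satisfying the evaluation filter.
    return []


def query_matching_model(db, evaluation):
    # Hypothetical: return the model if it has predictions matching the filter.
    return None


def run_evaluation(db, evaluation):
    """Validate the filters up front: if nothing matches, finish cleanly as
    DONE with an empty metric list instead of raising mid-computation."""
    datasets = query_matching_datasets(db, evaluation)
    model = query_matching_model(db, evaluation)
    if not datasets or model is None:
        evaluation.status = EvaluationStatus.DONE
        evaluation.metrics = []
        return evaluation
    evaluation.status = EvaluationStatus.RUNNING
    evaluation.metrics = compute_metrics(db, evaluation)  # hypothetical
    evaluation.status = EvaluationStatus.DONE
    return evaluation
```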

Other Additions

  • Evaluation jobs with a Failed status will restart if queried (see the sketch below).
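
A sketch of that restart path, reusing the EvaluationStatus enum from the sketch above; `fetch_evaluation` and `enqueue_evaluation_job` are hypothetical stand-ins for the real lookup and job dispatch:

```python
def get_evaluation(db, evaluation_id):
    """On read, re-enqueue a FAILED evaluation instead of returning the
    stale failure, so a transient error is retried on the next query."""
    evaluation = fetch_evaluation(db, evaluation_id)  # hypothetical lookup
    if evaluation.status == EvaluationStatus.FAILED:
        evaluation.status = EvaluationStatus.PENDING
        enqueue_evaluation_job(db, evaluation_id)  # hypothetical dispatcher
    return evaluation
```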

@czaloom linked an issue Apr 21, 2024 that may be closed by this pull request
@czaloom marked this pull request as ready for review April 21, 2024 17:46
@rsbowman-striveworks (Contributor) left a comment

The change to mark evaluations with no data as DONE makes sense to me. I'm open to the other changes but less sure they're the right thing to do.

Review threads (resolved):
  • api/valor_api/crud/_create.py
  • api/valor_api/backend/core/evaluation.py (outdated)
@ntlind previously approved these changes Apr 22, 2024
@ntlind dismissed their stale review April 22, 2024 15:18

waiting for thoughts on Sean's comments

@czaloom merged commit 0f8f727 into main Apr 23, 2024
10 checks passed
@czaloom deleted the czaloom-validate-jobs-before-compute branch April 23, 2024 16:10
Development

Successfully merging this pull request may close these issues:
  • BUG: Evaluations over empty sets should give response code

3 participants