Add rerank recall metric to unitxt #662

jlqibm · 2024-03-13T21:54:27Z

Closes #661 in support of adding perplexity reranking to fm-eval.

jlqibm · 2024-03-13T21:54:58Z

@yoavkatz please review

codecov · 2024-03-13T22:12:44Z

Codecov Report

Attention: Patch coverage is 22.50%, with 31 lines in your changes missing coverage. Please review.

Project coverage is 90.87%. Comparing base (4274d2b) to head (a6c64c7).

Files	Patch %	Lines
src/unitxt/metrics.py	22.50%	31 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #662      +/-   ##
==========================================
- Coverage   91.13%   90.87%   -0.27%     
==========================================
  Files          98       98              
  Lines        9897     9937      +40     
==========================================
+ Hits         9020     9030      +10     
- Misses        877      907      +30

src/unitxt/metrics.py

requirements/base.rqr

elronbandel

Fix the requirements according to the unitxt no-requirements policy explained in the comments.

elronbandel · 2024-03-20T14:44:27Z

You need to add pytrec_eval to requirements/tests.rqr.

jlqibm · 2024-03-25T17:47:02Z

@elronbandel pytrec-eval is added.

yoavkatz · 2024-03-26T06:11:13Z

The tests failed with this:

The pre-commit failed with this:

(Do you have pre-commit installed with pre-commit install?)

jlqibm · 2024-03-27T17:58:05Z

@yoavkatz The pandas package is missing. So there are two questions here. One, why didn't this fail at the import statement? Two, which requirements file does pandas need to be added to?
Sorry about the test failure.

I have pre-commit installed now.

src/unitxt/metrics.py

yoavkatz · 2024-03-27T21:40:44Z

> @yoavkatz The pandas package is missing. So there are two questions here. One, why didn't this fail at the import statement? Two, which requirements file does pandas need to be added to? Sorry about the test failure.

There is a missing import of pandas. In general, if a metric needs specific packages like pandas or pytrec_eval, we import them inside the relevant metric code and not at the top of the metrics.py file, so that people who don't need the metric don't need the package installed.
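The local-import convention described above could look roughly like the sketch below. It is only an illustration: `OptionalDependencyMetric`, `MeanScoreMetric`, `BrokenMetric`, and `_require` are hypothetical names, not unitxt classes, and the standard-library `statistics` module stands in for an optional package such as pandas or pytrec_eval so the example runs anywhere.

```python
import importlib


class OptionalDependencyMetric:
    """Hypothetical base class for illustration; not the unitxt Metric class."""

    def _require(self, package_name):
        # Import at call time, so that merely importing this module never
        # requires the optional package to be installed.
        try:
            return importlib.import_module(package_name)
        except ImportError as err:
            raise ImportError(
                f"{type(self).__name__} requires the '{package_name}' package. "
                f"Install it to use this metric."
            ) from err


class MeanScoreMetric(OptionalDependencyMetric):
    """Toy metric; 'statistics' stands in for pandas or pytrec_eval."""

    def compute(self, scores):
        # The dependency is resolved only when compute() actually runs.
        statistics = self._require("statistics")
        return {"mean": statistics.mean(scores)}


class BrokenMetric(OptionalDependencyMetric):
    """Depends on a package name that is assumed not to be installed."""

    def compute(self, scores):
        self._require("package_that_is_not_installed_xyz")
        return {}
```

With this layout, a missing package surfaces as an `ImportError` naming the metric when that metric is used, and users of other metrics are unaffected.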

jlqibm · 2024-03-28T17:22:55Z

@elronbandel I already fixed the requested change but GitHub still thinks it's pending.

jlqibm · 2024-04-02T19:42:57Z

@yoavkatz can you tell what's holding up this merge? There appears to be an outstanding change request but I already fixed that a while ago.

Done

yoavkatz · 2024-04-02T20:28:37Z

I reviewed the code, and it looks fine so I approved.

@elronbandel - I think the main issue is that the coverage is low, although tests were added (but in the prepare file).
The reason is that coverage is not measured on the prepare files.

We should probably give guidance to write the test in test_metrics and not in the prepare file (unless using a wrapper like HuggingFaceMetric).
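As a rough illustration of that guidance, a test written as an ordinary unit test (rather than inside a prepare file) might look like the sketch below. Everything here is an assumption for illustration: `recall_at_k` is a hypothetical pure-Python helper, not the metric added in this PR (which relies on pytrec_eval), and the exact test file location is not stated in this thread.

```python
import unittest


def recall_at_k(ranked_ids, relevant_ids, k):
    """Hypothetical helper: fraction of relevant ids found in the top k."""
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    top_k = set(ranked_ids[:k])
    return len(top_k & relevant) / len(relevant)


class TestRecallAtK(unittest.TestCase):
    def test_partial_recall(self):
        # Only "d1" of the two relevant documents is in the top 2.
        score = recall_at_k(["d3", "d1", "d2"], ["d1", "d2"], k=2)
        self.assertAlmostEqual(score, 0.5)

    def test_full_recall(self):
        score = recall_at_k(["d1", "d2", "d3"], ["d1", "d2"], k=2)
        self.assertAlmostEqual(score, 1.0)

    def test_no_relevant_documents(self):
        self.assertEqual(recall_at_k(["d1"], [], k=1), 0.0)
```

Tests in this form can be run with `python -m unittest`, and the lines they exercise are counted by the coverage tooling.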

elronbandel reviewed Mar 13, 2024

src/unitxt/metrics.py

elronbandel reviewed Mar 13, 2024

requirements/base.rqr Outdated

elronbandel previously requested changes Mar 13, 2024

jlqibm added 2 commits March 19, 2024 15:27

Add rerank recall metric to unitxt

2fe45ba

Move metric requirements into metric code to match unitxt standards.

9d63394

elronbandel force-pushed the main branch from eda9f22 to 9d63394 Compare March 19, 2024 13:27

Add pytrec-eval to test reqs.

76375d8

yoavkatz reviewed Mar 27, 2024

src/unitxt/metrics.py Outdated

yoavkatz reviewed Mar 27, 2024

src/unitxt/metrics.py

jlqibm and others added 6 commits March 27, 2024 18:41

Add missing import. Localize imports.

1dcfc5d

Merge branch 'main' into main

888f872

Format fixes

7a75e41

Merge branch 'main' into main

602ed2c

Merge branch 'main' into main

d4ecc10

Merge branch 'main' into main

c91399c

jlqibm requested a review from elronbandel March 28, 2024 17:22

yoavkatz approved these changes Apr 2, 2024

Merge branch 'main' into main

a6c64c7

yoavkatz enabled auto-merge (squash) April 2, 2024 20:29

yoavkatz merged commit ea47b07 into IBM:main Apr 2, 2024
6 of 8 checks passed
