ndcg refactor #1481

FelipeAdachi · 2024-03-13T01:09:38Z

Description

This PR:

Adds ndcg value expecations to the tests, not just counts
Changes convert_non_numeric=True to be used for string columns (integers, floats in scores and bools should use the default convert_non_numeric=False )
adds score_column support: when a score is available, like in this example, one should use score_column and target_column , and not prediction_column (see 2 last tests on the sklearn examples)
Changes the logic to generate target column values to make it compatible with all scenarios
Fixes prediction and ideal relevance calculation for non numeric case
Handles DivisionbyZero edge case when idcg=0 (ndcg set to 1 if no relevant documents exist)
If K is not passed, metrics will be calculated according to predictions cols's length (and metrics named accordingly - k is no longer omitted in the names when k is None)

For the Numeric case:

If predictions+target or scores+target columns are both provided, they need to be of same length.
If prediction_column is provided with target column, prediction col contains the rank of suggested items, starting with 1
If only prediction column is provided, it is assumed that the order encodes the ranks of recommendations: first item in the list is the first recommendation. The value in the list encodes the relevance score
I have reviewed the Guidelines for Contributing and the Code of Conduct.

python/whylogs/experimental/api/logger/__init__.py

jamie256

Left a few questions and minor comments. Thanks for working on these!

python/whylogs/experimental/api/logger/__init__.py

python/tests/experimental/api/test_logger.py

python/whylogs/experimental/api/logger/__init__.py

python/tests/experimental/api/test_logger.py

jamie256

Looks better, thanks!

felipe207 added 3 commits March 12, 2024 22:03

ndcg fixes

1d82309

fixes and make tests pass

880ac92

precommit

9ef98f0

FelipeAdachi changed the title ~~Update predicted and ideal relevances~~ ndcg refactor Mar 13, 2024

murilommen reviewed Mar 13, 2024

View reviewed changes

python/whylogs/experimental/api/logger/__init__.py Outdated Show resolved Hide resolved

murilommen reviewed Mar 13, 2024

View reviewed changes

python/whylogs/experimental/api/logger/__init__.py Outdated Show resolved Hide resolved

jamie256 reviewed Mar 13, 2024

View reviewed changes

felipe207 added 5 commits March 13, 2024 18:44

lose comments

82473b1

remove nested function

1399c2e

correct prediction cols to be indices to target cols

7845d36

change test

5a6ec0a

change semantics for prediction col

2861975

jamie256 approved these changes Mar 14, 2024

View reviewed changes

jamie256 merged commit 164985c into mainline Mar 14, 2024
19 checks passed

jamie256 deleted the dev/felipe/ndcg-fixes branch March 14, 2024 23:01

jamie256 linked an issue Mar 14, 2024 that may be closed by this pull request

log_batch_ranking_metrics error #1480

Closed

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ndcg refactor #1481

ndcg refactor #1481

FelipeAdachi commented Mar 13, 2024 •

edited

jamie256 left a comment

jamie256 left a comment

ndcg refactor #1481

ndcg refactor #1481

Conversation

FelipeAdachi commented Mar 13, 2024 • edited

Description

jamie256 left a comment

Choose a reason for hiding this comment

jamie256 left a comment

Choose a reason for hiding this comment

FelipeAdachi commented Mar 13, 2024 •

edited