🌌🧐 Macro evaluation #850

mberr · 2022-03-28T17:16:44Z

Adds an evaluator for computing macro averaged rank-based metrics.

Also enables docstr-coverage and darglint for pykeen.evaluation and adds missing docstrings.

Dependencies

⚖️🌡️ Weighted Rank-Based Metrics #837

trigger ci

src/pykeen/evaluation/rank_based_evaluator.py

tests/test_evaluation/test_evaluators.py

cthoyt

Needs a bit of high-level documentation. can you add some notes on this in understanding_evaluation.rst (e.g., what issues are caused by micro evaluation? how does macro evaluation work and how does it solve these issues?) Maybe copy-paste some of the content that didn't make it into the metrics manuscript

trigger ci

mberr · 2022-03-30T16:10:07Z

Needs a bit of high-level documentation. can you add some notes on this in understanding_evaluation.rst (e.g., what issues are caused by micro evaluation? how does macro evaluation work and how does it solve these issues?) Maybe copy-paste some of the content that didn't make it into the metrics manuscript

e1d41fb - feel free to adjust / rephrase

trigger ci

mberr · 2022-03-30T16:26:17Z

some parts of the metric description in the understanding evaluation part is incorrect, but also duplicated from the class doc itself, e.g., here, where the section about adjusted mean rank describes the adjusted MRR, and the expectation is incorrect.

pykeen/docs/source/tutorial/understanding_evaluation.rst

Lines 128 to 160 in 9cb4899

    
           Adjusted Mean Rank 
        
           ****************** 
        
           The expectation of an inverse-uniform distributed variable $\frac{1}{X} \sim \mathcal{U}(\frac{1}{a},\frac{1}{b})$ 
        
           is $\mathbb{E}\left[\frac{1}{X}\right] = \frac{\ln b - \ln a}{b - a}$. 
        
           Given our uniformly distributed variable $r_i$  with parameters $a=1$ and $b=N_i$ and its corresponding 
        
           inverse-uniform distributed variable $r_i^{-1}$, we get: 
        
           .. math:: 
        
               \mathbb{E}\left[r_i^{-1}\right] 
        
               = \frac{\ln 1 - \ln N_i}{N_i - 1} 
        
               = \frac{\ln N_i}{N_i - 1} 
        
               \doteq \frac{\ln n}{n - 1} 
        
           The expected value of the mean rank is then derived like: 
        
           .. math:: 
        
               \mathbb{E}\left[\text{MRR}\right] 
        
               = \mathbb{E}\left[\frac{1}{n} \sum \limits_{i=1}^n r_i^{-1}\right] 
        
               = \frac{1}{n} \sum \limits_{i=1}^n \mathbb{E}\left[r_i^{-1}\right] 
        
               = \mathbb{E}\left[r_i^{-1}\right] 
        
               \doteq \frac{\ln n}{n - 1} 
        
           The adjusted mean rank (AMR) was introduced by [berrendorf2020]_. It is defined as the ratio 
        
           of the mean rank to the expected mean rank 
        
           .. math:: 
        
               \text{MRR}^{*}(r_1,\ldots,r_n) = \frac{\text{MRR}(r_1,\ldots,r_n)}{\mathbb{E}\left[\text{MRR}\right] } 
        
           It lies on the open interval $(0, 2)$ where lower is better.

maybe it is better to directly link to the class reference?

cthoyt · 2022-03-31T11:43:32Z

@mberr good idea

trigger ci

src/pykeen/metrics/ranking.py

Trigger CI

mberr added 30 commits March 14, 2022 16:56

add weights parameter

04fdc35

add class-var

3a84993

add weighted MR

f4a9d30

add test

7622a4d

use numpy builtin

0842260

import weighted median

be0c839

import weighted hmean

acf2527

fix refactoring artifact

8b103ea

update more metrics

28e3047

expose helpers

9508963

add another test

8141b4d

Merge branch 'master' into macro-metrics-2

9a1d097

Merge remote-tracking branch 'origin/master' into macro-metrics-2

f2d63bd

add todos

d84d66c

add some more assertions

bc03a3a

fix direction test

792e845

update weighted HM

5320292

fix harmonic mean

809b61a

fix weighted median for odd size

fab2525

trigger ci

087d3d2

fix weighted median corner case

242fb24

trigger ci

3fd6586

implement expected value and variance for MR

df5a5a2

trigger ci

06b2f4d

update adjusted metrics

6975f5a

trigger ci

expectation & variance for weighted H@k

d42169b

reduce code duplication

bf19fc4

trigger ci

fix docstrings

814381d

trigger ci

update signature

0a40f99

trigger ci

fix missing conversion to float

6083e7b

cthoyt reviewed Mar 29, 2022

View reviewed changes

src/pykeen/evaluation/rank_based_evaluator.py Outdated Show resolved Hide resolved

cthoyt reviewed Mar 29, 2022

View reviewed changes

tests/test_evaluation/test_evaluators.py Show resolved Hide resolved

cthoyt requested changes Mar 29, 2022

View reviewed changes

cthoyt and others added 3 commits March 30, 2022 01:48

Add stub in tutorial

0c4e6d2

Update rank_based_evaluator.py

28d27b7

update evaluation tutorial

e1d41fb

trigger ci

cthoyt approved these changes Mar 30, 2022

View reviewed changes

mberr added 4 commits March 30, 2022 18:12

add noqa

9cb4899

trigger ci

fix sentence

afd98c6

trigger ci

export rank-based evaluator variations

aea5a62

trigger ci

link example

7e97319

trigger ci

mberr added 9 commits March 31, 2022 14:20

move some documentation around

0ab4527

move h@k to the class

97b924c

move doc to class

fa2a714

re-insert AMR(I) docs

8fc7fe3

fix link

c04a590

interlink with sklearn & black

623fe8a

remove reference from references.rst

4cf1c7b

trigger ci

extend module docstring

1fedc06

extend module docstring

788eeff

trigger ci

mberr commented Mar 31, 2022

View reviewed changes

src/pykeen/metrics/ranking.py Outdated Show resolved Hide resolved

cthoyt added 2 commits March 31, 2022 16:09

Update docs

c9ef741

Update ranking.py

2359fae

Trigger CI

cthoyt approved these changes Mar 31, 2022

View reviewed changes

Update references

72f9c56

Trigger CI

mberr merged commit 3dfdc30 into master Mar 31, 2022

mberr deleted the macro-evaluator branch March 31, 2022 14:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🌌🧐 Macro evaluation #850

🌌🧐 Macro evaluation #850

mberr commented Mar 28, 2022 •

edited

cthoyt left a comment •

edited

mberr commented Mar 30, 2022

mberr commented Mar 30, 2022 •

edited

cthoyt commented Mar 31, 2022

🌌🧐 Macro evaluation #850

🌌🧐 Macro evaluation #850

Conversation

mberr commented Mar 28, 2022 • edited

Dependencies

cthoyt left a comment • edited

Choose a reason for hiding this comment

mberr commented Mar 30, 2022

mberr commented Mar 30, 2022 • edited

cthoyt commented Mar 31, 2022

mberr commented Mar 28, 2022 •

edited

cthoyt left a comment •

edited

mberr commented Mar 30, 2022 •

edited