FEAT: Fairness of exposure #959
Comments
Thank you for opening this issue @bram49! IMO the deterministic version of exposure makes sense, but I'd love to hear other people's thoughts on this as well @fairlearn/fairlearn-maintainers. Could you give an example of how utility would be defined in scenario (2)?

As a side note, I think we need to be careful with overloading terminology. Demographic parity/disparate treatment* are typically used in the context of classification and regression problems. To avoid confusion, I think it would make sense to instead focus on the types of harm that are being measured. E.g., differences in exposure can be seen as a measure of allocation harm in a ranking scenario, whereas differences in exposure/utility are an indicator of quality-of-service harm.

[*In the Fairlearn community we generally try to avoid the term "disparate treatment", which originates from US laws on employment discrimination. Using that term may suggest compliance with US law even when that's not the case. First of all, most applications will fall outside of the employment domain. Moreover, considering only the output of the model (i.e., disregarding how it's used, by whom, etc.) is a very narrow frame.]
Thank you for the reply @hildeweerts
Fairness of exposure
I would like to extend Fairlearn with tools to deal with fairness in rankings (#945). I think that an intuitive and well-motivated metric for fairness in rankings is exposure, from the paper "Fairness of Exposure in Rankings" by Ashudeep Singh and Thorsten Joachims.
Exposure is defined as:

    Exposure(d_i | P) = Σ_j P_{i,j} · v_j

where d_i is a document, P is a probabilistic ranking (P_{i,j} is the probability that document d_i is placed at position j), and v_j = 1 / log2(1 + j) is a logarithmic position discount that models position bias (top positions receive disproportionately more attention).
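To make the definition concrete, here is a minimal sketch in NumPy. The function names and the example ranking matrix are mine, for illustration only; this is not a Fairlearn API.

```python
import numpy as np

def position_discounts(n):
    """Logarithmic discounts v_j = 1 / log2(1 + j) for positions j = 1..n."""
    positions = np.arange(1, n + 1)
    return 1.0 / np.log2(1.0 + positions)

def exposure(P):
    """Expected exposure of each document under a probabilistic ranking.

    P[i, j] is the probability that document i is placed at position j + 1,
    so each row and each column of P sums to 1 (doubly stochastic).
    """
    v = position_discounts(P.shape[1])
    return P @ v

# Example: two documents ranked deterministically (doc 0 first, doc 1 second).
P = np.array([[1.0, 0.0],
              [0.0, 1.0]])
print(exposure(P))  # doc 0 gets v_1 = 1.0, doc 1 gets v_2 = 1/log2(3)
```

For a deterministic ranking the matrix P is a permutation matrix, and the expected exposure reduces to the discount at each document's position.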
From exposure, various fairness metrics can be constructed, such as:
1. Allocation harm
Here the average exposure of documents is equalized across groups. Denoting the average exposure of group k by

    Exposure(G_k | P) = (1 / |G_k|) Σ_{d_i ∈ G_k} Exposure(d_i | P)

the demographic parity constraint is:

    Exposure(G_0 | P) = Exposure(G_1 | P)
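A sketch of this group-level comparison, again with hypothetical names and made-up example data (not a Fairlearn API):

```python
import numpy as np

def group_exposure(P, groups, k):
    """Average exposure of the documents in group k under ranking distribution P."""
    v = 1.0 / np.log2(1.0 + np.arange(1, P.shape[1] + 1))
    expo = P @ v                     # expected exposure per document
    return expo[groups == k].mean()  # average over group k

# Example: 4 documents ranked in index order; the first two belong to group 0.
P = np.eye(4)
groups = np.array([0, 0, 1, 1])
dp_gap = group_exposure(P, groups, 0) - group_exposure(P, groups, 1)
print(dp_gap)  # positive: group 0 occupies the top positions
```

Demographic parity would require this gap to be (close to) zero; a difference- or ratio-style aggregation of the two group exposures would fit Fairlearn's existing metric conventions.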
2. Quality-of-service harm
Here exposure is kept proportional to relevance, via the constraint:

    Exposure(G_0 | P) / U(G_0 | q) = Exposure(G_1 | P) / U(G_1 | q)

where U(G_k | q) is the average utility of group k, and utility is the relevance score on which the documents are ranked. As the paper illustrates, small differences in relevance between candidates can lead to huge differences in exposure.
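The following sketch (hypothetical names and example relevance scores, not a Fairlearn API) shows how nearly equal relevance can still produce very different exposure/utility ratios:

```python
import numpy as np

def exposure_utility_ratio(P, relevance, groups, k):
    """Average exposure of group k divided by its average utility (relevance)."""
    v = 1.0 / np.log2(1.0 + np.arange(1, P.shape[1] + 1))
    expo = P @ v
    mask = groups == k
    return expo[mask].mean() / relevance[mask].mean()

# Four documents with almost identical relevance, ranked by relevance.
P = np.eye(4)
relevance = np.array([0.81, 0.80, 0.79, 0.78])
groups = np.array([0, 0, 1, 1])

r0 = exposure_utility_ratio(P, relevance, groups, 0)
r1 = exposure_utility_ratio(P, relevance, groups, 1)
print(r0, r1)  # r0 is noticeably larger than r1 despite near-equal relevance
```

The quality-of-service constraint asks for r0 == r1; here the tiny relevance gap is amplified into a large exposure gap by the position discounts.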
Problem
The problem with this metric is that it requires a given probabilistic ranking P, which you can only construct when you have multiple rankings over the same documents. To work around this, and make the metric work for a single ranking τ, I thought about defining a group's exposure directly from the logarithmic discounts of its documents' positions in τ:

    Exposure(G_k | τ) = (1 / |G_k|) Σ_{d ∈ G_k} v_{τ(d)}

which would define demographic parity as:

    Exposure(G_0 | τ) = Exposure(G_1 | τ)
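A sketch of this single-ranking (deterministic) variant, with hypothetical names and example data:

```python
import numpy as np

def deterministic_group_exposure(ranking, groups, k):
    """Average position discount of group-k documents in a single ranking.

    ranking[j] is the index of the document placed at position j + 1.
    """
    n = len(ranking)
    v = 1.0 / np.log2(1.0 + np.arange(1, n + 1))
    expo = np.empty(n)
    expo[ranking] = v  # the document at position j + 1 receives discount v_{j+1}
    return expo[groups == k].mean()

# Example: doc 2 is ranked first, then doc 0, doc 3, doc 1.
ranking = np.array([2, 0, 3, 1])
groups = np.array([0, 0, 1, 1])
gap = (deterministic_group_exposure(ranking, groups, 0)
       - deterministic_group_exposure(ranking, groups, 1))
print(gap)  # negative here: group 1 holds positions 1 and 3
```

This needs only one observed ranking, at the cost of ignoring the randomization over rankings that the probabilistic formulation captures.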
Conclusion
I think that, with this adjustment, exposure is an effective way to quantify fairness in rankings. Please let me know what you think!