feat: Bandits part 4: Evaluation #27

typotter · 2024-07-05T21:06:50Z

Motivation and Context

Here is where the magic happens; where user data is matrix multiplied against bandit coefficient models to determine the best action for the user.

Description

Bandit Evaluator, a class to compute scores, weigh actions, and bucket the user to ultimately select an action

How has this been tested

Unit Tests

aarsilv · 2024-07-15T20:40:36Z

src/Bandits/BanditEvaluator.php

+        if (empty($actionsWithContexts)) {
+            throw new InvalidArgumentException("No actions provided for bandit evaluation");
+        }


We only want to do this if the bandit is still at play (e.g., this flag has a bandit).

This is to support the case of users removing bandits from the flag more in a friendly way, such as rolling out a winning variation. That way, if they just stop supplying actions but leave this in, it will still work as they expect.

You can see the node example here: https://github.com/Eppo-exp/js-client-sdk-common/blob/main/src/client/eppo-client.ts#L471

This is actually handled up in the EppoClient. My reasoning for throwing an exception here is that if we're invoking the BanditEvaluator, we've already matched an assignment to a bandit, pulled the Bandit model and should not have done all that work if the action list is empty.

Following the parallel discussion in Slack and the PR for the JS commons SDK, does this path eventually return the default variation? Or is the InvalidArgumentException handled differently? (Maybe it never even reaches this point if the action list is empty?)

This will return but not log the default variation. It sounds like our latest plan is for us to wait to check for zero actions until a variation has been assigned?

src/Bandits/BanditEvaluator.php

giorgiomartini0

👏 The math and overall logic look good to me (I don't know PHP). Left one really minor naming suggestion.

giorgiomartini0 · 2024-07-16T19:38:09Z

src/Bandits/BanditEvaluator.php

+        if (empty($actionsWithContexts)) {
+            throw new InvalidArgumentException("No actions provided for bandit evaluation");
+        }


Following the parallel discussion in Slack and the PR for the JS commons SDK, does this path eventually return the default variation? Or is the InvalidArgumentException handled differently? (Maybe it never even reaches this point if the action list is empty?)

giorgiomartini0 · 2024-07-16T19:39:33Z

src/Bandits/BanditEvaluator.php

+        $selectedActionContext = $actionsWithContexts[$selectedAction];
+        $actionScore = $actionScores[$selectedAction];
+        $actionWeight = $actionWeights[$selectedAction];


nit: for consistency, should these be selectedActionScore and selectedActionWeight?

giorgiomartini0 · 2024-07-16T20:00:13Z

src/Bandits/BanditEvaluator.php

+        $remainingWeight = max(0.0, 1.0 - array_sum($weights));
+        $weights[$bestActionKey] = $remainingWeight;


Very clever! Took me a moment to grok; this is adding a new key to weights (bestActionKey was filtered out above), not overwriting it.

aarsilv

Thanks for iterating! Approving for now, although we may need to revisit this SDK (and all the others) once we align on our updated method for handling empty actions

aarsilv · 2024-07-17T23:27:35Z

src/Bandits/BanditEvaluator.php

+        if (empty($actionsWithContexts)) {
+            throw new InvalidArgumentException("No actions provided for bandit evaluation");
+        }


This will return but not log the default variation. It sounds like our latest plan is for us to wait to check for zero actions until a variation has been assigned?

aarsilv · 2024-07-17T23:37:26Z

src/Bandits/BanditEvaluator.php

+                if ($aValue == $bValue) {
+                    return $a->action <=> $b->action;
+                }
+                return $aValue <=> $bValue;


🙌

Could this be simplified to:

return ($aValue <=> $bValue) ?: ($a->action <=> $b->action);

aarsilv · 2024-07-17T23:37:46Z

tests/Bandits/BanditEvaluatorTest.php

+        $this->assertEquals($expectedScore, $actualScore);
+    }
+
+    public function testScoreNumericIgnoringNonNumericAttributes(): void


aarsilv · 2024-07-17T23:37:57Z

tests/Bandits/BanditEvaluatorTest.php

+
+    public function testScoreNumericIgnoringNonNumericAttributes(): void
+    {
+        $numericAttributes = ['age' => 30, 'height' => 170, 'shouldBeANumber' => 'but_it_is_not'];


excellent naming

typotter · 2024-07-24T16:19:24Z

Thanks for the review, folks. Integrating this and applying the recent no-actions action in the Client PR.

typotter added 29 commits July 3, 2024 11:44

ignore

baf51c1

Merge branch 'main' into tp/banditvariations

48f2951

BanditVariation DTO and indexer

8e8a0b2

refine and comment

4677b02

Merge branch 'main' into tp/banditvariations

0085058

new DTOs

62da3f8

Merge branch 'main' into tp/bandits/DTOs

a32731f

DTO stack and tests

7122863

lint

3520ed6

Merge branch 'tp/lint' into tp/bandits/DTOs

a7afeb1

lint

d234582

wip

fd9e48e

more DTOs

3d8bc3a

Bandit Evaluator

be61d52

Bandit Evaluation

f9c0410

Merge branch 'main' into tp/banditvariations

494b8c3

Merge branch 'main' into tp/bandits/DTOs

1f55319

Merge branch 'tp/bandits/DTOs' into tp/bandits/eval

4455e2f

lint

19048aa

fixtest

b708511

lint

1a8eeb9

finetune

b75879a

lint

f5061a4

changes on changes

a9cd9d5

Merge branch 'main' into tp/bandits/DTOs

68e8465

fine tune

be5aaf9

temp wip

bd37034

Tests and fix for attribute set

fae3490

merge DTOs

b82b063

typotter marked this pull request as ready for review July 5, 2024 21:40

typotter requested a review from giorgiomartini0 July 12, 2024 18:36

typotter added 16 commits July 15, 2024 09:15

Creation of empty indexer

81467f2

add hasBandits to Indexer

5f7e472

new input validation method

afa5a4c

flatten cache resource meta

61c4e93

Use metatada

67515a4

lint

d894350

metadata type

9912c53

use correct reserved key

f0b2e51

Merge branch 'tp/config/opt' into tp/bandits/eval

8e6ff70

types

97a7906

Merge branch 'tp/config/opt' into tp/bandits/eval

71bb4e6

CAsinG

c15d28e

Merge branch 'tp/config/opt' into tp/bandits/eval

a9384cb

run tests on all PRs

b5505b9

composer

6fa16d2

Merge branch 'tp/config/opt' into tp/bandits/eval

51e42f3

typotter changed the base branch from main to tp/config/opt July 15, 2024 18:04

typotter requested a review from aarsilv July 15, 2024 20:10

aarsilv reviewed Jul 15, 2024

View reviewed changes

Base automatically changed from tp/config/opt to main July 16, 2024 04:11

merge main

68df489

typotter requested a review from aarsilv July 16, 2024 04:18

giorgiomartini0 approved these changes Jul 16, 2024

View reviewed changes

aarsilv approved these changes Jul 17, 2024

View reviewed changes

typotter added 2 commits July 24, 2024 10:08

pr iterate

97ef97e

better grok

17a93d3

typotter merged commit 3bc88a3 into main Jul 24, 2024
1 check passed

typotter deleted the tp/bandits/eval branch July 24, 2024 16:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Bandits part 4: Evaluation #27

feat: Bandits part 4: Evaluation #27

typotter commented Jul 5, 2024 •

edited

Loading

aarsilv Jul 15, 2024

typotter Jul 16, 2024

giorgiomartini0 Jul 16, 2024

aarsilv Jul 17, 2024

giorgiomartini0 left a comment

giorgiomartini0 Jul 16, 2024

giorgiomartini0 Jul 16, 2024

giorgiomartini0 Jul 16, 2024

aarsilv left a comment

aarsilv Jul 17, 2024

aarsilv Jul 17, 2024

aarsilv Jul 17, 2024

aarsilv Jul 17, 2024

typotter commented Jul 24, 2024

		$remainingWeight = max(0.0, 1.0 - array_sum($weights));
		$weights[$bestActionKey] = $remainingWeight;

feat: Bandits part 4: Evaluation #27

feat: Bandits part 4: Evaluation #27

Conversation

typotter commented Jul 5, 2024 • edited Loading

Motivation and Context

Description

How has this been tested

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

giorgiomartini0 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

aarsilv left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

typotter commented Jul 24, 2024

typotter commented Jul 5, 2024 •

edited

Loading