feat: add classification accuracy semantic robustness eval algo #47

xiaoyi-cheng · 2023-10-18T05:41:14Z

Description of changes:
add classification accuracy semantic robustness eval algo

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

polaschwoebel

Looks good! Please implement the two quick fixes below.

We can discuss later whether to address the point at the bottom; it is not important for the MVP.

Quick fixes:

1. ClassificationAccuracySemanticRobustness is missing from eval_algo_mapping.py, so it cannot be imported like the other algorithms (for example, in the notebooks).
2. In the example notebooks, first cell, the line
# from amazon_fmeval.eval_algo_mapping import get_eval_algorithm
needs to be
from amazon_fmeval import get_eval_algorithm
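To illustrate why a class that is absent from the mapping cannot be looked up by name, here is a minimal sketch of a name-to-class registry. It is not the actual contents of eval_algo_mapping.py: the stand-in classes, the string keys, the `EVAL_ALGORITHMS` dict, and the `lookup_eval_algorithm` helper are all hypothetical.

```python
from typing import Dict, Type


class EvalAlgorithm:
    """Hypothetical stand-in for a common eval-algorithm base class."""


class ClassificationAccuracy(EvalAlgorithm):
    """Stand-in for an algorithm that is already registered."""


class ClassificationAccuracySemanticRobustness(EvalAlgorithm):
    """Stand-in for the algorithm added in this PR."""


# Hypothetical registry: only classes listed here are reachable by name.
EVAL_ALGORITHMS: Dict[str, Type[EvalAlgorithm]] = {
    "classification_accuracy": ClassificationAccuracy,
    # Without the next entry, the lookup below raises for the new algorithm.
    "classification_accuracy_semantic_robustness": ClassificationAccuracySemanticRobustness,
}


def lookup_eval_algorithm(name: str) -> EvalAlgorithm:
    """Instantiate the algorithm registered under `name` (hypothetical helper)."""
    try:
        return EVAL_ALGORITHMS[name]()
    except KeyError:
        raise ValueError(f"Unknown eval algorithm: {name!r}") from None


algo = lookup_eval_algorithm("classification_accuracy_semantic_robustness")
```

With the second entry removed from the dict, the same call would raise `ValueError`, which is roughly the failure a notebook user would hit.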

More involved, for later:

Only accuracy is reported for robustness (not "balanced_accuracy_score", "precision_score", or "recall_score"). Adding those will probably require some refactoring, because they cannot be computed on a per-sample basis and need the whole dataset at once.
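The difference between per-sample and dataset-level metrics can be sketched in plain Python. This is an illustration only, not the PR's implementation: the helper names and the toy labels are made up, and balanced accuracy is computed here as the mean of per-class recall.

```python
from collections import defaultdict
from typing import Dict, List, Sequence


def per_sample_accuracy(y_true: Sequence[str], y_pred: Sequence[str]) -> List[int]:
    """Score each sample independently: 1 if the prediction matches, else 0."""
    return [int(t == p) for t, p in zip(y_true, y_pred)]


def balanced_accuracy(y_true: Sequence[str], y_pred: Sequence[str]) -> float:
    """Mean of per-class recall; needs all labels before it can be computed."""
    correct: Dict[str, int] = defaultdict(int)
    total: Dict[str, int] = defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        correct[t] += int(t == p)
    recalls = [correct[label] / total[label] for label in total]
    return sum(recalls) / len(recalls)


# Toy, imbalanced data: the model always predicts "pos".
y_true = ["pos", "pos", "pos", "neg"]
y_pred = ["pos", "pos", "pos", "pos"]

# Accuracy is just the average of independent per-sample scores.
scores = per_sample_accuracy(y_true, y_pred)
accuracy = sum(scores) / len(scores)

# Balanced accuracy depends on per-class counts over the whole dataset.
balanced = balanced_accuracy(y_true, y_pred)
```

Accuracy can be accumulated one sample at a time, whereas balanced accuracy (and likewise precision and recall) only exists once the per-class counts for the full dataset are known.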

src/amazon_fmeval/eval_algorithms/__init__.py

polaschwoebel · 2023-10-18T14:07:25Z

src/amazon_fmeval/eval_algorithms/classification_accuracy_semantic_robustness.py

+    num_perturbations: int = 5
+    seed: int = 5
+    # BUTTER FINGER PERTURBATION
+    butter_finger_perturbation_prob: Optional[float] = 0.1


These defaults should be the same across all robustness evals. Consider turning them into constants.

Later: abstract away base_task (for QA, summarization, classification) and the robustness logic to avoid duplication.
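One way the shared-constants suggestion could look, as a sketch: the default values are taken from the diff above, but the constant names and both config class names are hypothetical and not the repository's actual identifiers.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical shared constants; values match the defaults in the diff.
DEFAULT_NUM_PERTURBATIONS = 5
DEFAULT_SEED = 5
DEFAULT_BUTTER_FINGER_PERTURBATION_PROB = 0.1


@dataclass(frozen=True)
class ClassificationRobustnessConfig:
    """Hypothetical config for the classification robustness eval."""

    num_perturbations: int = DEFAULT_NUM_PERTURBATIONS
    seed: int = DEFAULT_SEED
    butter_finger_perturbation_prob: Optional[float] = DEFAULT_BUTTER_FINGER_PERTURBATION_PROB


@dataclass(frozen=True)
class QARobustnessConfig:
    """Hypothetical config for another robustness eval reusing the same defaults."""

    num_perturbations: int = DEFAULT_NUM_PERTURBATIONS
    seed: int = DEFAULT_SEED
    butter_finger_perturbation_prob: Optional[float] = DEFAULT_BUTTER_FINGER_PERTURBATION_PROB


classification_cfg = ClassificationRobustnessConfig()
qa_cfg = QARobustnessConfig()
```

Because both configs read from the same module-level constants, changing a default in one place keeps every robustness eval in sync.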

xiaoyi-cheng requested a review from bilalaws October 18, 2023 05:41

polaschwoebel previously approved these changes Oct 18, 2023

xiaoyi-cheng dismissed polaschwoebel’s stale review via 6720541 October 18, 2023 21:11

xiaoyi-cheng force-pushed the classificationsemantic branch from 274ee28 to 6720541 Compare October 18, 2023 21:11

feat: add classification accuracy semantic robustness eval algo

3ccbfd9

xiaoyi-cheng force-pushed the classificationsemantic branch from 6720541 to 3ccbfd9 Compare October 18, 2023 21:14

malhotra18 approved these changes Oct 18, 2023

Merge branch 'main' into classificationsemantic

a2dbc35

pinaraws approved these changes Oct 19, 2023

xiaoyi-cheng merged commit a06781c into aws:main Oct 19, 2023
3 checks passed

xiaoyi-cheng deleted the classificationsemantic branch October 19, 2023 00:23
