Metrics support #55

AleHD · 2023-08-31T22:41:00Z

Added support for calculating custom metrics during evaluation (e.g. accuracy).

Added the command-line argument --metrics. When set, it must be followed by one or more names of metric functions. For now the only supported metrics are perplexity, accuracy, count_loss_mask (the number of nonzero elements per sample in the loss_mask), count_instruct_mask (the same count, but excluding the extra tokens that surround the messages in the instruction tuning setting, i.e. the <|im_begin|> and <|im_end|> tokens), and instruct_accuracy.
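To illustrate the idea, here is a minimal, standard-library-only sketch of how a `--metrics` flag that takes one or more names could be parsed, and how three of the listed metrics might be computed from a loss mask. It is not the code from this PR: the function signatures, the `METRICS` registry, the `parse_metric_names` helper, and the use of plain nested lists in place of tensors are all assumptions made for the example.

```python
import argparse
import math


def count_loss_mask(loss_mask):
    """Number of nonzero loss-mask entries for each sample (row)."""
    return [sum(1 for m in row if m != 0) for row in loss_mask]


def accuracy(predictions, labels, loss_mask):
    """Fraction of masked-in tokens whose predicted id equals the label."""
    correct = 0
    total = 0
    for pred_row, label_row, mask_row in zip(predictions, labels, loss_mask):
        for pred, label, mask in zip(pred_row, label_row, mask_row):
            if mask != 0:
                total += 1
                correct += int(pred == label)
    return correct / total if total else 0.0


def perplexity(token_losses, loss_mask):
    """exp of the mean per-token loss, taken over masked-in tokens only."""
    total = 0.0
    count = 0
    for loss_row, mask_row in zip(token_losses, loss_mask):
        for loss, mask in zip(loss_row, mask_row):
            if mask != 0:
                total += loss
                count += 1
    return math.exp(total / count) if count else float("nan")


# Hypothetical registry mapping metric names to functions.
METRICS = {
    "perplexity": perplexity,
    "accuracy": accuracy,
    "count_loss_mask": count_loss_mask,
}


def parse_metric_names(argv):
    """Parse --metrics NAME [NAME ...]; unknown names are rejected."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--metrics", nargs="+", default=[],
                        choices=sorted(METRICS))
    return parser.parse_args(argv).metrics
```

With this sketch, `parse_metric_names(["--metrics", "accuracy", "count_loss_mask"])` returns the two names in order, and each one can then be looked up in `METRICS` during evaluation. An instruct variant would presumably apply the same counting with the delimiter-token positions zeroed out in the mask.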

Thanks to @andreaskoepf and his fork for serving as the inspiration for this implementation.

Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>

AleHD · 2023-09-02T03:40:25Z

Waiting on the instruction tuning PR. Once #40 is merged into main, merging this branch will be easier.

AleHD and others added 4 commits August 29, 2023 01:30

added metrics support

47a18e1

Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>

fixed import error

12ddd99

Fixed memory leak when using functools.cache

3ed3603

Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>

Added safeguard for instruct metrics

cfb97ce

AleHD marked this pull request as draft August 31, 2023 22:41

AleHD mentioned this pull request Sep 2, 2023

Better documentation #57

Merged

6 tasks

AleHD mentioned this pull request Sep 2, 2023

Instruct loss scalar #58

Merged

Merge branch 'main' into metrics

21e4f4a

AleHD marked this pull request as ready for review September 4, 2023 19:31

AleHD added 2 commits September 5, 2023 00:40

fixed finetune.sh script

51f4efb

Merge branch 'main' into metrics

9d0c44b

AleHD requested a review from martinjaggi September 6, 2023 17:48

AleHD added 3 commits September 6, 2023 20:08

Updated verify_correctness to follow new format of loss_func (from the refactor)

0538996

update finetune.py

099f0d0

Merge branch 'main' into metrics

c867fe3

martinjaggi approved these changes Sep 6, 2023

AleHD merged commit a8feb5b into main Sep 6, 2023