Pack small PPL eval sets together #199
Conversation
tokenizer:
  identifier: gpt2
  truncate_direction: right

save_folder: ${path.choose:${oc.env:SCRATCH_DIR,no_exist}/checkpoints,/results}/${oc.env:SLURM_JOB_ID,${run_name}}
save_overwrite: false
# Sharded checkpoints (best for restarts)
save_interval: 1000
save_num_checkpoints_to_keep: 2
# Unsharded checkpoints (for final storage)
save_interval_unsharded: 10000
save_num_unsharded_checkpoints_to_keep: -1

load_path: null

max_duration: 476837  # 2T tokens
global_train_batch_size: 2048
device_train_microbatch_size: 4

precision: amp_bf16

max_grad_norm: 1.0

speed_monitor:
  window_size: 20
What's all this? Should it be in this PR?
Oh, it's just moved?
Yes, just moved, so we don't have to scroll past all of the evaluation settings to find the other settings.
Adds the ability to handle multiple language modeling evaluation datasets in a single Evaluator while still tracking metrics for each dataset separately. This allows us to pack small perplexity evaluation datasets together, making the evaluation loop more efficient and letting us handle datasets that are otherwise too small to use (because they don't have enough examples for a single batch across all GPUs).
I did a test run to validate the code. Here are the results: https://wandb.ai/ai2-llm/c4-small/reports/Packed-evaluations--Vmlldzo0NTY2Mzg2
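For illustration only, here is a minimal sketch of the packing idea: several small eval sets share one loader, each example carries the name of its originating dataset, and per-dataset perplexity is accumulated after the forward pass. This is not the actual OLMo implementation; the "label" key, the collate function, and the assumed model output shape are all hypothetical.

# Sketch: pack small eval sets into one loader, keep per-dataset metrics.
from collections import defaultdict

import torch
from torch.utils.data import ConcatDataset, DataLoader


def pack_eval_sets(datasets: dict[str, torch.utils.data.Dataset]) -> DataLoader:
    """Concatenate several small eval sets so they share one DataLoader.

    Each example is assumed to be a dict of tensors; we attach the
    originating dataset's name under a hypothetical "label" key so the
    metrics can be split back out per dataset.
    """
    labeled = []
    for name, ds in datasets.items():
        labeled.append([{**ex, "label": name} for ex in ds])
    packed = ConcatDataset(labeled)
    # Keep examples as a plain list per batch; no padding logic in this sketch.
    return DataLoader(packed, batch_size=8, shuffle=False, collate_fn=lambda b: b)


def evaluate(model, loader) -> dict[str, float]:
    """Accumulate next-token cross-entropy per dataset, report per-dataset perplexity."""
    total_loss = defaultdict(float)
    total_tokens = defaultdict(int)
    model.eval()
    with torch.no_grad():
        for batch in loader:
            for ex in batch:
                input_ids = ex["input_ids"].unsqueeze(0)
                # Assumes the model returns per-token logits of shape [1, T, vocab].
                logits = model(input_ids)
                loss = torch.nn.functional.cross_entropy(
                    logits[0, :-1], input_ids[0, 1:], reduction="sum"
                )
                total_loss[ex["label"]] += loss.item()
                total_tokens[ex["label"]] += input_ids.numel() - 1
    return {
        name: float(torch.exp(torch.tensor(total_loss[name] / total_tokens[name])))
        for name in total_loss
    }

In practice the same effect can be achieved with a single Evaluator that routes each example's loss to the metric for its source dataset, which is what lets datasets too small for a full distributed batch still be evaluated.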