Log more dpo metrics #610

maxjeblick · 2024-02-07T12:19:17Z

This PR logs more metrics for DPO training (in particular cross entropy loss).

I refactored train.py a bit, it is now also possible to automatically log additional metrics/losses by adding them to the model's output dictionary using additional_log_ prefix (e.g. outputs["additional_log_chosen_rewards"] = ...)

Related to #594

pascal-pfeiffer

Thanks a lot @maxjeblick
Super helpful to have more metrics logged. Ideally, we would also want to show them in the UI, but having them on neptune and in the log files, is already much better.

Please have another look on the path change for the gpt_templates. I believe this slipped into your PR somehow. Otherwise, looks good to me!

pascal-pfeiffer · 2024-02-14T10:12:39Z

llm_studio/python_configs/text_causal_language_modeling_config.py

+        prompt_template_directory = os.path.join(
+            os.path.dirname(__file__), "../../prompts"
+        )
        self._possible_values["metric_gpt_template"] = possible_values.String(
-            values=(f.split(".")[0] for f in os.listdir("prompts"))
+            values=(f.split(".")[0] for f in prompt_template_directory)


Why this change? It fails, as the path is relative to the root app dir.

Thanks, will check. Reason for the change is to resolve prompts directory w.r.t. llm_studio/python_configs/text_causal_language_modeling_config.py to be able to run tests with root directory == directory where test is running (this is the default in PyCharm when a test is started within the code editor).

train.py

pascal-pfeiffer

Thanks again, lgtm!

maxjeblick and others added 15 commits February 5, 2024 13:51

log SampleAveragedCrossEntropyLoss in Neptune for dpo loss

c277238

try log to new losses in val as well

6101d64

try log to new losses in val as well

3760793

remove debugging

93ce6e8

refactor explicit logging keys

d284627

explicitly detach metrics

4f619cf

also log perplexity for rejected sample

524fa18

fix rejected ce

0dd50fe

refactor evaluation

5e74382

fix rank 0 metric calculation

56e914c

fix rank 0 metric calculation

25fea10

add test for perplexity

de131e7

Merge branch 'main' into max/log_more_dpo_metrics

d8dfa99

Merge branch 'main' into max/log_more_dpo_metrics

a545c74

fix style issue

bc86468

maxjeblick requested a review from pascal-pfeiffer February 12, 2024 09:39

pascal-pfeiffer requested changes Feb 14, 2024

View reviewed changes

maxjeblick added 2 commits March 4, 2024 09:57

Merge branch 'main' into max/log_more_dpo_metrics

0aed6d2

address pr comments

c4f8090

maxjeblick requested a review from pascal-pfeiffer March 4, 2024 09:03

pascal-pfeiffer approved these changes Mar 4, 2024

View reviewed changes

maxjeblick merged commit 40219d3 into main Mar 4, 2024
5 checks passed

maxjeblick deleted the max/log_more_dpo_metrics branch March 4, 2024 09:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Log more dpo metrics #610

Log more dpo metrics #610

maxjeblick commented Feb 7, 2024

pascal-pfeiffer left a comment

pascal-pfeiffer Feb 14, 2024

maxjeblick Feb 14, 2024

pascal-pfeiffer left a comment

Log more dpo metrics #610

Log more dpo metrics #610

Conversation

maxjeblick commented Feb 7, 2024

pascal-pfeiffer left a comment

Choose a reason for hiding this comment

pascal-pfeiffer Feb 14, 2024

Choose a reason for hiding this comment

maxjeblick Feb 14, 2024

Choose a reason for hiding this comment

pascal-pfeiffer left a comment

Choose a reason for hiding this comment