Adopt `PreTrainedModelWrapper` for Hugging Face models #215

jon-tow · 2023-01-23T20:24:13Z

Summary

This PR adds the following changes:

Adopts a modified version of the PreTrainedModelWrapper as implemented in the trl package, here, to allow for flexible wrapping of Hugging Face (HF) models. This is useful for providing intuitive access to Hugging Face PreTrainedModel attributes such as push_to_hub and save_pretrained without having to access the underlying model.
Adds ILQL save_pretrained support`
Introduces the following for HF-based architectures:
- Renames architectures to AutoModelFor... to match Hugging Face counterparts.
- Removes base_model.transformer and base_model.lm_head references and instead extracts the final hidden states from all hidden states in the forward pass. This saves 2x memory on save to disk where previously a model's state_dict stored both .transformer and .lm_head as separate states with the underlying transformer.
Moves modeling code out of the trainer dir into a separate models dir.
Renames some utils.modeling HF attribute getters from causal_lm to decoder as many of these utils are helpful in general e.g. for encoder-decoder (seq2seq) models as well (T5Branch uses the same getter for accessing decoder parts of the transformer).

TODOs

Convert test scripts to proper unit tests for save_pretrained and from_pretrained for ILQL and PPO
Add from_config method to support custom model configs, e.g. as used in some examples

trlx/examples/randomwalks/ilql_randomwalks.py

Line 24 in b2ce1a4

GPT2Config(n_layer=6, n_embd=144, vocab_size=23),

Reports

SFT Sentiments: https://api.wandb.ai/links/jon-tow/8euree8w
RandomWalks (PPO & ILQL): https://api.wandb.ai/links/jon-tow/gv1z3lzt
Sentiments (PPO): https://api.wandb.ai/links/jon-tow/gqcgkjha
Sentiments (ILQL): https://api.wandb.ai/links/jon-tow/pvcxmos2
Summarize CNN/DailyMail (T5 PPO): https://api.wandb.ai/links/jon-tow/brordspn

…nto update-pre-commit

into update-save-pretrained

…ave-pretrained

cat-state · 2023-02-21T22:35:51Z

Thanks for this PR, I like the reorganization and changes overall!

…ave-pretrained

cat-state

thanks! this LGTM

jon-tow added 30 commits January 23, 2023 19:11

Adopt PreTrainedModelWrapper for Hugging Face models

c7e931d

Adopt PreTrainedModelWrapper for Hugging Face models

195bf01

Merge branch 'update-pre-commit' of https://github.com/jon-tow/trlx i…

dfe2b51

…nto update-pre-commit

Update documentation

55252e1

Merge branch 'update-save-pretrained' of https://github.com/jon-tow/trlx

79870bb

into update-save-pretrained

Fix up broken merge

41432ff

Run pre-commit

e7338f0

Revert dtype change to ILQLHead

056efcb

Fix isort

f7f5189

Format again...

3af2b73

Revert newline deletion

a9913aa

Revert unrelated changes and update docs

162b213

Update README.md saving example

d19f538

Revert unrelated changes

2e7fa31

Fix dtype access and hydra return_dict

f24d793

Force ref models into eval mode

840bfb2

Add unit tests for AutoModel...s

871b24e

Commit work on fixing T5Branch

e120c13

Merge branch 'main' of https://github.com/CarperAI/trlx into update-s…

c0d4792

…ave-pretrained

Merge branch 'main' of https://github.com/CarperAI/trlx into update-s…

c8f7127

…ave-pretrained

refactor(models): move models out of trainer dir

8811899

refactor(sft): remove save_pretrained override

7298764

Run pre-commit

2cf6966

Ignore line length for links

3d7c99b

Merge branch 'main' of https://github.com/CarperAI/trlx into update-s…

3809322

…ave-pretrained

Revert naming to base_model

7761b3d

Rename hydra models for clarity

68393b0

Add from_config support

db9bb93

cleanup docstrings

7a6b160

Revert T5 branch changes

91eb155

jon-tow marked this pull request as ready for review February 16, 2023 17:20

jon-tow marked this pull request as draft February 16, 2023 19:19

jon-tow marked this pull request as ready for review February 16, 2023 20:05

jon-tow marked this pull request as draft February 16, 2023 22:36

jon-tow marked this pull request as ready for review February 16, 2023 22:51

cat-state self-requested a review February 21, 2023 22:35

jon-tow added 2 commits February 21, 2023 22:56

Merge branch 'main' of https://github.com/CarperAI/trlx into update-s…

32800b2

…ave-pretrained

Remove variadic params

e4aff47

cat-state approved these changes Feb 22, 2023

View reviewed changes

jon-tow merged commit 715894a into CarperAI:main Feb 22, 2023

jon-tow deleted the update-save-pretrained branch February 22, 2023 20:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adopt `PreTrainedModelWrapper` for Hugging Face models #215

Adopt `PreTrainedModelWrapper` for Hugging Face models #215

jon-tow commented Jan 23, 2023 •

edited

cat-state commented Feb 21, 2023 •

edited

cat-state left a comment

Adopt PreTrainedModelWrapper for Hugging Face models #215

Adopt PreTrainedModelWrapper for Hugging Face models #215

Conversation

jon-tow commented Jan 23, 2023 • edited

cat-state commented Feb 21, 2023 • edited

cat-state left a comment

Choose a reason for hiding this comment

Adopt `PreTrainedModelWrapper` for Hugging Face models #215

Adopt `PreTrainedModelWrapper` for Hugging Face models #215

jon-tow commented Jan 23, 2023 •

edited

cat-state commented Feb 21, 2023 •

edited