
Lightweight HF integration #220

Merged: 28 commits from hf-integration into main on Jun 26, 2023
Conversation

@AkshitaB (Contributor) commented Jun 23, 2023

  • Creates a config.json file at the checkpoint location, which allows OLMo models to be loaded as HF models (see the conceptual sketch after this list).

python hf_olmo/add_hf_config_to_olmo_checkpoint.py --checkpoint-dir <olmo-checkpoint-location>

  • The model, config, and tokenizer classes are registered with the relevant HF auto classes. Importing them allows models to be loaded as HF-compatible models.
from hf_olmo import *
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained(<olmo-checkpoint-location>)
tokenizer = AutoTokenizer.from_pretrained(<olmo-checkpoint-location>)
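
Conceptually, the conversion script in the first bullet wraps the checkpoint's native config in the registered HF config class and saves it as config.json. A rough sketch (illustrative only; the real logic lives in hf_olmo/add_hf_config_to_olmo_checkpoint.py, and the config.yaml filename and "model" key here are assumptions):

import os

import yaml

from hf_olmo.modeling_olmo import OLMoConfig


def write_hf_config(checkpoint_dir: str) -> None:
    # Read the native OLMo training config shipped with the checkpoint
    # (filename assumed for illustration).
    with open(os.path.join(checkpoint_dir, "config.yaml")) as f:
        olmo_config = yaml.safe_load(f)
    # save_pretrained writes config.json, which lets the HF auto classes
    # resolve the model type from the checkpoint directory.
    OLMoConfig(**olmo_config["model"]).save_pretrained(checkpoint_dir)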

Using an HF pipeline works:

from hf_olmo.modeling_olmo import OLMoConfig, OLMoForCausalLM  # noqa: F401
from transformers import AutoModelForCausalLM, AutoTokenizer, TextGenerationPipeline

model = AutoModelForCausalLM.from_pretrained(model_path)  # model_path: OLMo checkpoint directory
tokenizer = AutoTokenizer.from_pretrained(model_path)
pipeline = TextGenerationPipeline(model=model, tokenizer=tokenizer)
output = pipeline("question: who wrote romeo and juliet? answer: ", max_new_tokens=30)
# [{'generated_text': 'question: who wrote romeo and juliet? answer: romeo and juliet is a play by william shakespeare. the play was first performed in 1605. the play is set in the city'}]

Instruct-eval tasks also work (adding from hf_olmo import * to run_eval.py should suffice); tested with mmlu and bbh.

To Do:

  • Test the bbh instruct-eval task. @OyvindTafjord pointed out that the bbh tasks from instruct-eval may be a more comprehensive test, since they use more of the HF API.
  • Add implementations for the AutoModelForCausalLM-specific methods, so that HF's .generate() works (a sketch of the typical hook follows this list).
  • Test with deepspeed code from @hamishivi.
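
For context on the second item, HF's GenerationMixin mainly needs the model class to expose prepare_inputs_for_generation. An illustrative sketch of that hook (simplified; not the exact code added in this PR):

from transformers import PreTrainedModel


class OLMoForCausalLM(PreTrainedModel):  # simplified stand-in for the real class
    def prepare_inputs_for_generation(self, input_ids, past_key_values=None, **kwargs):
        # With a KV cache from an earlier step, only the newest token
        # needs to be fed through the model again.
        if past_key_values is not None:
            input_ids = input_ids[:, -1:]
        return {"input_ids": input_ids, "past_key_values": past_key_values}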

@AkshitaB requested a review from @epwalsh on June 23, 2023 17:24
@epwalsh (Member) left a comment


Overall looks good. I'm curious what your thoughts are about how we'll ultimately make this integration available when we release the code? I was thinking we'd make this a submodule of olmo that people could install as an extra, like pip install olmo[hf] or something. But keeping it a separate package is probably fine too. Maybe we want to call it something more specific like olmo_hf though?

past_key_values=outputs.attn_key_values,
)

def generate(self, input_ids, *args, **kwargs):
@epwalsh (Member) commented:

Do we need to implement this, or can we get this for free using HF's built-in generate functionality?

@AkshitaB (Contributor, Author) replied:

We can now; it just needed a couple of small methods implemented.
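
Once those methods are in place, generation goes through the standard HF API. A minimal sketch (model_path stands in for a local OLMo checkpoint directory):

from hf_olmo import *  # noqa: F403 - registers OLMo with the HF auto classes
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained(model_path)
tokenizer = AutoTokenizer.from_pretrained(model_path)

inputs = tokenizer("question: who wrote romeo and juliet? answer: ", return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=30)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))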

@AkshitaB (Contributor, Author) commented:

> Maybe we want to call it something more specific like olmo_hf though?

Makes sense. I've renamed it.

@AkshitaB requested a review from @epwalsh on June 26, 2023 17:24
@epwalsh (Member) left a comment


Just some comments about CI, otherwise looks good

.github/actions/setup-venv/action.yml: review comments resolved (one marked outdated)
@@ -152,7 +152,7 @@ jobs:
value: ":16:8"
- name: TOKENIZERS_PARALLELISM
value: "false"
command: ["/entrypoint.sh", "pytest", "-v", "-m", "gpu", "tests/"]
command: ["/entrypoint.sh", "pytest", "-v", "-m", "gpu", "tests/", "-k", "not hf_olmo"]
@epwalsh (Member) commented:

Should we add another job to run the HF tests?

@AkshitaB (Contributor, Author) replied:

Discussed offline: since this requires updating the Beaker image on which we run the GPU tests, and since we expect to reconfigure this at some point, it's not worth the effort right now.

I've confirmed using instruct-eval that the HF integration runs on GPU.
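
For reference, the tests deselected in CI by the -k "not hf_olmo" filter can still be run manually by inverting it:

pytest -v -m gpu tests/ -k "hf_olmo"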

@AkshitaB merged commit acf372e into main on Jun 26, 2023
10 checks passed
@AkshitaB deleted the hf-integration branch on June 26, 2023 18:42