[HF] Add input embedding argument to HF model #442

2015aroras · 2024-02-09T22:46:44Z

The trainers in HF seem to expect models to accept input embeddings directly as an alternative to input ids. This expectation caused this HF issue.

This PR adds an input embeddings argument to OLMo, and bypasses the input embedding layer when input embeddings are provided.

2015aroras · 2024-02-09T22:49:00Z

olmo/model.py

@@ -1137,6 +1137,7 @@ def get_alibi_attention_bias(self, seq_len: int, device: torch.device) -> torch.
    def forward(
        self,
        input_ids: torch.LongTensor,


Ideally I would make input_ids optional, but this would not be a fully backwards compatible change.

epwalsh

LGTM. Can you add an entry to the CHANGELOG?

2015aroras added 2 commits February 9, 2024 14:41

Pass input embeddings from HF OLMo to inner model forward

dc1545d

Use input embeddings instead of input ids when provided

75e1476

2015aroras requested review from AkshitaB and epwalsh February 9, 2024 22:46

Merge branch 'main' into shanea/add-input-embedding-arg

ad2f3c0

2015aroras commented Feb 9, 2024

View reviewed changes

2015aroras added 2 commits February 9, 2024 14:50

Run Ruff

3d53758

Update Changelog

faccd94

epwalsh approved these changes Feb 9, 2024

View reviewed changes

2015aroras merged commit 97296e6 into main Feb 9, 2024
11 checks passed

2015aroras deleted the shanea/add-input-embedding-arg branch February 9, 2024 23:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[HF] Add input embedding argument to HF model #442

[HF] Add input embedding argument to HF model #442

2015aroras commented Feb 9, 2024

2015aroras Feb 9, 2024

epwalsh left a comment

[HF] Add input embedding argument to HF model #442

[HF] Add input embedding argument to HF model #442

Conversation

2015aroras commented Feb 9, 2024

2015aroras Feb 9, 2024

Choose a reason for hiding this comment

epwalsh left a comment

Choose a reason for hiding this comment