Add Flash GPT2 #93

tgaddair · 2023-12-01T18:11:54Z

Implementation from: https://github.com/huggingface/transformers/pull/27479/files#diff-6ab7f0d7a9683ef8eb95c8b13bb0468d4f930ce15105a23238277bcb10c5a097


tgaddair · 2023-12-05T23:49:46Z

server/lorax_server/models/__init__.py

-        FlashLlama,
-    )
+    from lorax_server.models.flash_llama import FlashLlama
+    from lorax_server.models.flash_gpt2 import FlashGPT2


Add FlashGPT2 to __all__ below on line 69.
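
A minimal sketch of the pattern being asked for, assuming `__all__` is a plain list of export names and that the flash models are only exported when their import succeeds. The actual layout of `__init__.py` around line 69 is not shown in this diff, so the helper `export_if_available` and the initial contents of `__all__` are hypothetical and only illustrate the idea:

```python
import importlib


def export_if_available(module_name, class_name, exported):
    """Append class_name to `exported` only if it can be imported.

    Hypothetical helper for illustration; not part of lorax_server.
    Returns True when the name is importable, False otherwise.
    """
    try:
        module = importlib.import_module(module_name)
        getattr(module, class_name)
    except (ImportError, AttributeError):
        # Optional dependency (e.g. flash attention kernels) is unavailable.
        return False
    if class_name not in exported:
        exported.append(class_name)
    return True


if __name__ == "__main__":
    # Assumed starting point; the real __all__ contents are not in the diff.
    __all__ = ["Model"]
    export_if_available("lorax_server.models.flash_llama", "FlashLlama", __all__)
    export_if_available("lorax_server.models.flash_gpt2", "FlashGPT2", __all__)
    print(__all__)
```

The point of the comment is the same either way: importing `FlashGPT2` next to `FlashLlama` does not by itself add it to the module's exports, so it has to be listed wherever `FlashLlama` is listed.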

tgaddair · 2023-12-05T23:51:39Z

server/lorax_server/models/flash_gpt2.py

+            device = torch.device(f"cuda:{rank}")
+            dtype = torch.float16 if dtype is None else dtype
+        else:
+            raise NotImplementedError("FlashLlama is only available on GPU")


tgaddair and others added 8 commits December 1, 2023 09:15

WIP: gpt2

d19c09d

WIP flash attention

c4918ed

Forward

1750537

wip: make gpt2 queryable; TODO: make it match generation of vanilla HF

a979542

Merge branch 'gpt2' of github.com:predibase/lorax into gpt2

75f9a26

merge

210cef6

remove rope

1f4606d

remove cross-attention; match transformers generate output with lorax output

f1979bb

tgaddair marked this pull request as ready for review December 5, 2023 23:47

Merge branch 'main' into gpt2

0b3676b

tgaddair commented Dec 5, 2023


tgaddair merged commit 561d03f into main Dec 5, 2023
1 check passed

tgaddair deleted the gpt2 branch December 5, 2023 23:53

tgaddair mentioned this pull request Dec 7, 2023

Does lorax currently support GPT2 finetuned adapters? #84

Open

4 tasks

tgaddair mentioned this pull request Jan 26, 2024

Support self-trained model #208

Open

2 tasks
