
Added Phi #132

Merged
tgaddair merged 12 commits into main from phi on Dec 15, 2023

Conversation

@tgaddair (Contributor) commented Dec 15, 2023:

Example:

lorax-launcher --model-id microsoft/phi-2

Prompt:

curl 127.0.0.1:8080/generate \
    -X POST \
    -d '{"inputs": "Instruct: Write a detailed analogy between mathematics and a lighthouse.\nOutput:", "parameters": {"max_new_tokens": 64}}' \
    -H 'Content-Type: application/json'

Response:

Mathematics is like a lighthouse. Just as a lighthouse guides ships safely to shore, mathematics provides a guiding light in the world of numbers and logic. It helps us navigate through complex problems and find solutions. Just as a lighthouse emits a steady beam of light, mathematics provides a consistent framework for reasoning and problem-solving.
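
For illustration, the same request can be sent from Python. This is a minimal sketch using the requests library; the payload mirrors the curl call above, and the generated_text field follows the usual text-generation-inference response schema:

import requests

# Same /generate request as the curl example above, sent from Python.
resp = requests.post(
    "http://127.0.0.1:8080/generate",
    json={
        "inputs": "Instruct: Write a detailed analogy between mathematics and a lighthouse.\nOutput:",
        "parameters": {"max_new_tokens": 64},
    },
)
print(resp.json()["generated_text"])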

@@ -325,6 +328,19 @@ def get_model(
trust_remote_code=trust_remote_code,
)
raise NotImplementedError("Qwen model requires flash attention v2")

if model_type == "phi-msft" or model_type == "phi":
Collaborator:
What do you think about using model_type in [] instead?

Contributor Author:
Done.
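
For illustration, the membership-style check suggested above might look like this (a minimal sketch; PHI_MODEL_TYPES is a hypothetical name, not an identifier from this PR):

# Hypothetical membership check, equivalent to the two equality tests above.
PHI_MODEL_TYPES = ["phi-msft", "phi"]

if model_type in PHI_MODEL_TYPES:
    ...  # dispatch to the Phi model implementation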

super().__init__()
self.num_heads = config.n_head
self.hidden_size = config.n_embd
self.head_size = self.hidden_size // self.num_heads
Contributor:
Is it intentional to use self.num_heads like this here, but then update self.num_heads by the number of process groups on line 110?

Contributor Author:
Yes, when creating the modules we need the pre-split num_heads value; later on we use num_heads post-split.

Contributor Author:
Added comment.
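
For context, a minimal sketch of the pre-/post-split pattern described above, as it commonly appears in tensor-parallel attention layers (the class name and process_group are assumptions, not the exact code from this PR):

# Sketch of the pre-/post-split num_heads pattern (assumed names).
class PhiAttentionSketch:
    def __init__(self, config, process_group):
        self.num_heads = config.n_head  # pre-split: full head count
        self.hidden_size = config.n_embd
        self.head_size = self.hidden_size // self.num_heads
        # ... projection layers are sharded across process_group here,
        # sized with the full head count ...
        # post-split: each rank keeps only its shard of the heads
        self.num_heads = self.num_heads // process_group.size()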

revision=revision,
padding_side="left",
truncation_side="left",
trust_remote_code=trust_remote_code,
Contributor:
Since we're adding trust_remote_code, do we already install the einops library to convert weights when using microsoft/phi-1.5?

Contributor Author:
trust_remote_code is False by default, and einops is already installed (for Falcon). The current implementation works with the Microsoft version of the weights, rather than the modified version HF made.
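
For illustration, a sketch of the tokenizer initialization the diff hunk above belongs to. AutoTokenizer.from_pretrained is the standard transformers API; model_id and revision are assumed to come from the surrounding get_model arguments:

from transformers import AutoTokenizer

model_id = "microsoft/phi-2"  # as in the launch example above
revision = None
trust_remote_code = False     # the default, per the reply above

# Tokenizer load with left-side padding/truncation, as in the diff above.
tokenizer = AutoTokenizer.from_pretrained(
    model_id,
    revision=revision,
    padding_side="left",
    truncation_side="left",
    trust_remote_code=trust_remote_code,
)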

@tgaddair merged commit 549bbb8 into main on Dec 15, 2023 (1 check passed).
@tgaddair deleted the phi branch on December 15, 2023 at 17:29.
@tgaddair mentioned this pull request on Jan 26, 2024.