This repository was archived by the owner on Sep 10, 2025. It is now read-only.

Conversation

@Gasoonjia (Contributor) commented on Sep 26, 2024:

This PR enables the llama3.2-11B model with text-only input.
Note that this PR covers only the CLI pipeline; a separate PR will follow for the OpenAI API update.

(torchchat-test) [gasoonjia@server ~/torchchat-32mm (main|REBASE-i|main)]$ python torchchat.py generate llama3.2-11B --prompt "How are you these days"
Using device=cuda NVIDIA PG509-210
Loading model...
Time to load model: 8.88 seconds
-----------------------------------------------------------
How are you these daysI'm just a computer program, so I don't have feelings or emotions like humans do, but thanks for asking! I'm functioning properly and ready to assist with any questions or tasks you may have. How about you? How's your day going?
========================================


Average tokens/sec (total): 11.66
Average tokens/sec (first token): 1.82
Average tokens/sec (next tokens): 13.04

@pytorch-bot (bot) commented on Sep 26, 2024:

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1216

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 1 Cancelled Job

As of commit c0c8033 with merge base ec7b510:

NEW FAILURE - The following job has failed:

  • pull / runner-et (macos-14-xlarge) (gh)
    RuntimeError: module was compiled against NumPy C-API version 0x10 (NumPy 1.23) but the running NumPy has C-API version 0xe. Check the section C-API incompatibility at the Troubleshooting ImportError section at https://numpy.org/devdocs/user/troubleshooting-importerror.html#c-api-incompatibility for indications on how to solve this problem.

CANCELLED JOB - The following job was cancelled. Please retry:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label (managed by the Meta Open Source bot) on Sep 26, 2024.
is_multimodal = False

- seq_len = tokens.size(1)
+ seq_len = x.size(1)
Contributor commented:

What's the difference between tokens and x?

Contributor Author (@Gasoonjia) replied:

No difference; both refer to the text input.
Previously we pulled the text input out of the batch inside prefill and called it tokens.
Now we pull the text input out first, then pass it into the prefill function and call it x.
We renamed it because other single-modality models usually use x to represent the text token inputs.
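
For illustration, here is a minimal sketch of the refactor described above; the function and argument names are assumptions based on this thread, not the exact torchchat signatures.

# Before: prefill pulled the text tokens out of the batch itself.
def prefill_old(model, batch):
    tokens = batch["tokens"]  # text input extracted inside prefill
    seq_len = tokens.size(1)
    return model(tokens), seq_len

# After: the caller extracts the text input and passes it in as x,
# matching the x convention other single-modality models use.
def prefill_new(model, x):
    seq_len = x.size(1)
    return model(x), seq_len

# Caller side (sketch):
#   x = batch["tokens"]
#   logits, seq_len = prefill_new(model, x)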

@joecummings (Member) commented:

You can also check for multimodal content like this so you don't have to check the model type explicitly:

https://github.com/pytorch/torchtune/blob/7da96d18dbddbfcf77e045fe149b1efb6866681d/recipes/dev/generate_v2.py#L123
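
As a rough sketch of that approach (the "encoder_input" key below is an assumption for illustration, not necessarily the exact torchtune/torchchat field name), the check inspects the batch contents rather than the model type:

def batch_is_multimodal(batch: dict) -> bool:
    # Treat the input as multimodal if it carries image/encoder content,
    # regardless of which model class is being run.
    return batch.get("encoder_input") is not None

# Usage (sketch):
#   is_multimodal = batch_is_multimodal(batch)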

@Gasoonjia (Contributor, Author) replied:

@joecummings thanks for your suggestions!
Just curious: how do you handle a mismatch between the input and the model's requirements, e.g., when a user passes an image input to a single-modality (text-only) model?

@Jack-Khuu (Contributor) commented:

The failing test is not relevant. Pushing this through.

@Jack-Khuu merged commit e4b36f9 into main on Sep 27, 2024; 49 of 51 checks passed.