Support llama3.1-8b generation #947

Gasoonjia · 2024-07-24T00:13:33Z

Llama 3.1 8B is now supported in torchchat! 🎉

Local test:

pytorch-bot · 2024-07-24T00:13:35Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/947

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 1f309dd with merge base 7b4fa7c:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

malfet · 2024-07-24T00:54:59Z

build/model.py

+    high_freq_wavelen = old_context_len / high_freq_factor
+    new_freqs = []
+    for freq in freqs:
+        wavelen = 2 * math.pi / freq


Suggested change

-        wavelen = 2 * math.pi / freq
+        wavelen = 2 * torch.pi / freq

malfet · 2024-07-24T00:55:18Z

build/model.py

+import math
+


Use pi from torch rather than math, so the `math` import can be dropped.

Suggested change

-import math

kartikayk · 2024-07-24T01:02:59Z

config/data/models.json

@@ -40,6 +40,12 @@
        "distribution_path": "meta-llama/Meta-Llama-3-70B-Instruct",
        "transformer_params_key": "Meta-Llama-3-70B"
    },
+    "meta-llama/Meta-Llama-3.1-8B-Instruct": {


nit: technically you could also add support for the pre-trained model (meta-llama/Meta-Llama-3.1-8B) and Llama Guard (meta-llama/Llama-Guard-3-8B). Not a requirement, though.

orionr · 2024-07-24

We probably want support for the pre-trained (non-Instruct) version, to match our support for Llama 3.0?

Gasoonjia · 2024-07-24

Oh yeah, of course. This PR focuses on enabling Llama 3.1 in torchchat, so it doesn't cover every possible model. I'll open another PR to handle that.

malfet · 2024-07-24T01:12:03Z

build/model.py

+
+    low_freq_wavelen = old_context_len / low_freq_factor
+    high_freq_wavelen = old_context_len / high_freq_factor
+    new_freqs = []


Hmm, it feels like one can write this logic in a much more pytorchy-style:

    new_freqs = 360 / freq.rad2deg()
    new_freqs[new_freqs > high_freq_wavelen ] /= scale_factor
    new_freqs[new_freqs < low_freq_wavelen * scale_factor] = apply_smooth_here
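For readers following the suggestion above, here is a minimal sketch of what a vectorized version of the frequency-scaling loop could look like. It is not the code merged in this PR. The function name `apply_scaling_vectorized` is hypothetical, and the constant values (`SCALE_FACTOR = 8`, `LOW_FREQ_FACTOR = 1`, `HIGH_FREQ_FACTOR = 4`, `OLD_CONTEXT_LEN = 8192`) are assumptions taken from the commonly published Llama 3.1 RoPE settings, since the diff excerpts here do not show them. It uses `torch.where` rather than in-place masked assignment, which is a design choice of this sketch.

```python
import torch

# Assumed constants: the PR excerpts name these variables but do not show values.
SCALE_FACTOR = 8.0
LOW_FREQ_FACTOR = 1.0
HIGH_FREQ_FACTOR = 4.0
OLD_CONTEXT_LEN = 8192.0


def apply_scaling_vectorized(freqs: torch.Tensor) -> torch.Tensor:
    """Scale RoPE frequencies without a Python loop (hypothetical helper)."""
    low_freq_wavelen = OLD_CONTEXT_LEN / LOW_FREQ_FACTOR
    high_freq_wavelen = OLD_CONTEXT_LEN / HIGH_FREQ_FACTOR

    # Wavelength of every frequency, computed in one shot.
    wavelen = 2 * torch.pi / freqs

    # Interpolation weight for the middle band (0 at the low-frequency edge,
    # 1 at the high-frequency edge).
    smooth = (OLD_CONTEXT_LEN / wavelen - LOW_FREQ_FACTOR) / (
        HIGH_FREQ_FACTOR - LOW_FREQ_FACTOR
    )
    blended = (1 - smooth) * freqs / SCALE_FACTOR + smooth * freqs

    # Long wavelengths are divided by the scale factor, the middle band is blended.
    scaled = torch.where(wavelen > low_freq_wavelen, freqs / SCALE_FACTOR, blended)
    # Short wavelengths (high frequencies) are left unchanged.
    return torch.where(wavelen < high_freq_wavelen, freqs, scaled)
```

Because `torch.where` evaluates all three candidate results for every element, this trades a little extra arithmetic for the removal of the per-element Python loop and the list-to-tensor conversion.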

byjlw · 2024-07-24T03:21:18Z

Awesome! Please update the readme so people can discover it :)

Gasoonjia added 2 commits July 23, 2024 17:04

add llama 3.1 8b support

416c4f4

Merge branch 'main' of https://github.com/pytorch/torchchat

abb8e95

facebook-github-bot added the CLA Signed label Jul 24, 2024

Gasoonjia changed the title ~~Support llama3.1 generation~~ Support llama3.1-8b generation Jul 24, 2024

Gasoonjia assigned malfet, Jack-Khuu and kartikayk and unassigned malfet, Jack-Khuu and kartikayk Jul 24, 2024

Gasoonjia requested review from malfet, Jack-Khuu and kartikayk July 24, 2024 00:22

Jack-Khuu approved these changes Jul 24, 2024

malfet reviewed Jul 24, 2024

kartikayk reviewed Jul 24, 2024

malfet approved these changes Jul 24, 2024

malfet reviewed Jul 24, 2024

Gasoonjia and others added 2 commits July 24, 2024 11:20

replace math.pi with torch.pi

070ea67

add 3.1 8b base and 70b

1f309dd

Gasoonjia merged commit 3e28e5d into main Jul 24, 2024
51 checks passed

Jack-Khuu mentioned this pull request Aug 19, 2024

How to deploy a new model by torchchat? #1038

Open
