Fix eval #5955
Conversation
Dr. CI: ✅ No failures as of commit 4158d4c with merge base 36a5bc6. See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/5955. (Links to docs will display an error until the docs builds have completed. This comment updates every 15 minutes.)
This pull request was exported from Phabricator. Differential Revision: D62198560
Summary: This PR fixes a number of issues in the eval pipeline:

- Use the right token for `eot_token_id`.
- Do not add `bos` and `eos` during `tok_encode`, based on this [discussion](https://fburl.com/code/uifmt746).
- Update `executorch/examples/models/llama2/tokenizer/tiktoken.py` to be in sync with llama 3.1's official [code](https://github.com/meta-llama/llama-models/blob/main/models/llama3/api/tokenizer.py), mainly updating the set of special tokens.
- Update `--limit`'s default value to None, per `lm_eval`'s [doc](https://github.com/EleutherAI/lm-evaluation-harness/blob/main/docs/interface.md); it should only be used during testing.
- Update `--dtype-override`'s default value to None so that we don't override the model's dtype by default.

For context, we observed a gap between ExecuTorch's eval result and SpinQuant's eval result: 19 vs 14. After these fixes, ExecuTorch's eval result is closer to SpinQuant's.

Differential Revision: D62198560
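The first two fixes and the new CLI defaults can be sketched as follows. This is a hypothetical illustration, not the actual ExecuTorch code: `TokenizerStub`, `EagerEvalWrapper`, and `build_arg_parser` are illustrative names, and the toy `encode` stands in for real BPE encoding. It shows the shape of the changes: `eot_token_id` reports the tokenizer's end-of-turn token, `tok_encode` no longer wraps text in `bos`/`eos`, and `--limit`/`--dtype-override` default to None.

```python
import argparse


class TokenizerStub:
    """Minimal stand-in for a llama-3-style tokenizer (toy encoding)."""

    def __init__(self):
        self.special_tokens = {"<|begin_of_text|>": 128000, "<|eot_id|>": 128009}
        self.bos_id = self.special_tokens["<|begin_of_text|>"]
        self.eot_id = self.special_tokens["<|eot_id|>"]

    def encode(self, text, bos=False, eos=False):
        ids = [ord(c) for c in text]  # toy per-character "encoding" for the sketch
        return ([self.bos_id] if bos else []) + ids + ([self.eot_id] if eos else [])


class EagerEvalWrapper:
    """lm_eval-style model wrapper (hypothetical)."""

    def __init__(self, tokenizer):
        self._tokenizer = tokenizer

    @property
    def eot_token_id(self):
        # Fix 1: report the end-of-turn token id rather than a wrong/hard-coded id.
        return self._tokenizer.eot_id

    def tok_encode(self, string, **kwargs):
        # Fix 2: lm_eval passes raw continuations, so do not add bos/eos here.
        return self._tokenizer.encode(string, bos=False, eos=False)


def build_arg_parser():
    parser = argparse.ArgumentParser()
    # Fix 4: default to None so the full task runs; set --limit only for quick tests.
    parser.add_argument("--limit", type=int, default=None)
    # Fix 5: default to None so the model's own dtype is preserved.
    parser.add_argument("--dtype-override", type=str, default=None)
    return parser
```

With these defaults, running the eval harness without flags evaluates the full task in the model's native dtype, matching `lm_eval`'s documented behavior.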
Force-pushed f105c39 to bf05031.
Force-pushed bf05031 to 004045b.
Force-pushed 004045b to 1ecad67.
Summary: This PR fixes a number of issues in the eval pipeline:

- Use the right token for `eot_token_id`.
- Do not add `bos` and `eos` during `tok_encode`, based on this [discussion](https://fburl.com/code/uifmt746).
- Update `executorch/examples/models/llama2/tokenizer/tiktoken.py` to be in sync with llama 3.1's official [code](https://github.com/meta-llama/llama-models/blob/main/models/llama3/api/tokenizer.py), mainly updating the set of special tokens.

Reviewed By: mergennachin

Differential Revision: D62198560
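The tiktoken sync mainly concerns how the special-token set is built. The sketch below is based on the linked llama-models tokenizer; treat the exact token names and counts as an assumption to be checked against the official file. The key idea is that special tokens occupy the id range directly after the base BPE vocabulary, so `<|eot_id|>` lands at a fixed, predictable id.

```python
# Sketch of llama 3.1's special-token layout (names per the linked
# llama-models tokenizer.py; verify against the official source).
num_reserved_special_tokens = 256
num_base_tokens = 128000  # size of the base BPE vocabulary

special_tokens = [
    "<|begin_of_text|>",
    "<|end_of_text|>",
    "<|reserved_special_token_0|>",
    "<|reserved_special_token_1|>",
    "<|finetune_right_pad_id|>",
    "<|step_id|>",
    "<|start_header_id|>",
    "<|end_header_id|>",
    "<|eom_id|>",  # end of message
    "<|eot_id|>",  # end of turn
    "<|python_tag|>",
] + [
    # Remaining slots are reserved placeholders, numbered from 2.
    f"<|reserved_special_token_{i}|>"
    for i in range(2, num_reserved_special_tokens - 9)
]

# Special tokens are assigned ids immediately after the base vocabulary.
special_token_ids = {tok: num_base_tokens + i for i, tok in enumerate(special_tokens)}
```

Under this layout `<|begin_of_text|>` is id 128000 and `<|eot_id|>` is id 128009, which is why getting `eot_token_id` right in the eval wrapper depends on the tokenizer being in sync with this list.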
Force-pushed 1ecad67 to 4158d4c.
This pull request was exported from Phabricator. Differential Revision: D62198560
This pull request has been merged in 2027a14.