
update doc for llama3 #1462

Merged: 2 commits merged into InternLM:main on Apr 19, 2024
Conversation

@zhyncs (Contributor) commented Apr 19, 2024

Motivation

Update the documentation for Llama 3 (ref #1459).

Modification

As titled.

Checklist

  1. Pre-commit or other linting tools are used to fix the potential lint issues.
  2. The modification is covered by complete unit tests. If not, please add more unit tests to ensure the correctness.
  3. If the modification has a dependency on downstream projects of a newer version, this PR should be tested with all supported versions of downstream projects.
  4. The documentation has been modified accordingly, like docstring or example tutorials.

@lvhan028 lvhan028 requested a review from AllentDan April 19, 2024 05:25
@AllentDan (Collaborator) left a comment:

LGTM

@lvhan028 lvhan028 added the documentation Improvements or additions to documentation label Apr 19, 2024
@lvhan028 lvhan028 merged commit a02ed41 into InternLM:main Apr 19, 2024
4 checks passed
@zhyncs zhyncs deleted the patch-4 branch April 19, 2024 05:49
@zhyncs (Contributor, Author) commented Apr 19, 2024

Compatibility with the main branch was verified; all of the following configurations work.

# TurboMind
python3 -m lmdeploy serve api_server /workdir/Meta-Llama-3-8B

# PyTorch
python3 -m lmdeploy serve api_server /workdir/Meta-Llama-3-8B --backend pytorch

# KV Cache Int8
python3 -m lmdeploy serve api_server /workdir/Meta-Llama-3-8B --quant-policy 8

# KV Cache Int4
python3 -m lmdeploy serve api_server /workdir/Meta-Llama-3-8B --quant-policy 4

# AWQ
python3 -m lmdeploy lite auto_awq /workdir/Meta-Llama-3-8B --calib-dataset 'ptb' --calib-samples 128 --calib-seqlen 2048 --w-bits 4 --w-group-size 128 --work-dir /workdir/Meta-Llama-3-8B-AWQ
python3 -m lmdeploy serve api_server /workdir/Meta-Llama-3-8B-AWQ

# Stress test (RESTful API benchmark)
python3 benchmark/profile_restful_api.py --server_addr 127.0.0.1:23333 --tokenizer_path /workdir/Meta-Llama-3-8B --dataset /workdir/ShareGPT_V3_unfiltered_cleaned_split.json --concurrency 128 --num_prompts 1000
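Once the api_server is up, it can be smoke-tested with a request to its OpenAI-compatible endpoint. The sketch below only builds and locally validates a request payload; the actual curl call is left commented out because it needs a running server. The port 23333 and the /v1/chat/completions route are the lmdeploy defaults, and the payload fields are illustrative, not taken from this PR.

```shell
# Hypothetical smoke test for an api_server started as above.
# Build a chat-completions request payload.
cat > /tmp/llama3_req.json <<'EOF'
{
  "model": "/workdir/Meta-Llama-3-8B",
  "messages": [{"role": "user", "content": "Hello"}],
  "max_tokens": 32
}
EOF

# Sanity-check that the payload is valid JSON before sending it.
python3 -m json.tool /tmp/llama3_req.json

# With the server running on the default port, send the request:
# curl http://127.0.0.1:23333/v1/chat/completions \
#   -H 'Content-Type: application/json' \
#   -d @/tmp/llama3_req.json
```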
