accelerate pytorch benchmark #946

grimoire · 2024-01-14T09:10:12Z

--------------------------------------------------
concurrency: 256
elapsed_time: 295.300s

first token latency(s)(min, max, ave): 0.623, 12.899, 4.401
per-token latency(s) percentile(50, 75, 95, 99): [0, 0, 0.708, 0.979]

number of prompt tokens: 740656
number of completion tokens: 695134
token throughput (completion token): 2353.993 token/s
token throughput (prompt + completion token): 4862.142 token/s
RPS (request per second): 10.159 req/s
RPM (request per minute): 609.550 req/min
--------------------------------------------------
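As a sanity check, the throughput figures in the report above can be recomputed from the token counts and the elapsed time. The sketch below is not part of the benchmark script; the variable names are illustrative, and the request count (3000) is an assumption inferred from RPS × elapsed time, since the report does not print it.

```python
# Figures copied from the benchmark report above.
elapsed_s = 295.300
prompt_tokens = 740656
completion_tokens = 695134

# ASSUMPTION: the request count is not printed in the report.
# 3000 is inferred from RPS * elapsed_time (10.159 * 295.3 is about 3000).
num_requests = 3000

# Throughput is tokens (or requests) divided by wall-clock time.
completion_tps = completion_tokens / elapsed_s
total_tps = (prompt_tokens + completion_tokens) / elapsed_s
rps = num_requests / elapsed_s
rpm = rps * 60

print(f"token throughput (completion token): {completion_tps:.3f} token/s")
print(f"token throughput (prompt + completion token): {total_tps:.3f} token/s")
print(f"RPS: {rps:.3f} req/s")
print(f"RPM: {rpm:.3f} req/min")
```

The recomputed values should agree with the reported ones up to small differences in the last digit, because the elapsed time in the report is itself rounded to three decimals.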

AllentDan · 2024-01-15T02:41:08Z

Consider updating benchmark/profile_generation.py as well. I heard the two scripts will be merged into one.

grimoire added 3 commits January 14, 2024 17:08

accelerate benchmark

11eb7de

fix profile

8cc55b6

remove interval

a0ec878

lvhan028 requested a review from zhulinJulia24 January 14, 2024 10:47

lvhan028 added the improvement label Jan 14, 2024

lvhan028 requested a review from AllentDan January 14, 2024 12:29

merge main

7e00d35

AllentDan approved these changes Jan 15, 2024

lvhan028 approved these changes Jan 15, 2024

lvhan028 merged commit 40da677 into InternLM:main Jan 15, 2024
3 of 5 checks passed

grimoire had a problem deploying to prod February 14, 2024 02:17 — with GitHub Actions Failure
