Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

accelerate pytorch benchmark #946

Merged
merged 4 commits into from
Jan 15, 2024
Merged

Conversation

grimoire
Copy link
Collaborator

@grimoire grimoire commented Jan 14, 2024

--------------------------------------------------
concurrency: 256
elapsed_time: 295.300s

first token latency(s)(min, max, ave): 0.623, 12.899, 4.401
per-token latency(s) percentile(50, 75, 95, 99): [0, 0, 0.708, 0.979]

number of prompt tokens: 740656
number of completion tokens: 695134
token throughput (completion token): 2353.993 token/s
token throughput (prompt + completion token): 4862.142 token/s
RPS (request per second): 10.159 req/s
RPM (request per minute): 609.550 req/min
--------------------------------------------------

@AllentDan
Copy link
Collaborator

May update benchmark/profile_generation.py as well. I heard the two scripts will be merged into one.

@lvhan028 lvhan028 merged commit 40da677 into InternLM:main Jan 15, 2024
3 of 5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants