Improve benchmark tput by moving prompt preparation outside of loop #54

gracehonv · 2024-06-12T22:59:31Z

Moved the prompt making call to randomly_sample_sonnet_lines_prompt outside of load request send loop so that the send loop can generate load to the server faster. Otherwise there's an artificial delay due to making the next prompt which slows down the benchmark throughput/sec.
Also changed tokenizer instantiation to just once outside the prompt generation loop to speed up the overall test.
After this change I've seen up to 2x improvement in server achieved throughput in some small workloads. This change will allow better measurement of true server throughput.

gracehonv · 2024-06-18T01:12:57Z

@avnishn or @rickyyx would it be possible to get this PR reviewed? Thank you!

rickyyx

Thank you!!

improve benchmark tput by moving prompt preparation outside of loop

0f25505

rickyyx approved these changes Jun 19, 2024

View reviewed changes

rickyyx merged commit 03872a4 into ray-project:main Jun 19, 2024

gracehonv deleted the grace_nv/loadgen branch June 19, 2024 20:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve benchmark tput by moving prompt preparation outside of loop #54

Improve benchmark tput by moving prompt preparation outside of loop #54

Uh oh!

gracehonv commented Jun 12, 2024

Uh oh!

gracehonv commented Jun 18, 2024

Uh oh!

rickyyx left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Improve benchmark tput by moving prompt preparation outside of loop #54

Improve benchmark tput by moving prompt preparation outside of loop #54

Uh oh!

Conversation

gracehonv commented Jun 12, 2024

Uh oh!

gracehonv commented Jun 18, 2024

Uh oh!

rickyyx left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants