Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

improve the benchmark test of vineyard llm kv cache #1816

Merged
merged 1 commit into from
May 21, 2024

Conversation

dashanji
Copy link
Member

@dashanji dashanji commented Mar 12, 2024

What do these changes do?

After the benchmark test, we can get the following result.

Token list size is 17792Total Update time is 2.22029s Total Query time is 0.646123s Average update time is 8013.38token/s Average query time is 27536.5token/s

The query time including (query kv tensor ptr from vineyard) + (memcpy from the kv tensor ptr to users' buffer)

@dashanji dashanji changed the title improve the benchmark test improve the benchmark test of vineyard llm kv cache Mar 12, 2024
@dashanji dashanji changed the title improve the benchmark test of vineyard llm kv cache [wip] improve the benchmark test of vineyard llm kv cache Mar 12, 2024
@dashanji dashanji force-pushed the fix-kv-cache-benchmark branch 3 times, most recently from d6b0bc8 to b935def Compare March 20, 2024 03:30
@dashanji dashanji changed the title [wip] improve the benchmark test of vineyard llm kv cache improve the benchmark test of vineyard llm kv cache Mar 20, 2024
@github-actions github-actions bot added the stale label Apr 12, 2024
Copy link
Contributor

/cc @sighingnow, this issus/pr has had no activity for a long time, please help to review the status and assign people to work on it.

Signed-off-by: Ye Cao <caoye.cao@alibaba-inc.com>
@sighingnow sighingnow merged commit c8de264 into v6d-io:main May 21, 2024
5 of 6 checks passed
@sighingnow sighingnow deleted the fix-kv-cache-benchmark branch May 21, 2024 02:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants