Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Serving][Benchmark] Add benchmark code for serving #69

Closed
wants to merge 43 commits into from
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
43 commits
Select commit Hold shift + click to select a range
2f4dc20
add benchmarks
xwu99 Jan 19, 2024
d1a048d
update
xwu99 Jan 19, 2024
da03432
update
xwu99 Jan 19, 2024
5a4a28c
update
xwu99 Jan 22, 2024
06b5ea0
update
xwu99 Jan 22, 2024
765da83
update
xwu99 Jan 22, 2024
d534f76
update
xwu99 Jan 22, 2024
eab0425
move README to doc/benchmark.md
xwu99 Jan 23, 2024
cf4f071
fix error & add progress bar
KepingYan Feb 1, 2024
6285ae9
update
xwu99 Feb 18, 2024
ff6c4cb
update doc
xwu99 Feb 18, 2024
12c3bd3
update
xwu99 Feb 18, 2024
64e16cd
add support for ipex dataset
xwu99 Feb 22, 2024
348394a
update
xwu99 Feb 22, 2024
c73af8f
update
xwu99 Feb 22, 2024
f3a64a9
update
xwu99 Feb 22, 2024
06e4142
add doc
xwu99 Feb 22, 2024
d87854f
TODO: fix stats
xwu99 Feb 26, 2024
02a2a48
TODO: fix stats
xwu99 Feb 26, 2024
1c32952
Add track-token-latency, track-input-output, results-dir
xwu99 Feb 27, 2024
fa328aa
update doc
xwu99 Feb 27, 2024
82d646a
update doc
xwu99 Feb 27, 2024
79c45d5
update
xwu99 Feb 27, 2024
05d47ca
update code
xwu99 Feb 27, 2024
4197444
update
xwu99 Feb 27, 2024
bbdb393
add sample_requests_from_random_generation
xwu99 Feb 27, 2024
d0b5659
update
xwu99 Feb 27, 2024
621b1f1
update
xwu99 Feb 27, 2024
35a66ad
update
xwu99 Feb 28, 2024
0a30e61
update
xwu99 Feb 28, 2024
d2a7d59
update
xwu99 Feb 28, 2024
f2f5fb7
fix max_new_tokens bug, fix output_len, add param model_type for inpu…
KepingYan Mar 5, 2024
1211d84
revert
KepingYan Mar 5, 2024
c76d6da
resubmit
KepingYan Mar 5, 2024
1b462cb
fix test
KepingYan Mar 5, 2024
5d1a329
add length limit and fix index out of range
KepingYan Mar 11, 2024
4c49f57
merge main branch
KepingYan Mar 11, 2024
f704f92
fix conflict
KepingYan Mar 13, 2024
c29d982
merge upstream
xwu99 Mar 25, 2024
71586f8
Merge branch 'intel:main' into add-benchmark-serving
xwu99 Mar 26, 2024
5f0f1f6
add openai support
KepingYan Mar 26, 2024
71d8a3e
Refactor get_request to support batching
xwu99 Mar 26, 2024
97bc73f
Refactor get_request to support batching
xwu99 Mar 26, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
8 changes: 5 additions & 3 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -1,5 +1,7 @@
__pycache__
**.ipynb
debug/
build/lib/
llm_on_ray.egg-info/
llm_on_ray.egg-info/
__pycache__
**.ipynb
*.json
*.sh
Loading
Loading