We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent c2bce76 commit 2ba0dd8Copy full SHA for 2ba0dd8
benchmarks/summary.md
@@ -7,6 +7,9 @@ Date | Device | dtype | batch size | cache length |max input length |max output
7
----| ------- | ------ |---------- | -------------|-----------------|------------------|----------------------
8
2024-04-24 | TPU v5e-8 | bfloat16 | 128 | 2048 | 1024 | 1024 | 8249
9
2024-04-24 | TPU v5e-8 | int8 | 256 | 2048 | 1024 | 1024 | 10873
10
+2024-07-29 | TPU v5e-8 | int8 | 256 | 2048 | 1024 | 1024 | 8471.54
11
+
12
+**NOTE:(2024-07-29)** Looks like we have a regression in the past 3 month. We are working in fixing it.
13
14
15
## Gemma - 7B
0 commit comments