Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add llm as judge mt-bench dataset and metrics #791
Add llm as judge mt-bench dataset and metrics #791
Changes from all commits
e035f36
aa884cc
7621fbb
9f642ec
e88c3af
ffd946b
bad7244
4633961
44f6de0
3d71c1e
e9a1376
4bb70b4
88508bf
747efb3
859144c
1488ec1
4ebdc40
0ff6f13
51fda0c
6b67b59
00ab418
d70db84
932f70a
2e6b91e
08cee34
cd04870
d4990b0
8233c04
5622625
3f214d8
6287678
5c6fee9
518d016
e4f92f8
c9948de
9a04e32
ea46120
3f12332
d4ef4f6
99773b8
9b868d3
1d4a937
cf815ee
049a322
7cebc9e
6ef225c
eea313a
b3bcf76
2e9c0d6
ecfeb1d
85c7c4b
0b8af6a
b6879c6
5507ea8
507c4ec
0475505
f210ba1
785e2d3
dc1d901
2785535
ef0beb2
cff58a5
a40b64a
3ffbfac
77be8a2
34612ad
b4e59d2
2e487a4
767575d
9aeaf20
62b09f7
f6078a1
40de15e
7a67fd3
83fd39e
b2485b7
e783614
a647b98
a4364d1
b00d153
ab72f42
f284a2f
0a7e182
c89de53
0e32579
95bbd9b
5a81128
9b7712a
58494fe
c4181f3
6279905
345cf7f
b68f3fa
ef0b44d
76fe80e
133f493
d2ff421
0902187
679a988
a16fbac
eaeb84f
23bf027
9def31e
83391ea
231dc9a
b4716be
f76dc3b
d1475e0
a9345b6
ea230cb
7c37b36
60dd6a8
483e0e2
3094377
f7a146c
8538e74
4eb8322
ec46bb8
704554a
31ac967
a177505
c40166d
fa6b440
948a343
1dd773b
f15ce3d
e74acd4
6f5396b
1dc2623
f664126
deb5642
73c010b
4e1095d
8ba40e9
9a0cc4d
273fe22
0475982
338c70d
69b3068
804d605
53b6316
be35899
2a56811
c155c17
4b01950
f38da96
3e1a9d2
3506d92
5b848f3
e7905fa
a0b1c75
0890aa1
7e271ce
f30ebf4
bae0934
635adbe
a04e319
23a6997
5a50544
4c6aa39
a14bf0a
eea4288
09360f4
109d501
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing
Large diffs are not rendered by default.