chateval
Popular repositories Loading
-
application
application PublicA platform for the warehousing and evaluation of neural open domain chatbot models.
-
archive
archive PublicForked from jsedoc/ChatEval
Public evaluation tool for non task driven neural open domain chatbots
Python 5
-
evaluation
evaluation PublicMicroservice to handle automatic evaluation of neural chatbot models. Multiple automated evaluation methods (including embedding-based metrics).
-
baseline-collection
baseline-collection PublicCode to publish HITs on Mechanical Turk to collect human baselines
Python 2
-
Repositories
- application Public
A platform for the warehousing and evaluation of neural open domain chatbot models.
chateval/application’s past year of commit activity - kani Public Forked from zhudotexe/kani
kani (カニ) is a highly hackable microframework for chat-based language models with tool usage/function calling.
chateval/kani’s past year of commit activity - GPTScore Public Forked from jinlanfu/GPTScore
Source Code of Paper "GPTScore: Evaluate as You Desire"
chateval/GPTScore’s past year of commit activity - botsim Public Forked from salesforce/botsim
BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots
chateval/botsim’s past year of commit activity - BEGIN-dataset Public Forked from google/BEGIN-dataset
A benchmark dataset for evaluating dialog system and natural language generation metrics.
chateval/BEGIN-dataset’s past year of commit activity - conture Public Forked from alexa/conture
ConTurE is a human-chatbot dataset that contains turn level annotations to assess the quality of chatbot responses.
chateval/conture’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…