De-duplicate APIBench eval data (?) #87
Labels: good first issue, help wanted, question
The evaluation data for APIBench is duplicated between `data/apibench/*_eval.json` and `eval/eval-data/questions/`. I think the only difference is formatting. Maybe we should just keep the `eval/eval-data/responses` and have `data/apibench` for only data used to train the model.

Initially we made two copies with the following rationale:

- `apibench` should have all the data self-contained, which the community is using to train/benchmark their LLMs.
- `eval/` would have the eval data in a format that would be easy to eyeball and understand what is going on.

Maybe this is one of those few cases where it might be ok to have the same data twice in the repository in different formats?
Starting this issue in case anyone has comments on this.
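
Before deciding, it may help to confirm that the two copies really differ only in formatting. Below is a minimal sketch of such a check. It assumes each file is either a single JSON document or JSON Lines, and that matching records carry the same field names in both copies; neither assumption has been verified against the repository. The helper names (`load_records`, `canonical`, `diff_records`) are made up for illustration.

```python
import json
from collections import Counter


def load_records(text):
    """Parse either a single JSON document or JSON Lines into a list of records."""
    text = text.strip()
    if not text:
        return []
    try:
        data = json.loads(text)
    except json.JSONDecodeError:
        # Fall back to JSON Lines: one JSON value per non-empty line.
        return [json.loads(line) for line in text.splitlines() if line.strip()]
    return data if isinstance(data, list) else [data]


def canonical(record):
    """Serialize a record so that key order and whitespace do not matter."""
    return json.dumps(record, sort_keys=True, separators=(",", ":"))


def diff_records(text_a, text_b):
    """Return (only_in_a, only_in_b) as Counters of canonical records."""
    a = Counter(canonical(r) for r in load_records(text_a))
    b = Counter(canonical(r) for r in load_records(text_b))
    return a - b, b - a
```

To use it, read the contents of one `data/apibench/*_eval.json` file and its counterpart under `eval/eval-data/questions/` and pass both strings to `diff_records`. Two empty counters would mean the files hold the same records up to key order, whitespace, and record order. If the two formats use different field names, a mapping step would be needed before comparing.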