De-duplicate APIBench eval data (?) #87

ShishirPatil · 2023-08-07T09:59:51Z

The evaluation data for APIBench is duplicated between data/apibench/*_eval.json and eval/eval-data/questions/. I think the only difference is formatting. Maybe we should just keep the eval/eval-data/responses and have data/apibench for only data used to train the model.

Initially we made two copies with the following rationale:
apibench should have all the data self-contained, which the community is using to train/benchmark their LLMs.
eval/ would have the eval data in a format that would be easy to eyeball and understand what is going on.

Maybe this is one of those few cases where it might be ok to have the same data twice in the repository in different formats?

Starting this issue in case anyone has comments on this.

The text was updated successfully, but these errors were encountered:

ShishirPatil added help wanted Extra attention is needed good first issue Good for newcomers question Further information is requested labels Aug 7, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

De-duplicate APIBench eval data (?) #87

De-duplicate APIBench eval data (?) #87

ShishirPatil commented Aug 7, 2023

De-duplicate APIBench eval data (?) #87

De-duplicate APIBench eval data (?) #87

Comments

ShishirPatil commented Aug 7, 2023