Two bugs in evaluation scripts #38

Closed
bxclib2 opened this issue Aug 1, 2019 · 1 comment

bxclib2 commented Aug 1, 2019

Hi

In evaluation.py, line 495:

    scores[level]['exec'] = 0  

should be:

    scores[level]['exec'] = 0.  
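
For context, a minimal sketch of why the initializer matters (variable names are illustrative, not copied from evaluation.py): the accumulator is presumably divided by an integer example count later on, and under Python 2 an int/int division truncates the per-level accuracy.

    # Buggy vs. fixed initializer (names are illustrative only)
    exec_int = 0      # current code: integer accumulator
    exec_float = 0.   # proposed fix: float accumulator
    count = 250

    exec_int += 249    # e.g. 249 correct executions out of 250
    exec_float += 249

    # Python 2: 249 / 250 == 0 (integer division), but 249. / 250 == 0.996
    # Python 3: both print 0.996, since / is always true division there
    print(exec_int / count)
    print(exec_float / count)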

and at line 549, the following line should be added:

            scores["all"]['exec'] += 1

These bugs give wrong and strange results when evaluating the exec accuracy. Could you fix them? Thanks a lot!

Also, I noticed that even if I copy the gold query exactly, the exec accuracy is not 100%. Could you check and find the errors? This is the output I get:
                    easy     medium   hard     extra    all
    count           250      440      174      170      1034

    ===================== EXECUTION ACCURACY =====================
    execution       1.000    0.995    1.000    1.000    0.998

    ====================== EXACT MATCHING ACCURACY =====================
    exact match     1.000    1.000    1.000    1.000    1.000

taoyds commented Jun 8, 2020

Thanks! Already corrected in March.

taoyds closed this as completed Jun 8, 2020