Ground truth predictions don't yield 100% success rate #7

shuyanzhou · 2020-02-18T23:21:25Z

Hi,

I recently find that feeding ground truth actions and masks does not yield a 100% success rate on valid_seen.

Over 817 valid_seen samples (first 3 removed for personal reason), the result is:
SR: 674/817 = 0.825
PC: 1946/2097 = 0.928

Any thoughts about the possible cause?

MohitShridhar · 2020-02-18T23:42:41Z

Thanks for trying this out @shuyanzhou. Can you email me the results JSON files?

During the data generation, we replayed every trajectory with the masks and GT actions, and evaluated the success rate (here). We also did multiple replay checks per trajectory, so technically this shouldn't happen.

shuyanzhou · 2020-02-18T23:49:28Z

@MohitShridhar Thanks for your prompt reply! If there is no problem on your side, the difference might come from my own revisions to the code/data. I will check. If I could not solve the problem, I will reopen the issue. Thanks!

MohitShridhar · 2020-02-18T23:53:16Z

Yeah let me how it goes. We put in extra effort during the generation phase to make sure this doesn't happen. We discarded any trajectory that failed the evaluation. But something could have changed since then.

MohitShridhar · 2020-02-21T07:28:05Z

@shuyanzhou looks like there was something faulty with the evaluation metrics. Can you please pull from master and re-run your GT actions+masks test? Let me know how it goes.

shuyanzhou · 2020-02-24T19:32:40Z

@MohitShridhar thanks for pointing this. I did another evaluation pass, now the results are correct -- GTs yield 100% SR.

shuyanzhou closed this as completed Feb 18, 2020

Tushar-N mentioned this issue Feb 20, 2020

postconditions_met for sliced objects #11

Closed

MohitShridhar reopened this Feb 21, 2020

MohitShridhar closed this as completed Feb 24, 2020

MohitShridhar mentioned this issue Mar 28, 2020

Can't get 100% accuracy in Sub-Goal evaluation with ground-truth actions and masks. #19

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ground truth predictions don't yield 100% success rate #7

Ground truth predictions don't yield 100% success rate #7

shuyanzhou commented Feb 18, 2020

MohitShridhar commented Feb 18, 2020 •

edited

Loading

shuyanzhou commented Feb 18, 2020

MohitShridhar commented Feb 18, 2020

MohitShridhar commented Feb 21, 2020 •

edited

Loading

shuyanzhou commented Feb 24, 2020

Ground truth predictions don't yield 100% success rate #7

Ground truth predictions don't yield 100% success rate #7

Comments

shuyanzhou commented Feb 18, 2020

MohitShridhar commented Feb 18, 2020 • edited Loading

shuyanzhou commented Feb 18, 2020

MohitShridhar commented Feb 18, 2020

MohitShridhar commented Feb 21, 2020 • edited Loading

shuyanzhou commented Feb 24, 2020

MohitShridhar commented Feb 18, 2020 •

edited

Loading

MohitShridhar commented Feb 21, 2020 •

edited

Loading