
Loosen separation test tolerance #302

Merged: 2 commits merged into master on Jan 11, 2019

Conversation

craffel (Owner) commented Jan 9, 2019

For some tests, the difference between the scores produced by Travis' build and my own build is as large as 3.89588121e-04. This was causing Travis to fail. In practice, we shouldn't care much about differences smaller than 10e-3 for separation tasks, since it would be very unusual for anyone to pay attention to differences this small when comparing source separation algorithms.

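The change amounts to loosening the absolute tolerance used when comparing separation scores across builds. A minimal sketch of the idea, using the maximum deviation reported in this thread; the variable names and exact tolerances here are illustrative, not mir_eval's actual test code:

```python
import numpy as np

# Hypothetical separation scores from a local build; the CI build's scores
# differ by the maximum deviation reported in this thread (4.87191262e-03).
local_scores = np.array([5.123456, 7.654321])
ci_scores = local_scores + 4.87191262e-03

# A tight absolute tolerance fails on such cross-platform differences:
tight_ok = np.allclose(local_scores, ci_scores, atol=1e-4)

# Loosening atol past the observed deviation makes the comparison pass:
loose_ok = np.allclose(local_scores, ci_scores, atol=1e-2)

print(tight_ok, loose_ok)  # False True
```

In a test suite one would typically use `np.testing.assert_allclose(local_scores, ci_scores, atol=1e-2)` instead, which raises with a diagnostic message on failure.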
bmcfee (Collaborator) commented Jan 9, 2019

LGTM; I think this is related to our previous headaches with BLAS discrepancies across platforms, but I agree that anything past the 3rd (2nd?) decimal place is unreliable in source separation anyway.

craffel (Owner, Author) commented Jan 9, 2019

Amazingly, this is still not loose enough for the Python 3.4 build:
https://travis-ci.org/craffel/mir_eval/jobs/477467832
The max absolute deviation between my system and Travis' system is 4.87191262e-03. What do we think about this? @faroit

faroit commented Jan 9, 2019

  • Do we still need to support 3.4?
  • I totally agree to loosen it further. Bsseval already caused way too much trouble over here ;-)

craffel (Owner, Author) commented Jan 10, 2019

> Do we still need to support 3.4?

Not necessarily; we just haven't updated Travis. That this affects 3.4 and not, say, 3.5 or 3.6 is mostly a coincidence, I think.

> I totally agree to loosen it further. Bsseval already caused way too much trouble over here ;-)

Ok, will do.

bmcfee (Collaborator) commented Jan 10, 2019

> Not necessarily; we just haven't updated Travis. That this affects 3.4 and not, say, 3.5 or 3.6 is mostly a coincidence, I think.

I dropped 3.4 builds on travis (in librosa) for exactly this reason. Since mir_eval doesn't use any fancy features of >=3.5 (except, maybe, implicitly ordered dictionaries?), I don't see much point in keeping the 3.4 test around.
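Dropping the 3.4 builds would be a one-line change to the Travis build matrix. A hypothetical sketch of the relevant `.travis.yml` fragment; the version list is illustrative, not mir_eval's actual configuration:

```yaml
language: python
python:
  - "3.5"
  - "3.6"
  # "3.4" removed: its environment produced the outsized score deviations above
```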

@craffel craffel merged commit 6e9f59b into master Jan 11, 2019