Fix nn_test.py on AVX512 builds #21541

markdryan · 2018-08-10T08:27:17Z

This patch modifies the nn_test test case L2LossTest.testGradient
so that it passes on AVX512 builds. The test fails because its
error tolerance is too strict. It compares the difference of pairs
of tensor reductions to an expected result and fails if the
comparison is out by more than 1e-11. The problem is that a
summation reduction of doubles over the same tensor can give
slightly different results on different builds. The AVX2, AVX512
and non-vectorized versions of the tensor contraction algorithm add
the tensor's contents together in different orders, and because
floating-point addition is not associative, each ordering can round
differently.
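
The ordering effect is easy to reproduce outside TensorFlow. The sketch below is plain Python, not the Eigen or TensorFlow reduction code. The helpers `sequential_sum` and `lane_sum` are hypothetical names, and the interleaved "lanes" only roughly imitate how a vectorized build might group its additions.

```python
import math
import random


def sequential_sum(values):
    """Add the values one after another, left to right."""
    total = 0.0
    for v in values:
        total += v
    return total


def lane_sum(values, lanes):
    """Add the values into `lanes` interleaved partial sums, then combine them.

    This is only a rough stand-in for a vectorized reduction; real SIMD
    code may group and combine its partial sums differently.
    """
    partial = [0.0] * lanes
    for i, v in enumerate(values):
        partial[i % lanes] += v
    return sequential_sum(partial)


# Smallest illustration: the same three doubles, added in two orders.
print(repr((0.1 + 0.2) + 0.3))  # 0.6000000000000001
print(repr(0.1 + (0.2 + 0.3)))  # 0.6

# Larger illustration: one list of doubles, three accumulation orders.
rng = random.Random(1)
values = [rng.uniform(-1.0, 1.0) for _ in range(10_000)]
reference = math.fsum(values)  # correctly rounded sum, used as a yardstick

for label, total in (
    ("sequential", sequential_sum(values)),
    ("4 lanes", lane_sum(values, 4)),
    ("8 lanes", lane_sum(values, 8)),
):
    print(f"{label:>10}: {total!r} (differs from fsum by {total - reference:.3e})")
```

None of the three orderings is the correct one. Each is a valid double-precision sum, and any two of them may disagree in the last few bits.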

The accuracy of the AVX512 tensor reduction is no worse than that of
the AVX2 implementation. In fact, it is only luck that this test case
passes on AVX2 builds and fails on AVX512 builds: if the seed at the
start of the test is changed from 1 to 3, the test passes on AVX512
builds and fails on AVX2 builds. Rather than searching for a seed
that lets the test pass on all CPU architectures, it is better to
relax the error tolerance slightly.
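
To see why a fixed 1e-11 threshold is fragile, here is a minimal sketch that assumes the check is a central-difference gradient check of an L2 loss, sum(x^2) / 2. It is not the nn_test code. The names `l2_loss` and `max_gradient_error`, the step `delta`, the input size, and the `lanes` model of accumulation order are all assumptions made for illustration. Python's `random` module is not NumPy's generator, so seeds 1 and 3 here will not reproduce the behavior reported above.

```python
import random


def l2_loss(values, lanes=1):
    """Return sum(v * v) / 2, accumulated in `lanes` interleaved partial sums."""
    partial = [0.0] * lanes
    for i, v in enumerate(values):
        partial[i % lanes] += v * v
    total = 0.0
    for p in partial:
        total += p
    return total / 2.0


def max_gradient_error(values, lanes, delta=1e-3):
    """Largest gap between the analytic gradient and a numeric estimate.

    The analytic gradient of sum(x^2) / 2 with respect to x[i] is x[i].
    The numeric estimate is the difference of a pair of reductions,
    (loss(x + delta) - loss(x - delta)) / (2 * delta).
    """
    probe = list(values)
    worst = 0.0
    for i, x in enumerate(values):
        probe[i] = x + delta
        loss_up = l2_loss(probe, lanes)
        probe[i] = x - delta
        loss_down = l2_loss(probe, lanes)
        probe[i] = x  # restore the element before moving on
        numeric = (loss_up - loss_down) / (2.0 * delta)
        worst = max(worst, abs(numeric - x))
    return worst


TOLERANCE = 1e-11  # the strict threshold quoted in the description

for seed in (1, 3):
    rng = random.Random(seed)
    values = [rng.random() for _ in range(200)]  # input size is an assumption
    for lanes in (1, 4, 8):
        err = max_gradient_error(values, lanes)
        verdict = "pass" if err < TOLERANCE else "FAIL"
        print(f"seed={seed} lanes={lanes}: max error {err:.3e} -> {verdict}")
```

For a quadratic loss the central difference is exact in real arithmetic, so any error the sketch reports is rounding noise from the two reductions, amplified by the division by `2 * delta`. Which seed and lane combinations land under the threshold is therefore a matter of rounding rather than of correctness.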

Signed-off-by: Mark Ryan <mark.d.ryan@intel.com>

PiperOrigin-RevId: 208345397

googlebot added the cla: yes label Aug 10, 2018

markdryan mentioned this pull request Aug 10, 2018

The nn_test unit test testGradient.L2LossTest fails on AVX512 builds #21540

Closed

drpngx approved these changes Aug 10, 2018

drpngx added the ready to pull label Aug 10, 2018

drpngx self-assigned this Aug 10, 2018

tensorflow-copybara merged commit f135cec into tensorflow:master Aug 11, 2018

tensorflow-copybara pushed a commit that referenced this pull request Aug 11, 2018

Merge pull request #21541 from markdryan:avx512_fix_nn_test

926d063

PiperOrigin-RevId: 208345397
