Use version of NDArray split that always returns a list. #454

mjdenkowski · 2018-06-22T19:04:25Z

This fixes an issue where inference silently broke when combining vocabulary restriction, batch decoding, and a single source factor. The cause was the behavior of mxnet.ndarray.split, which returns a list when num_outputs is greater than 1 but returns the individual NDArray (what would otherwise be element 0 of the list) when num_outputs is 1. As a result, the code was taking element 0 of the NDArray instead of the NDArray itself. This commit adds a wrapper for split that always returns a list, so both cases are handled consistently.
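The failure mode can be reproduced without MXNet. The following is a minimal sketch, not the code from this PR: `raw_split` is a hypothetical stand-in that mimics the return convention described above on plain Python lists (splitting along the first axis only), and `split_always_list` is an assumed shape for the wrapper.

```python
from typing import Any, List


def raw_split(data: List[Any], num_outputs: int) -> Any:
    """Hypothetical stand-in mimicking the return convention described above.

    Returns a list of chunks when num_outputs > 1, but the bare chunk
    (not wrapped in a list) when num_outputs == 1.
    """
    if len(data) % num_outputs != 0:
        raise ValueError("num_outputs must evenly divide len(data)")
    size = len(data) // num_outputs
    chunks = [data[i * size:(i + 1) * size] for i in range(num_outputs)]
    return chunks if num_outputs > 1 else chunks[0]


def split_always_list(data: List[Any], num_outputs: int) -> List[Any]:
    """Assumed wrapper: always returns a list of chunks."""
    result = raw_split(data, num_outputs)
    if num_outputs == 1:
        return [result]
    return result


source = [[1, 2], [3, 4]]

# With a single output, indexing the raw result silently picks the first
# row of the chunk rather than the chunk itself. No error is raised.
buggy = raw_split(source, num_outputs=1)[0]
fixed = split_always_list(source, num_outputs=1)[0]
print(buggy)  # [1, 2]
print(fixed)  # [[1, 2], [3, 4]]
```

Because `[0]` is valid on both a list and an array-like object, the mistake produces data of the wrong shape instead of an exception, which is why the bug went unnoticed.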

Pull Request Checklist

Changes are complete (if posting work-in-progress code, prefix your pull request title with '[WIP]' until you can check this box).
Unit tests pass (pytest)
System tests pass (pytest test/system)
Passed code style checking (./style-check.sh)
You have considered writing a test
Updated major/minor version in sockeye/__init__.py. Major version bump if this is a backwards incompatible change.
Updated CHANGELOG.md

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

- Use version of ndarray split that always returns a list for uniform handling.

fhieber

Great catch, that's a very subtle bug!

fhieber · 2018-06-23T07:36:35Z

sockeye/inference.py

@@ -158,7 +158,7 @@ def _get_encoder_module(self) -> Tuple[mx.mod.BucketingModule, int]:

        def sym_gen(source_seq_len: int):
            source = mx.sym.Variable(C.SOURCE_NAME)
-            source_words = source.split(num_outputs=self.num_source_factors, axis=2, squeeze_axis=True)[0]
+            source_words = utils.split(source, num_outputs=self.num_source_factors, axis=2, squeeze_axis=True)[0]


I think we don't need this change here: in the symbolic API, split always seems to return an 'indexable' Symbol/SliceChannel. We do these source-factor related splits in other places in the code as well, and they work just fine there. Also, the util function isn't typed for symbols, and I am surprised that data.split (i.e. the fluent method) works for symbols. For example, indexing element 0 works with num_outputs=1:

mx.sym.split(mx.sym.Variable('x'), num_outputs=1, axis=2)[0].eval(x=mx.nd.ones((2,2,2)))

This throws an error:

mx.sym.split(mx.sym.Variable('x'), num_outputs=1, axis=2)[1].eval(x=mx.nd.ones((2,2,2)))

Good catch!

fhieber · 2018-06-23T07:41:39Z

sockeye/utils.py

+    :return: List of NDArrays resulting from the split.
+    """
+    ndarray_or_list = data.split(num_outputs=num_outputs, axis=axis, squeeze_axis=squeeze_axis)
+    if num_outputs == 1:


Would it make sense to avoid the split altogether when num_outputs is 1? If squeeze_axis is True, one would only need a reshape/squeeze, which is essentially a no-op.

Another good point. I think it's a toss-up between staying as close to the original as possible versus micro-optimizing the call. Since we're now using this in just one place, called once per batch, I would lean toward keeping it this way for clarity.

yes, probably not worth the additional complexity.
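For illustration only, the shortcut discussed in this thread could look roughly like the sketch below. It is not part of this PR: `split_factors` is a hypothetical helper, and nested Python lists stand in for an array of shape (batch, seq_len, num_factors).

```python
from typing import Any, List


def split_factors(data: List[List[List[int]]],
                  num_factors: int,
                  squeeze_axis: bool = True) -> List[Any]:
    """Hypothetical helper: one output per factor along the last axis, always as a list."""
    if num_factors == 1:
        # Shortcut: nothing to split, so at most the trailing
        # length-1 axis needs to be dropped (a reshape).
        if squeeze_axis:
            return [[[tok[0] for tok in seq] for seq in data]]
        return [data]
    # General path: extract each factor separately.
    outputs = []
    for k in range(num_factors):
        if squeeze_axis:
            outputs.append([[tok[k] for tok in seq] for seq in data])
        else:
            outputs.append([[[tok[k]] for tok in seq] for seq in data])
    return outputs


single = [[[7], [8]], [[9], [10]]]              # shape (2, 2, 1)
double = [[[7, 1], [8, 0]], [[9, 1], [10, 0]]]  # shape (2, 2, 2)

print(split_factors(single, 1)[0])  # [[7, 8], [9, 10]]
print(split_factors(double, 2)[1])  # [[1, 0], [1, 0]]
```

The extra branch for the single-factor case is the additional complexity that the thread decided was not worth it for a call made once per batch.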

fhieber

Again, great fix!

tdomhan

indeed a great catch :)

mjdenkowski added 3 commits June 22, 2018 14:49

Fix source factor splitting for single factor.

2460464

- Use version of ndarray split that always returns a list for uniform handling.

Unit test for factor splitting.

87c333b

Update version, changelog.

03babf5

mjdenkowski requested review from davvil, fhieber and tdomhan as code owners June 22, 2018 19:04

fhieber requested changes Jun 23, 2018

fhieber added the bug label Jun 23, 2018

Keep original split call for sym_gen.

5ea4812

fhieber approved these changes Jun 25, 2018

tdomhan approved these changes Jun 25, 2018

tdomhan merged commit c59361d into master Jun 25, 2018

tdomhan deleted the split-fix branch June 25, 2018 09:49
