
convert forward return to tensor in FeatureAblation #1049

Closed
aobo-y wants to merge 2 commits

Conversation

@aobo-y (Contributor) commented Oct 21, 2022

FeatureAblation (and FeaturePermutation) never clearly defined the type of forward's return value. According to the documentation, a tensor is the only acceptable type:

The forward function can either return a scalar per example or a tensor of a fixed sized tensor (or scalar value) for the full batch

However, returning a single int or float is a common use case we already support (ref #1047 (comment)). But our code did not explicitly raise an error for unexpected types, so other types like list, tuple, or numpy.ndarray may pass silently or fail in unexpected places with confusing error messages, e.g., when the return type ends up being used as a torch.dtype:

dtype=attrib_type,
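For illustration only (a hypothetical snippet, not code from this PR), this is roughly how an unsupported return type can surface as an unrelated dtype error:

  import torch

  forward_output = [0.5, 0.5]          # forward returned a Python list
  attrib_type = type(forward_output)   # <class 'list'>
  # Reusing that type as a torch dtype fails far away from the real cause:
  torch.zeros(3, dtype=attrib_type)    # raises TypeError about dtype, not about the list return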

This PR explicitly asserts the return type and converts everything into a tensor. The assertion and conversion are done in a new private _run_forward wrapper rather than in _run_forward from the global utils, which is shared by many other classes. I will update the others progressively and eventually push the logic into the shared _run_forward.
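A minimal sketch of what such a wrapper could do (names and details here are illustrative, not the actual diff):

  import torch
  from torch import Tensor

  def _run_forward_to_tensor(forward_output):  # hypothetical helper name
      # Tensors pass through unchanged.
      if isinstance(forward_output, Tensor):
          return forward_output
      # Only Python int/float scalars are otherwise accepted; list, tuple,
      # numpy.ndarray, etc. fail loudly here instead of deep inside ablation.
      assert isinstance(forward_output, (int, float)), (
          f"forward_func must return a tensor, int, or float; got {type(forward_output)}"
      )
      # Convert the scalar to a 0-dim tensor, letting the Python type pick the dtype.
      return torch.tensor(forward_output, dtype=type(forward_output))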

# our tests expect int -> torch.int64, float -> torch.float64
# but this may actually depend on the machine
# ref: https://docs.python.org/3.10/library/stdtypes.html#typesnumeric
return torch.tensor(forward_output, dtype=output_type)
aobo-y (Contributor, Author) commented:

I inherited our original logic of passing Python types as torch dtypes, like dtype=float. But this is not an officially documented operation. Existing tests assume it must equal dtype=torch.float64: https://github.com/pytorch/captum/blob/5f878af6a7/tests/attr/test_feature_ablation.py#L429

But this may be machine dependent (https://docs.python.org/3.10/library/stdtypes.html#typesnumeric):

Floating point numbers are usually implemented using double in C; information about the precision and internal representation of floating point numbers for the machine on which your program is running is available in sys.float_info

Two other alternatives are:

  • explicitly map the Python type to a torch dtype, e.g. float -> torch.float64 (see the sketch below)
  • do not set dtype and rely on torch's default dtype (float32)
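A sketch of the first alternative, for comparison (illustrative only, not part of this PR):

  import torch

  # Explicit mapping from Python scalar types to torch dtypes.
  _SCALAR_DTYPES = {int: torch.int64, float: torch.float64}

  def _scalar_to_tensor(forward_output):
      # Falls back to dtype=None (torch's defaults) for anything unmapped.
      return torch.tensor(forward_output, dtype=_SCALAR_DTYPES.get(type(forward_output)))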

vivekmig (Contributor) replied:

This is an interesting point! I looked into it a bit; this functionality seems to have been added in pytorch/pytorch#21215.
It looks like the type mapping is done explicitly on the C++ side using the PyObject type, so it shouldn't be affected by the machine's internal float representation. This is the mapping logic:

  PyObject *obj = args[i];
  if (obj == (PyObject*)&PyFloat_Type) {
    return at::ScalarType::Double;
  }
  if (obj == (PyObject*)&PyBool_Type) {
    return at::ScalarType::Bool;
  }
  if (obj == (PyObject*)&PyLong_Type
#if PY_MAJOR_VERSION == 2
      || obj == (PyObject*)&PyInt_Type
#endif
  ) {
    return at::ScalarType::Long;
  }

So if float is set as dtype, this would be passed through the Python / C++ bindings as PyFloat_Type, which should always correspond to ScalarType::Double / torch.float64. The tests in the original PR also verify this mapping.
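A quick check of that behavior from Python (assuming a recent torch release):

  import torch

  # Builtin Python types used as dtype map to fixed torch dtypes,
  # independent of the platform's C float size:
  assert torch.tensor(1.0, dtype=float).dtype == torch.float64
  assert torch.tensor(1, dtype=int).dtype == torch.int64
  assert torch.tensor(True, dtype=bool).dtype == torch.bool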

aobo-y (Contributor, Author) replied:

Thanks for the deep dive @vivekmig!
Then I will just add a comment referring to the mapping, also as a caveat.
After all, it is not a documented torch usage and may see breaking changes someday.

vivekmig (Contributor) replied:

Makes sense, sounds good!

@facebook-github-bot (Contributor) commented:

@aobo-y has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@vivekmig (Contributor) left a review:

Looks great, thanks! Just one note on the dtype comments.


@@ -601,3 +593,20 @@ def _find_output_mode(
            feature_mask is None
            or all(len(sm.shape) == 0 or sm.shape[0] == 1 for sm in feature_mask)
        )

    def _run_forward(self, *args, **kwargs) -> Tensor:
        forward_output = _run_forward(*args, **kwargs)
vivekmig (Contributor) commented:

nit: It seems a bit confusing to see both the instance method and the original utility method named _run_forward; you could consider renaming this one slightly, but either way is fine.

@facebook-github-bot (Contributor) commented:

@aobo-y has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
