
SameDiff: training nets with mixed precision variables fails on updater state view array type #6992

Closed
AlexDBlack opened this issue Jan 14, 2019 · 1 comment
Labels: SameDiff Autodiff related issues


@AlexDBlack (Contributor)

Suppose I have 2 weight arrays: one double, one float.
SameDiff can handle this (with appropriate casts), but it currently creates a single INDArray for the updater state. Consequently, when we try to update both parameters, one update fails with a mixed-datatype op error (double/float or float/double).

Two possibilities are available here:

  1. Split updater state by variable (updater state datatype matches variable datatype)
  2. Keep single updater state array, but add casting to the updaters to handle various input datatypes
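Option 1 can be sketched as follows. This is a hypothetical illustration (the class, enum, and method names are not from the SameDiff codebase) using plain Java arrays in place of INDArrays: each parameter gets its own updater-state buffer allocated in that parameter's datatype, so no update ever mixes float and double in a single op.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch of option 1: per-variable updater state whose element
// type matches the variable's type, instead of one flat view array.
public class PerVariableUpdaterState {

    // Minimal stand-in for a trainable parameter: name, length, element type.
    public enum DType { FLOAT, DOUBLE }

    public static final class Param {
        final String name;
        final int length;
        final DType dtype;
        public Param(String name, int length, DType dtype) {
            this.name = name; this.length = length; this.dtype = dtype;
        }
    }

    // Allocate one state buffer per parameter, matching its datatype.
    // (A single shared double[] view would force a cast for the float param.)
    public static Map<String, Object> allocateState(Param... params) {
        Map<String, Object> state = new LinkedHashMap<>();
        for (Param p : params) {
            state.put(p.name, p.dtype == DType.FLOAT
                    ? new float[p.length]      // e.g. momentum/velocity buffer
                    : new double[p.length]);
        }
        return state;
    }

    public static void main(String[] args) {
        Map<String, Object> s = allocateState(
                new Param("W1", 4, DType.DOUBLE),
                new Param("W2", 4, DType.FLOAT));
        // Each parameter's update now runs entirely in its own datatype.
        System.out.println(s.get("W1") instanceof double[]);  // true
        System.out.println(s.get("W2") instanceof float[]);   // true
    }
}
```

The trade-off versus option 2 is memory layout: splitting the state gives one small buffer per variable rather than a single contiguous view array, while option 2 keeps the flat array but pays for a cast on every update of a non-matching variable.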
@AlexDBlack AlexDBlack added the SameDiff Autodiff related issues label Jan 14, 2019
AlexDBlack added a commit that referenced this issue May 29, 2019
* #6992 SameDiff mixed precision training support

* Placeholder shape validation

* Checkpoint listener

* SameDiff checkpoint listener

* SameDiff: Remove no longer required trainable params config from TrainingConfig

* SameDiff: add name scopes

* SameDiff name scopes - javadoc and tests

* #7802 Evaluation class - report single class not macro avg in stats() for binary case

* #7804 Arbiter - update score functions to use ND4J evaluation metric enums

* SameDiff flatbuffers export: don't export arrays for array type variables (not required)
@AlexDBlack (Contributor, Author)

#7792
