Docs clarification
AlexDBlack committed Apr 6, 2019
1 parent fb0bfbe commit b55063e
Showing 1 changed file with 4 additions and 1 deletion.
@@ -3372,7 +3372,9 @@ How? Walk backward on op graph starting at loss variable(s), along FP variables
Step 3: Differentiate ops in minimal subgraph
The only major issue here is with multiple-output ops, where only one of the outputs leads to the loss.
-For example, X -> slice -> (A,B); B -> loss, with A being unused.
+For example, X -> slice -> (A,B); B -> loss, with A being unused (in that it doesn't contribute to the loss function).
+But to do backprop for this op, we need gradient variables/arrays for both outputs (A and B).
+We know the shape and type of dL/dA must be exactly the same as A; we also know that the loss function doesn't depend on A. Hence, dL/dA == zerosLike(A).
*/
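
For illustration, a minimal array-level sketch of that rule using ND4J. Nd4j.zerosLike is the array analogue of the sameDiff.zerosLike(v) call in the second hunk below; the class name and the 2x3 shape are assumptions made for this example only.

import org.nd4j.linalg.api.ndarray.INDArray;
import org.nd4j.linalg.factory.Nd4j;

public class ZerosLikeGradientSketch {
    public static void main(String[] args) {
        // Stand-in for output A of a multi-output op, where A does not feed into the loss.
        INDArray a = Nd4j.rand(2, 3);

        // dL/dA must match A's shape and data type, but the loss does not depend on A,
        // so the gradient array is simply all zeros: dL/dA == zerosLike(A).
        INDArray dLdA = Nd4j.zerosLike(a);

        System.out.println(dLdA);   // prints a 2x3 array of zeros
    }
}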

@@ -3623,6 +3625,7 @@ public SDVariable[] define(SameDiff sameDiff, Map<String, INDArray> inputs, SDVariable[] variableInputs) {
    if(!v.dataType().isFPType()){
        grads.add(null);
    } else {
+       //See "Step 3: Differentiate ops in minimal subgraph" above for an explanation of why this should be zerosLike here...
        SDVariable gTemp = sameDiff.zerosLike(v);
        grads.add(gTemp);
    }
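
For reference, a self-contained sketch of the same pattern: non-floating-point variables get a null gradient, floating-point variables get a zeros gradient of matching shape and type. The class name and the example variables fp/idx are hypothetical, not part of the codebase.

import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

import org.nd4j.autodiff.samediff.SDVariable;
import org.nd4j.autodiff.samediff.SameDiff;
import org.nd4j.linalg.factory.Nd4j;

public class GradPlaceholderSketch {
    public static void main(String[] args) {
        SameDiff sd = SameDiff.create();

        // Hypothetical variables: one floating-point, one integer-typed.
        SDVariable fp  = sd.var("fp", Nd4j.rand(2, 2));                     // FLOAT variable
        SDVariable idx = sd.constant("idx", Nd4j.createFromArray(0, 1, 2)); // INT constant

        List<SDVariable> grads = new ArrayList<>();
        for (SDVariable v : Arrays.asList(fp, idx)) {
            if (!v.dataType().isFPType()) {
                grads.add(null);             // non-FP variables get no gradient
            } else {
                grads.add(sd.zerosLike(v));  // zero gradient with v's shape and type
            }
        }
        // grads now holds [zerosLike(fp), null]
    }
}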