More explicit gradient state interface #113
Conversation
Can we also rename the variable? Personally, in the context of a backwards pass, I feel the current name is ambiguous.
Thanks for bringing this up! I agree. Here's my proposal for names:

```ts
export interface GradContext {
  forward_inputs: [Tensor, ...ArgType[]]
  forward_output: Tensor
  backward_input: Tensor // the associated gradient of forward_output
  backward_output_index: number // index of the associated forward input to be differentiated
}
```
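To make the proposal concrete, here is a hypothetical sketch of how a backward function could consume `GradContext`. `Tensor` is stubbed as `number[]` and `reluBackward` is an invented example, purely for illustration; the real library's tensor type and gradient registration are not shown here.

```typescript
// Stand-in types for illustration only; the real Tensor is a library class.
type Tensor = number[]
type ArgType = Tensor | number

interface GradContext {
  forward_inputs: [Tensor, ...ArgType[]]
  forward_output: Tensor
  backward_input: Tensor // gradient w.r.t. forward_output
  backward_output_index: number // index of the forward input being differentiated
}

// Example gradient for relu: pass the upstream gradient through where the
// forward input was positive, zero elsewhere.
function reluBackward(ctx: GradContext): Tensor {
  const x = ctx.forward_inputs[0]
  return ctx.backward_input.map((g, i) => (x[i] > 0 ? g : 0))
}

const ctx: GradContext = {
  forward_inputs: [[-1, 2, 3]],
  forward_output: [0, 2, 3],
  backward_input: [1, 1, 1],
  backward_output_index: 0,
}
console.log(reluBackward(ctx)) // [0, 1, 1]
```

The explicit `backward_input`/`backward_output_index` names make it clear which direction each tensor flows, which was the point of the rename.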
LGTM! One nit with the change to `leakyRelu`.
```diff
@@ -16,7 +16,7 @@ export function relu(tensor: Tensor): Tensor {
   return tensor.maximum(scalar(0))
 }

-export function leakyRelu(tensor: Tensor, negative_slope: number): Tensor {
+export function leakyRelu(tensor: Tensor, negative_slope = 1e-3): Tensor {
```
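For anyone skimming the diff, here is a minimal numeric sketch of the behavior being changed: the negative slope now defaults to `1e-3` instead of being a required argument. This uses plain numbers rather than the library's `Tensor` type, so it is an illustration of the semantics, not the actual implementation.

```typescript
// Sketch of leakyRelu semantics with the new default slope.
// The real function operates on Tensors; plain numbers are used here.
function leakyRelu(x: number, negative_slope = 1e-3): number {
  // Positive inputs pass through; negative inputs are scaled down.
  return x >= 0 ? x : negative_slope * x
}

console.log(leakyRelu(5))       // 5
console.log(leakyRelu(-2))      // -0.002  (uses the 1e-3 default)
console.log(leakyRelu(-2, 0.1)) // -0.2    (caller-supplied slope)
```

Callers that previously had to pass a slope explicitly can now omit it and get the conventional small default.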
Fine to sneak this in, but can you annotate with the type? `negative_slope: number = 1e-3`
`bun format` removes the type annotation when the arg has a default.
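For context on why dropping the annotation is harmless: TypeScript infers a parameter's type from its default value, so `negative_slope = 1e-3` and `negative_slope: number = 1e-3` produce the same signature. A small sketch (the function name here is invented for illustration):

```typescript
// With a default value, TypeScript infers `negative_slope: number`,
// so the explicit annotation is redundant and a formatter may drop it.
function slopeOrDefault(negative_slope = 1e-3): number {
  // Calling this with a non-number argument is a compile-time error.
  return negative_slope
}

console.log(slopeOrDefault())    // 0.001
console.log(slopeOrDefault(0.2)) // 0.2
```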
:o @cryptodeal is that expected?
Clear naming is subjective, but I tried to be more explicit.