small fix in the backward rule of norm
#131
Conversation
Codecov Report
All modified and coverable lines are covered by tests ✅

Additional details and impacted files:

```
@@           Coverage Diff           @@
##           master     #131   +/-   ##
=======================================
  Coverage   81.94%   81.94%
=======================================
  Files          42       42
  Lines        5666     5667    +1
=======================================
+ Hits         4643     4644    +1
  Misses       1023     1023
```

View full report in Codecov by Sentry.
ext/TensorKitChainRulesCoreExt.jl (Outdated)

```diff
@@ -172,7 +172,9 @@ end
 function ChainRulesCore.rrule(::typeof(norm), a::AbstractTensorMap, p::Real=2)
     p == 2 || error("currently only implemented for p = 2")
     n = norm(a, p)
-    norm_pullback(Δn) = NoTangent(), a * (Δn' + Δn) / (n * 2), NoTangent()
+    function norm_pullback(Δn)
+        return NoTangent(), a * (Δn' + Δn) / (n * 2 + eps(real(eltype(a)))), NoTangent()
```
Could you change this to `a * (Δn' + Δn) / 2 / hypot(n, eps(one(n)))`? I think that is slightly nicer, in that, if `n == 1`, then `n + eps()` is no longer exactly one, but `hypot(1., eps())` is still exactly `1.` due to machine precision.
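For concreteness, a quick REPL check of the two guards at `n == 1` in `Float64` (illustrative only, not part of the PR):

```julia
julia> 1.0 + eps()        # n + eps() is no longer exactly one
1.0000000000000002

julia> hypot(1.0, eps())  # sqrt(1 + eps()^2) still rounds to exactly 1.0
1.0
```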
Yes, this is indeed better. I have modified this line according to the suggestion.
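For reference, a sketch of the rule with the suggestion applied (the closing `end` and the final `return n, norm_pullback` are assumed from the usual ChainRulesCore rrule pattern; they are not shown in the diff above, so this is not necessarily the exact committed code):

```julia
function ChainRulesCore.rrule(::typeof(norm), a::AbstractTensorMap, p::Real=2)
    p == 2 || error("currently only implemented for p = 2")
    n = norm(a, p)
    function norm_pullback(Δn)
        # hypot(n, eps(one(n))) keeps the denominator nonzero when n == 0,
        # while being indistinguishable from n whenever n is not tiny
        return NoTangent(), a * (Δn' + Δn) / 2 / hypot(n, eps(one(n))), NoTangent()
    end
    return n, norm_pullback
end
```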
Thanks; that's an important fix. I made one suggestion in the code.
Previously, the backward rule of `norm` would return `NaN` if the norm was zero.
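To make the failure mode concrete, here is a scalar stand-in for the tensor `a` (illustrative only; the actual rule acts on an `AbstractTensorMap`):

```julia
a, n, Δn = 0.0, 0.0, 1.0                          # zero tensor, zero norm

old = a * (Δn' + Δn) / (n * 2)                    # 0 / 0 == NaN
new = a * (Δn' + Δn) / 2 / hypot(n, eps(one(n)))  # 0.0, as expected
```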