Initial finite differencing testing #14

willtebbutt · 2019-04-12T22:03:45Z

This is some initial work to set up systematic testing for frules and rrules. This initial work covers:

Some utility functionality to make running finite-differencing straightforward. This depends on FDM.jl
I've extended accumulate! very slightly to accomodate scalars. @jrevels I could use your input here, this might be totally inconsistent with what you have in mind.
I've moved some tests around and established a nested testset with 1-1 mapping between src and test files.
All frules and rrules in linalg are now covered as a demonstration of the functionality.

There are a few things that need to be addressed before I go about adding extra finite-differencing test-coverage (in a later PR) / before this could be merged:

What have I got structurally wrong in this PR? i.e. have I missed the point with everything.
What is missing? i.e. if we extended the types of tests in test/linalg.jl to cover the entire codebase, what still wouldn't be covered? The test_adjoint! function was already here when I started, so I've kept that. Does it need extending? Do we need a forwards-mode equivalent?
There are some tests left in runtests.jl that I've wrapped in a Misc Tests testset. These would ideally go somewhere else. I'm not entirely sure where the optimal location is though and could do with some advice.

test/test_util.jl

ararslan · 2019-04-12T22:30:45Z

I can't speak to Jarrett's overarching vision, so hopefully he can chime in on those parts, but overall this looks really good to me. I was just thinking about integrating FDM for the tests the other day, as it's proved immensely useful for Nabla. As I said in some in-line comments, I think much of the machinery here could actually be moved to FDM.

willtebbutt · 2019-04-12T22:47:52Z

As I said in some in-line comments, I think much of the machinery here could actually be moved to FDM.

I'm in total agreement with you here. Will open a PR to FDM. Would definitely simplify a lot of stuff here and various other bits of work, as you point out.

jrevels · 2019-04-14T19:11:20Z

src/rules/linalg.jl

@@ -31,7 +31,7 @@ end
 function rrule(::typeof(inv), x::AbstractArray)
    Ω = inv(x)
    m = @thunk(-Ω)
-    return Ω, Rule(ΔΩ -> m' * ΔΩ * Ω')
+    return Ω, Rule(ΔΩ -> extern(m)' * ΔΩ * Ω')


Ah good catch. IIRC, this was from back when adjoint was defined on t::Thunk to be something like @thunk(extern(t)').

Perhaps we should just move the adjoint into the @thunk, e.g.:

m = @thunk(-Ω') return Ω, Rule(ΔΩ -> m * ΔΩ * Ω')

Do we even need a Thunk here? What do we gain in this particular case?

Good point. The general reason to use a Thunk for closed-over computations is to avoid performing the computation if it ends up being unnecessary; e.g. if a Zero is passed in for the differential. However, that seems pretty unlikely for a unary rule.

Ohhh I see. Zero has higher precedence than Thunk in your monad-y dispatch list, meaning that if the stuff on the RHS of the Thunk is Zero then the Thunk is never materialised and we avoid the computation entirely. Have I understood this correctly? (I'm still getting too grips with this...)

jrevels · 2019-04-14T19:31:13Z

src/rules.jl

+function accumulate!(Δ, rule::AbstractRule, args...)
+    return materialize!(Δ, broadcastable(add(cast(Δ), rule(args...))))
+end
+accumulate!(Δ::Real, rule::AbstractRule, args...) = accumulate(Δ, rule, args...)


Seems reasonable. Could probably expand this to Δ::Number, even. Could we add a space between these two definitions? 🙂

I guess one place where this might yield unintuitive behavior is if it causes users to think that passing in a non-materialize!-able Δ always "just" works, e.g. if you call accumulate!(::SArray, ...) a user might expect it to fallback to accumulate instead of hitting the setindex! error it would currently hit when materialize!(::SArray, ...) is called. It's not immediately clear how to implement something that ensures that. However, I think we can justify/explain this fallback by saying that it only exists so that callers can handle things generically without checking for the scalar case. That way, it makes sense that numbers are special here, and avoids the impression that a similar fallback exists for immutable containers in general.

Downstream packages are allowed to special-case these methods as well, which makes it even more okay for us to do so. I'm writing docstrings for these methods today which will hopefully make things clearer.

Could we add a space between these two definitions?

ofc :)

Could probably expand this to Δ::Number

Sounds reasonable.

I'm on board with the rest of what you've suggested as well

jrevels · 2019-04-14T20:10:57Z

This is awesome, thanks so much!

Made some comments but everything here generally LGTM, agree with moving the relevant functionality to FDM.jl.

What is missing? i.e. if we extended the types of tests in test/linalg.jl to cover the entire codebase, what still wouldn't be covered? The test_adjoint! function was already here when I started, so I've kept that. Does it need extending? Do we need a forwards-mode equivalent?

Yup, a forward-mode equivalent will be necessary at some point, though it could probably eventually be merged with the current test_adjoint! into a single test harness.

There are quite a few permutations in terms of features to test; for any given rule, we could presumably test both modes vs. all relevant differential types vs. all relevant rule types vs. a sample of possible input shapes vs. different levels of materialization, etc. In the long run, we'd want to cover as much of that space as possible in shared test harnesses, and leave the rest to ad hoc per-rule tests. As time goes on and more per-rule ad hoc tests are added, we can refactor whenever we find similar tests/common functionality to pull out into the shared harnesses.

There are some tests left in runtests.jl that I've wrapped in a Misc Tests testset. These would ideally go somewhere else. I'm not entirely sure where the optimal location is though and could do with some advice.

I think these could be considered some of those ad hoc per-rule tests; seems like each test file could have its own Misc Tests testset for such tests. For example, the * and hypot test would fall under test/rule/base.jl's miscellaneous tests, while the broadcast(sin, ...) one would fall under test/rule/broadcast.jl.

Anyway, this PR is already a huge improvement, thanks again 🙂

willtebbutt · 2019-04-15T00:39:03Z

Yup, a forward-mode equivalent will be necessary at some point, though it could probably eventually be merged with the current test_adjoint! into a single test harness.

Are you happy for this particular PR to be merged before this happens?

I think these could be considered some of those ad hoc per-rule tests; seems like each test file could have its own Misc Tests testset for such tests.

Sounds reasonable to me.

willtebbutt · 2019-04-15T11:08:02Z

Latest push requires this FDM.jl PR to be merged and a new version tagged before tests stand a chance of passing.

ararslan · 2019-04-15T20:25:44Z

This can now use FDM 0.4.0.

Project.toml

Co-Authored-By: willtebbutt <wt0881@my.bristol.ac.uk>

…nRules.jl into wct/fdm-testing

willtebbutt · 2019-04-15T21:34:46Z

@jrevels anything else that you want done before merging?

ararslan

Looks good to me. Should definitely be squashed on merge.

willtebbutt · 2019-04-15T23:19:10Z

Should definitely be squashed on merge.

Haha yes, for sure.

jrevels · 2019-04-17T21:56:14Z

LGTM too 👍 thanks again!

Gave you commit bit so feel free to merge after rebase (sorry for the conflict, I can add the note about Δ::Real to the accumulate! docstring in a follow-up PR) 🙂

ararslan · 2019-04-17T22:01:18Z

I see no merge conflict...

jrevels · 2019-04-17T22:12:32Z

Huh, I guess GitHub served me an outdated version of the page or something? weird...

willtebbutt added 3 commits April 12, 2019 19:21

Initial work for unary functions

537a338

Resolve merge conflict

dc6d02a

linalg tested

c73c003

ararslan reviewed Apr 12, 2019

View reviewed changes

test/test_util.jl Outdated Show resolved Hide resolved

ararslan reviewed Apr 12, 2019

View reviewed changes

test/test_util.jl Outdated Show resolved Hide resolved

ararslan requested a review from jrevels April 12, 2019 22:30

willtebbutt mentioned this pull request Apr 13, 2019

Implements extensions to jvp and adjoint JuliaDiff/FiniteDifferences.jl#11

Merged

Remove FDM work

af79de4

jrevels reviewed Apr 14, 2019

View reviewed changes

jrevels mentioned this pull request Apr 15, 2019

add docstrings for accumulate, accumulate! and store! #16

Merged

Resolve Jarrett's comments

365b8b8

willtebbutt added 2 commits April 15, 2019 22:02

Require latest FDM version

645d5cc

Merge branch 'master' into wct/fdm-testing

487e1cc

ararslan reviewed Apr 15, 2019

View reviewed changes

Project.toml Outdated Show resolved Hide resolved

ararslan reviewed Apr 15, 2019

View reviewed changes

Project.toml Outdated Show resolved Hide resolved

ararslan and others added 3 commits April 15, 2019 22:09

Update Project.toml

109557d

Co-Authored-By: willtebbutt <wt0881@my.bristol.ac.uk>

Make FDM test-only

dc89787

Merge branch 'wct/fdm-testing' of https://github.com/willtebbutt/Chai…

93140a7

…nRules.jl into wct/fdm-testing

ararslan approved these changes Apr 15, 2019

View reviewed changes

ararslan merged commit 7b69721 into JuliaDiff:master Apr 17, 2019

nickrobinson251 mentioned this pull request Aug 28, 2019

accumulate!(::Real, ...) JuliaDiff/ChainRulesCore.jl#11

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial finite differencing testing #14

Initial finite differencing testing #14

willtebbutt commented Apr 12, 2019 •

edited

Loading

ararslan commented Apr 12, 2019

willtebbutt commented Apr 12, 2019 •

edited

Loading

jrevels Apr 14, 2019

willtebbutt Apr 15, 2019

jrevels Apr 15, 2019

willtebbutt Apr 15, 2019

jrevels Apr 14, 2019

willtebbutt Apr 15, 2019

jrevels commented Apr 14, 2019

willtebbutt commented Apr 15, 2019

willtebbutt commented Apr 15, 2019

ararslan commented Apr 15, 2019

willtebbutt commented Apr 15, 2019

ararslan left a comment

willtebbutt commented Apr 15, 2019

jrevels commented Apr 17, 2019

ararslan commented Apr 17, 2019

jrevels commented Apr 17, 2019

Initial finite differencing testing #14

Initial finite differencing testing #14

Conversation

willtebbutt commented Apr 12, 2019 • edited Loading

ararslan commented Apr 12, 2019

willtebbutt commented Apr 12, 2019 • edited Loading

jrevels Apr 14, 2019

Choose a reason for hiding this comment

willtebbutt Apr 15, 2019

Choose a reason for hiding this comment

jrevels Apr 15, 2019

Choose a reason for hiding this comment

willtebbutt Apr 15, 2019

Choose a reason for hiding this comment

jrevels Apr 14, 2019

Choose a reason for hiding this comment

willtebbutt Apr 15, 2019

Choose a reason for hiding this comment

jrevels commented Apr 14, 2019

willtebbutt commented Apr 15, 2019

willtebbutt commented Apr 15, 2019

ararslan commented Apr 15, 2019

willtebbutt commented Apr 15, 2019

ararslan left a comment

Choose a reason for hiding this comment

willtebbutt commented Apr 15, 2019

jrevels commented Apr 17, 2019

ararslan commented Apr 17, 2019

jrevels commented Apr 17, 2019

willtebbutt commented Apr 12, 2019 •

edited

Loading

willtebbutt commented Apr 12, 2019 •

edited

Loading