
define adjoint #72

Merged: 8 commits merged from the adjoint branch into master on Dec 12, 2020

Conversation

ChrisRackauckas
Member

Fixes SciML/DiffEqFlux.jl#381. MWE:

using OrdinaryDiffEq, DiffEqSensitivity, Flux, DiffEqGPU, StaticArrays, CUDA
CUDA.allowscalar(false)

function model()
  prob = ODEProblem((du, u, p, t) -> du[1] = 1.01 * u[1] * p[1], u0, (0.0, 1.0), pa)

  function prob_func(prob, i, repeat)
    remake(prob, u0 = 0.5 .+ i/100 .* prob.u0)
  end

  ensemble_prob = EnsembleProblem(prob, prob_func = prob_func)
  solve(ensemble_prob, Tsit5(), EnsembleGPUArray(), saveat = 0.1, trajectories = 10, sensealg = ForwardDiffSensitivity(convert_tspan=false))
end

# loss function
loss() = sum(abs2, 1.0 .- Array(model()))

data = Iterators.repeated((), 10)

cb = function () # callback function to observe training
  @show loss()
end

pa = [1.0]
u0 = [3.0]
opt = ADAM(0.1)
println("Starting to train")

loss()

Flux.@epochs 10 Flux.train!(loss, params([pa]), data, opt; cb = cb)

@ChrisRackauckas
Member Author

@DhairyaLGandhi I think the map adjoint doesn't correctly ignore nothing values going backwards; could you take a look at this?
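
(For reference, "ignoring nothings" means: when no gradient flows back for some value, Zygote passes nothing as the cotangent, and a pullback has to propagate that through rather than treat it as a real gradient. A minimal sketch of that guard on a made-up function; mysum is purely illustrative and is neither DiffEqGPU's adjoint nor Zygote's actual map adjoint.)

using Zygote, ZygoteRules

mysum(xs) = sum(xs)

ZygoteRules.@adjoint function mysum(xs)
  y = sum(xs)
  function mysum_pullback(Δ)
    # no gradient came back for the output: pass nothing through instead of a dense array
    Δ === nothing && return (nothing,)
    (fill(Δ, length(xs)),)
  end
  return y, mysum_pullback
end

Zygote.gradient(x -> mysum(x .^ 2), [1.0, 2.0, 3.0])  # ([2.0, 4.0, 6.0],)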

@jc-audet

jc-audet commented Dec 8, 2020

I have a similar issue to the one in this DiffEqFlux thread:

https://github.com/SciML/DiffEqFlux.jl/issues/381

with something resembling the MWE in this thread. Has any progress been made in the past months?

Thank you

@DhairyaLGandhi
Member

Could we test with FluxML/Zygote.jl#846?

@ChrisRackauckas
Member Author

That branch doesn't seem to help. In fact, I'm a bit puzzled, so I made another similar example to work with first:

using OrdinaryDiffEq, DiffEqSensitivity, Flux, DiffEqGPU, StaticArrays, CUDA
CUDA.allowscalar(false)

function model()
  prob = ODEProblem((du, u, p, t) -> du[1] = 1.01 * u[1] * p[1] * p[2], u0, (0.0, 1.0), pa)

  function prob_func(prob, i, repeat)
    prob
  end

  ensemble_prob = EnsembleProblem(prob, prob_func = prob_func)
  solve(ensemble_prob, Tsit5(), EnsembleGPUArray(0.0), saveat = 0.1, trajectories = 10, sensealg = ForwardDiffSensitivity(convert_tspan=false))
end

# loss function
loss() = sum(abs2, 1.0 .- Array(model()))

data = Iterators.repeated((), 10)

cb = function () # callback function to observe training
  @show loss()
end

pa = [1.0,2.0]
u0 = [3.0]
opt = ADAM(0.1)
println("Starting to train")

loss()

Flux.@epochs 10 Flux.train!(loss, params([pa]), data, opt; cb = cb)

In the adjoint I define, i.e. ZygoteRules.@adjoint function batch_solve_up(ensembleprob, probs, alg, ensemblealg, I, u0, p; kwargs...), I have that:

(size(Array(VectorOfArray(adj))), size(p)) = ((2, 10), (2, 10))

So I know that what I'm pulling back is the same size as p (correct? I assume Zygote doesn't do anything crazy with matrices?). You would think that's working, but by the time it gets all the way back to the Flux update! code, it sees

(x, gs[x]) = ([1.0, 2.0], [46839.635021615635; 23419.817510807818])

meaning the derivative somehow got adjointed (transposed) on its own... what?
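
(For context on where that output comes from: Flux's train! builds an implicit-params Grads object and update! looks each parameter up in it, so the same check can be run outside of train!. A sketch, assuming the loss and pa from the MWE above; it just surfaces the shapes and values Flux is about to apply.)

using Flux, Zygote

ps = Flux.params([pa])
gs = Zygote.gradient(() -> loss(), ps)  # the same Grads that train! hands to update!
for x in ps
  # if the gradient got transposed on the way back, it shows up here,
  # even though the pullback returned something of size(p)
  @show (size(x), size(gs[x]))
  @show (x, gs[x])
end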

@ChrisRackauckas
Member Author

Oh wait, remembering that Zygote's adjoints for comprehensions are incorrect, I got rid of the comprehensions. See the last commit: that's all I needed to fix this issue. So I think comprehensions incorrectly transpose variables being pulled back. @DhairyaLGandhi you might want to take a look at that today and try to find a smaller reproducer, since this is an issue that keeps coming up.
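
(As a starting point for a smaller reproducer, one thing to try is differentiating the same computation written once as a comprehension and once with map, with matrix parameters so a stray transpose would show up in the gradient's shape or values. This is only a sketch of the comparison; it isn't known to trigger the bug on its own.)

using Zygote

f_comp(p) = sum(abs2, [p[i] * i for i in 1:length(p)])
f_map(p)  = sum(abs2, map(i -> p[i] * i, 1:length(p)))

p = [1.0 2.0; 3.0 4.0]  # matrix parameters, since the suspicion is a transpose

g_comp = Zygote.gradient(f_comp, p)[1]
g_map  = Zygote.gradient(f_map, p)[1]

@show size(g_comp) size(g_map)
@show g_comp == g_map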

ChrisRackauckas changed the title from [WIP] define adjoint to define adjoint on Dec 11, 2020
@ChrisRackauckas
Member Author

ChrisRackauckas commented Dec 12, 2020

The error isn't reproducible so I'm just going to merge, but @vchuravy it would be good to know why KernelAbstractions.jl sometimes cannot compile, and why the point at which it decides it can't seems random, depending on the computer, how many functions were run before it, and how many times the code has been run. I don't remember it being unstable like that.

@ChrisRackauckas merged commit 221a452 into master on Dec 12, 2020
@ChrisRackauckas deleted the adjoint branch on December 12, 2020 at 03:50
@ChrisRackauckas
Member Author

Seems like the test issue was just a change in inbounds semantics between different environments.

@DhairyaLGandhi
Member

DhairyaLGandhi commented Dec 21, 2020

I'm having trouble reproducing the issue. I see you've gotten rid of the comprehensions, but is there a more minimal example that I can use?

@ChrisRackauckas
Member Author

There isn't a more minimal example I could find.
