Added Conditional GAN and DCGAN tutorial #111

Open

wants to merge 4 commits into master
Conversation

shreyas-kowshik
Contributor

Added the markdown file as discussed @MikeInnes

@shreyas-kowshik changed the title from "Added DCGAN Tutorial" to "Added DCGAN Tutorial and Conditional GAN Model" on Mar 4, 2019
@shreyas-kowshik
Contributor Author

The code for the cGAN part is complete. I will add the tutorial in the Literate.jl format next.
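(For readers unfamiliar with that workflow, a minimal sketch of generating tutorial markdown with Literate.jl; the file and output names here are hypothetical, not the ones in this PR.)

using Literate

# Convert the annotated Julia script into the tutorial markdown
# (input and output paths are placeholders).
Literate.markdown("cgan_mnist.jl", "tutorials"; documenter=false)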

@shreyas-kowshik changed the title from "Added DCGAN Tutorial and Conditional GAN Model" to "Added Conditional GAN and DCGAN tutorial" on Mar 4, 2019
@shreyas-kowshik
Contributor Author

shreyas-kowshik commented Mar 4, 2019

Ping @MikeInnes.
Updated the PR with 2 tutorials in the format discussed.

@DhairyaLGandhi
Member

Could we get rid of the .ipynb_checkpoints really quickly?

@shreyas-kowshik
Contributor Author

Thank you @dhairyagandhi96. I have made the requested changes.

@MikeInnes
Member

You shouldn't have the same content twice here; you can remove the generated markdown file. I'm not sure what I asked for earlier (it's ideal to reference comments), but a README would usually just have an overall description and instructions for running the script.

@shreyas-kowshik
Contributor Author

@MikeInnes Thank you for the feedback. I have shortened the content of the markdown files, removed the HTML and fancy formatting, and added instructions for running the script.

1. The original commits were messed up; this commit overwrites all previous commits.
2. The markdown files and the instructions to run the code are included.
return p
end

function zero_grad!(model)
Member

You shouldn't need these utilities; just use update! with Params and Grads, like Flux.train! does.

Also, you're still using HTML tags above
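(As a rough sketch of the suggested pattern, not the code from this PR: a manual training step that takes gradients with respect to Params and applies the resulting Grads via update!, the way Flux.train! does internally. The model, optimiser, and data below are placeholders.)

using Flux
using Flux.Optimise: update!

disc = Chain(Dense(784, 256, relu), Dense(256, 1))   # placeholder "discriminator"
opt = ADAM(0.0002)
ps = params(disc)
x, y = rand(Float32, 784, 16), rand(Float32, 1, 16)  # dummy batch

# Take gradients with respect to the implicit Params, then apply them;
# no manual zero_grad! is needed in this style.
gs = gradient(ps) do
    Flux.mse(disc(x), y)
end
update!(opt, ps, gs)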

Contributor Author

@MikeInnes I used update! without manually zeroing out the gradients of the models. However, the model does not converge to the actual output even after repeated trials, whereas it does converge when I zero out the gradients manually. I don't know if it's a bug on my part. I was using this as a reference:
https://github.com/eriklindernoren/PyTorch-GAN/blob/1f130dfca726e14254e4fd78e5fb63f08931acd3/implementations/cgan/cgan.py#L161-L195

As pointed out on Slack, the gradient call used with update! should automatically zero out the gradients, but the results do not reflect that...

Member

The normal update! method will work if you use gradient rather than back!. back! should be avoided as it's effectively deprecated.
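(Illustrative only: the same step written with gradient instead of the old back! call; the tiny model and data here are placeholders, not code from this PR.)

using Flux
using Flux.Optimise: update!

m = Dense(3, 1)                      # placeholder model
ps = params(m)
opt = Descent(0.1)
x, y = rand(Float32, 3, 8), rand(Float32, 1, 8)

# Instead of calling Flux.back!(loss) and reading the .grad fields,
# ask for the gradients explicitly and hand them to update!.
gs = gradient(() -> Flux.mse(m(x), y), ps)
update!(opt, ps, gs)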

Contributor Author

@MikeInnes Sorry for replying late. I have made the requested changes.
Does it look good?

Contributor Author

update! does zero out the gradient, so I suppose there is no need to do it explicitly.


I believe update! only zeros out the gradients of the parameters involved in that call, not all of the gradients.


See this MWE below:

using Flux, Tracker

d1 = Dense(2, 1)
d2 = Dense(1, 1)
c = Chain(d1, d2)
p1 = params(d1)
p2 = params(d2)
pall = params(c)

x = rand(2, 10)

loss() = sum(c(x))

# Case 1
gradient(loss, pall).grads |> values |> println
# This zeros out all gradients

# Case 2
gradient(loss, p1).grads |> values |> println
# After this call, the gradient of p2 is not zeroed out
# Thus the call for gradient of p2 below will be affected
gradient(loss, p2).grads |> values |> println
# After this call, the gradient of p1 is not zeroed out

# Just zero out all gradients before the next experiment
Tracker.zero_grad!.(Tracker.grad.(p1))
Tracker.zero_grad!.(Tracker.grad.(p2))

# Case 3
gradient(loss, p1).grads |> values |> println
Tracker.zero_grad!.(Tracker.grad.(p2))  # just to avoid the situation in Case 1
gradient(loss, p2).grads |> values |> println

gives

Any[Float32[10.0] (tracked), Float32[1.8203094] (tracked), Float32[1.2446415 0.7490297] (tracked), Float32[-1.8717546] (tracked)]

Any[Float32[1.8203094] (tracked), Float32[1.2446415 0.7490297] (tracked)]
Any[Float32[20.0] (tracked), Float32[-3.7435093] (tracked)]

Any[Float32[1.8203094] (tracked), Float32[1.2446415 0.7490297] (tracked)]
Any[Float32[10.0] (tracked), Float32[-1.8717546] (tracked)]

See how Case 2 is different from Case 1 and Case 3.
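(Continuing from the MWE above, a hedged sketch of the workaround being discussed for the Tracker backend: clear the stale gradients of the other parameter group before taking its gradient, reusing the zero_grad! pattern shown above. The Descent optimiser and the update! call on Params are assumptions about the Flux v0.9 API, not code from this PR.)

opt = Descent(0.1)

# Update p1, then zero the gradients that accumulated on p2 so the next
# gradient call for p2 starts from scratch (and vice versa).
gs1 = gradient(loss, p1)
Flux.Optimise.update!(opt, p1, gs1)
Tracker.zero_grad!.(Tracker.grad.(p2))

gs2 = gradient(loss, p2)
Flux.Optimise.update!(opt, p2, gs2)
Tracker.zero_grad!.(Tracker.grad.(p1))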

Contributor

@matsueushi commented Dec 4, 2019

When I created a DCGAN model with the Tracker backend (Flux v0.9.0), it didn't converge unless I zeroed out the gradients after training the discriminator. https://github.com/matsueushi/fluxjl-gan/blob/flux0.9.0/mnist-dcgan.jl

However, with the Zygote backend (Flux v0.10.0),

using Flux

d1 = Dense(2, 1)
d2 = Dense(1, 1)
c = Chain(d1, d2)
p1 = params(d1)
p2 = params(d2)
pall = params(c)

x = rand(2, 10)

loss() = sum(c(x))

@info "Case1"
gradient(loss, pall).grads |> values |> println

@info "Case2"
gradient(loss, p1).grads |> values |> println
gradient(loss, p2).grads |> values |> println

gives expected results

[ Info: Case1
Any[Float32[9.715003], Float32[3.7775292 4.41461], Float32[10.0], Float32[-5.8413825]]
[ Info: Case2
Any[Float32[9.715003], Float32[3.7775292 4.41461]]
Any[Float32[10.0], Float32[-5.8413825]]

and I didn't have to zero out gradients. https://github.com/matsueushi/fluxjl-gan/blob/e60684b6c8ecc601eb6784ae393eae9a3a3ba57a/mnist-dcgan.jl


This is expected. Only a Tracker-based AD needs the zeroing-out step: Tracker accumulates gradients in place on the tracked parameters, so stale gradients carry over between calls. Zygote computes gradients functionally and returns a fresh Grads object each time, so it doesn't have this side effect.
