Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[DOC] Fully document HBPConv2d #290

Merged
merged 4 commits into from Nov 13, 2022
Merged

[DOC] Fully document HBPConv2d #290

merged 4 commits into from Nov 13, 2022

Conversation

f-dangel
Copy link
Owner

@f-dangel f-dangel commented Nov 12, 2022

This PR adds documentation to the HBP extension of Conv2d layers, which is responsible to compute KFAC/KFLR/KFRA. The docstrings draw connections to the notation in the KFC paper, and outline important differences, as well as improvements for consistency. It also adds a test case for KFAC, KFLR for which both approximations become exact.

Note to myself: I made notes how to connect Hessian backpropagation to KFAC for convolutions by imposing a Kronecker structure on the backpropagated quantity. This concept can also be applied to KFRA to achieve more consistency, but is currently not done by the code.

Convolution layers with a single output behave like linear layers, as
the weights are not shared over the input.
@f-dangel f-dangel merged commit 5e0fb35 into development Nov 13, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant