PerNodeDrop: A Practical and Efficient Alternative to DropConnect:- – Exploring Node-Owned Stochasticity #23090
geleshChrsitUniversity
started this conversation in
Ideas
Replies: 1 comment 3 replies
-
|
Interesting idea. The main appeal here is that the stochasticity is attached to the node rather than the full connection matrix, so it should be cheaper than DropConnect while still being more structured than plain Dropout. If you want to take this further, I would make it a backend-agnostic Keras 3 layer and verify a few things early:
I would also be careful to define exactly where the noise is applied, because that changes the behavior quite a bit. My instinct would be to treat this as a standalone package first and only push upstream if the experiments are consistently strong. |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
# Feedback Request: PerNodeDropDense – Exploring Node-Owned Stochasticity
Hello everyone,
I have been working on a stochastic regularization method called PerNodeDrop, and I would greatly appreciate feedback from the Keras/TensorFlow community regarding both the abstraction and the API design.
The work began with a simple question:
Who owns the stochasticity?
Most stochastic regularization methods can be viewed as assigning perturbation ownership to a particular computational entity.
Activation-Owned Stochasticity
Methods such as Dropout, GaussianDropout, and GaussianNoise assign stochasticity to activations through a dedicated regularization layer.
For a given sample, all downstream neurons observe the same perturbed activation value. The stochastic influence of that activation is therefore shared across the receiving layer.
Connection-Owned Stochasticity
DropConnect assigns stochasticity to individual connections.
Different downstream neurons may receive different stochastic realizations of the same upstream signal, creating a richer perturbation space.
However, stochasticity now operates in the connection space:
O(Nin × Nout)
which can become increasingly expensive as network size grows.
Node-Owned Stochasticity
PerNodeDrop explores a third ownership model.
Instead of assigning stochasticity to activations or connections, the receiving neuron owns the perturbation.
Each receiving neuron generates and applies its own perturbation during computation.
The perturbation is local to the computational unit and does not require connection-level mask management.
This shifts stochasticity from:
O(Nin × Nout)
to:
O(Nout)
while preserving Dense-layer execution semantics.
Current API
Bernoulli mode:
Gaussian mode:
The same abstraction supports both binary masking and continuous perturbation.
Why I Find This Interesting
PerNodeDrop attempts to combine:
The implementation is currently being prepared for open-source release with improved testing, documentation, serialization support, and production-quality packaging.
An initial preprint is available here:
[https://arxiv.org/abs/2512.12663]
A substantially revised manuscript is currently under journal review.
Feedback Requested
I would greatly appreciate thoughts on:
Any thoughts, criticism, implementation concerns, or references would be extremely valuable.
Beta Was this translation helpful? Give feedback.
All reactions