sharenoise #1
Conversation
Maybe I would prefer to require calling `reset` manually with the (maximum) query length (shape), instead of calling it automatically when the shape doesn't match. Then in `forward` we can just check that we have at least the required length and truncate if needed. That would allow storing the PEs for multiple iterations, in case we find the redrawing is slow or want deterministic behavior.
Yes,

Still, we could store the pure noise
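For concreteness, here is a minimal sketch of the reset-then-truncate idea discussed above, under assumed names (`SharedSPE`, `qbar`); it is not the repo's actual implementation:

```python
import torch
from typing import Tuple

class SharedSPE(torch.nn.Module):  # hypothetical class name
    def __init__(self, num_realizations: int = 256):
        super().__init__()
        self.num_realizations = num_realizations
        self.qbar = None  # stored positional-code draws, reused until reset

    def reset(self, max_shape: Tuple[int, int, int]) -> None:
        # Explicitly redraw the noise for the maximum expected query shape,
        # max_shape = (batch, max_length, in_features).
        self.qbar = torch.randn(*max_shape, self.num_realizations)

    def forward(self, queries: torch.Tensor) -> torch.Tensor:
        # Only check that enough positions were drawn, then truncate.
        if self.qbar is None or self.qbar.shape[1] < queries.shape[1]:
            raise RuntimeError("call reset() with a large enough shape first")
        return self.qbar[:, : queries.shape[1]]
```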
dev/spe/spe.py (Outdated)

```
@@ -188,6 +217,8 @@ def __init__(
    in_features: int = 64,
    num_realizations: int = 256,
    num_sines: int = 10,
    key_shape: Optional[Tuple[int, ...]] = None,
```
Looks a bit redundant to me, what about a `max_length`?
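A sketch of what this proposal could look like; the surrounding signature is copied from the diff above, while the class name is assumed and the `max_length` parameter is only the reviewer's suggestion:

```python
from typing import Optional

import torch

class SineSPE(torch.nn.Module):  # class name assumed for illustration
    def __init__(
        self,
        in_features: int = 64,
        num_realizations: int = 256,
        num_sines: int = 10,
        max_length: Optional[int] = None,  # was: key_shape: Optional[Tuple[int, ...]]
    ):
        super().__init__()
        self.max_length = max_length
```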
Fixed
dev/spe/spe.py (Outdated)

```
        self.reset(key_shape, share_in_batch)

    def reset(self,
              key_shape: Tuple[int, ...],
```
I don't really like that we need to provide the `key_shape` each time; can't we do otherwise?
Fixed
This PR optimizes memory usage and speed by sharing the SPE along all layers. This is done in the following way:

- `qbar` and `kbar` are shared on all layers. The strategy picked is to keep `qbar` and `kbar` untouched as long as their shapes are OK. They must be manually reset if required.
- Regarding `einsum`: it's indeed much nicer, but for some mysterious reason it apparently did not allow saving RAM when reusing `qbar` and `kbar`.
- The notebook tries to apply the SPE many times in a row, to simulate many layers.
- `my_spe.reset()` must now be called explicitly each time a new SPE must be computed (typically at each batch during training); see the usage sketch below.
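A hypothetical usage of this explicit-reset convention, reusing the `SharedSPE` sketch from the earlier comment; all shapes and names here are illustrative only, not the repo's API:

```python
import torch

my_spe = SharedSPE(num_realizations=256)
num_layers = 4

for step in range(3):                  # e.g. three training batches
    my_spe.reset((8, 128, 64))         # one explicit redraw per batch
    queries = torch.randn(8, 100, 64)
    for _ in range(num_layers):
        qbar = my_spe(queries)         # every layer reuses the same draws
```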