Add Learnable Query #19
Conversation
Keep the latent space as the 4D tensor and then linearize at the end, the same as what the Perceiver models do
Change prints to Debug statements
x = torch.randn((4, 6, 12, 16, 16))
out = query_creator(x)
# Output is flattened, so should be [B, T*H*W, C]
assert out.shape == (4, 16 * 16 * 6, 803)
Sorry, I probably just don't understand: where does the 803 come from?
That is the total number of channels along the channel dimension: the channel_dim (32) plus the number of Fourier features, which for this test is 771, so 32 + 771 = 803.
And the number of Fourier features comes from (num input axes) * (num_freq_bands * 2 + 1), so for this it's 3 * (128 * 2 + 1) = 771.
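The channel arithmetic above can be sketched as a few lines of plain Python (the variable names here are illustrative, not taken from the codebase):

```python
# Assumes 3 input axes (T, H, W) and 128 frequency bands, as in this test.
num_input_axes = 3
num_freq_bands = 128

# Each axis contributes 2 * num_freq_bands channels (a sin and cos per band)
# plus the raw coordinate itself, i.e. (num_freq_bands * 2 + 1) per axis.
fourier_channels = num_input_axes * (num_freq_bands * 2 + 1)
assert fourier_channels == 771

# Total query channels = latent channel_dim (32 here) + Fourier channels.
channel_dim = 32
total_channels = channel_dim + fourier_channels
assert total_channels == 803
```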
Thanks for adding this
Looks good to me. It's hard for me to judge whether this is the right method: is it copied from somewhere, or referenced in a paper?
This isn't copied from anywhere, just the general idea of using a random latent space to generate predictions from GANs; somewhat like the Skillful Nowcasting GAN paper, I guess.
Looks really good! I do love this idea of using random queries to create an ensemble!
_LOG.setLevel(logging.WARN)

class LearnableQuery(torch.nn.Module):
On the topic of position encoding for the queries...
I think the position encoding needs to be 'internally consistent' across each entire example. That is, the position encoding should be consistent across data inputs and the queries so the model can see that, say, the last timestep of the recent history in the input is immediately before the first timestep of the query. And that, in general, the model can see that the timeseries of recent history and the timeseries of queries are two parts of a contiguous timeseries.
To give a concrete example: If the recent history in the input spans 11:00 to 11:55, and the query is for 12:00 to 12:55, then we want the model to see that the first timestep of the query is 5 minutes after the last timestep of the input... if that makes sense?!
One way to do this might be to encode the positions once for all timesteps in the example (i.e. the concatenation of the recent history and the forecast timesteps), and then concatenate the last forecast_timesteps of the position encoding to the queries? Or something like that?! Not sure what's best!
Perhaps this could wait for a future PR though!
This is just a hunch, of course!
And, ultimately, we might want the position encoding to include both the relative position ("the ith element in the array") and the absolute position in time and space ("2pm in South London"). But we can worry about that later :)
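A minimal sketch of the "internally consistent" encoding suggested above: build one Fourier position encoding over the full timeline (history plus forecast), then slice off the trailing forecast steps for the queries. Note that `fourier_encode`, `history_steps`, and `forecast_steps` are illustrative names, not identifiers from this codebase:

```python
import math
import torch

def fourier_encode(positions: torch.Tensor, num_bands: int = 4) -> torch.Tensor:
    # positions: [N] in [-1, 1]; returns [N, 2 * num_bands + 1]
    freqs = 2.0 ** torch.arange(num_bands, dtype=torch.float32)
    scaled = positions[:, None] * freqs[None, :] * math.pi
    return torch.cat([positions[:, None], scaled.sin(), scaled.cos()], dim=-1)

history_steps, forecast_steps = 6, 12
total_steps = history_steps + forecast_steps

# One linspace over the whole example keeps history and forecast on the same
# coordinate axis, so the step spacing is consistent across the boundary.
timeline = torch.linspace(-1.0, 1.0, total_steps)
encoding = fourier_encode(timeline)        # [18, 9]

history_enc = encoding[:history_steps]     # attach to the data inputs
query_enc = encoding[history_steps:]       # attach to the queries
assert query_enc.shape[0] == forecast_steps
```

This way the first query timestep's encoding sits one step after the last history timestep's encoding, which is the contiguity property described above.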
Yeah, I might add that in a follow up PR, but it should be fairly easy to add those as options. I think it's a good idea to try at least!
Args:
    channel_dim: Channel dimension for the output of the network
    query_shape: The final shape of the query, generally the (T, H, W) of the output
Sorry for being slow, but what's the T in (T, H, W)? Time?
Yes, it's the number of timesteps.
elif conv_layer == "2d":
    conv = torch.nn.Conv2d
else:
    raise ValueError(f"Value for 'layer' is {conv_layer} which is not one of '3d', '2d'")
Nice!
Pull Request
Description
This adds a Learnable Query constructor that creates a query with some randomness to possibly be able to "ensemble" predictions through querying the Perceiver output multiple times.
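The ensembling idea can be sketched roughly as follows. Because part of the query is sampled randomly, querying the same output repeatedly yields a set of forecasts that can be averaged; `RandomQuery` and the `Linear` decoder here are illustrative stand-ins, not the actual classes in this PR:

```python
import torch

class RandomQuery(torch.nn.Module):
    """Stand-in for a query constructor with a random component."""

    def __init__(self, query_dim: int = 8):
        super().__init__()
        self.query_dim = query_dim

    def forward(self, batch: int, seq: int) -> torch.Tensor:
        # Fresh noise on each call, so each call produces a different
        # query and hence a different ensemble member.
        return torch.randn(batch, seq, self.query_dim)

query_creator = RandomQuery()
decoder = torch.nn.Linear(8, 1)  # stand-in for decoding the Perceiver output

# Query several times and average the members to form an "ensemble" mean.
members = [decoder(query_creator(4, 10)) for _ in range(5)]
ensemble_mean = torch.stack(members).mean(dim=0)  # shape [4, 10, 1]
```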
Fixes issue #16
How Has This Been Tested?
Unit tests
Checklist: