Spectral Positional Encoding #45

BrianPugh · 2021-11-10T04:43:36Z

I see in your FourierUnit you added an optional spectral_pos_encoding argument. Have you experimented at all with this? Has it improved/reduced performance?

The text was updated successfully, but these errors were encountered:

windj007 · 2021-11-10T12:41:30Z

Wow! Great that you've noticed that :)

We experimented with positional encoding in spectral domain just a little bit. It did not help for the inpainting on our benchmarks - but might work in other cases. But we did not explore that feature thoroughly enough to say something for sure.

I'll be happy to hear back if this feature helps :)

BrianPugh · 2021-11-10T16:35:45Z

Interesting, I would expect positional encoding (possibly a different encoding than a simple linear mesh) would have helped.

So, this suggests a few possible outcomes (1x1 filter/conv here will always refer to the conv in the frequency domain inside the Spectral Transform block):

The 1x1 filter doesn't take frequency into account/is frequency agnostic (this also applies to the original FFC paper).
Some sort of spatial information is latently encoded in the featuremaps. In this case, the 1x1 convolution takes frequency into account, but the positional encoding is redundant.
The frequency/phase domain isn't really important; each pixel in the spectral image is to just be interpreted as a different hash of the entire input featuremap.
The 1x1 filter doesn't actually do anything. Perhaps the real power just comes from applying BN and relu on the spectral image before applying the iFFT. Or, perhaps its inappropriate to perform the BN/relu since it limits what the post-iFFT transform image looks like.

Marcelo5444 · 2022-03-31T12:37:27Z

Hi @BrianPugh, Have you done any further research on that?

BrianPugh · 2022-03-31T15:44:49Z

i have not had a chance/the resources to perform experiments with these changes.

Mengmengbai · 2022-11-01T05:30:37Z

Interesting, I would expect positional encoding (possibly a different encoding than a simple linear mesh) would have helped.

So, this suggests a few possible outcomes (1x1 filter/conv here will always refer to the conv in the frequency domain inside the Spectral Transform block):

The 1x1 filter doesn't take frequency into account/is frequency agnostic (this also applies to the original FFC paper).

Some sort of spatial information is latently encoded in the featuremaps. In this case, the 1x1 convolution takes frequency into account, but the positional encoding is redundant.

The frequency/phase domain isn't really important; each pixel in the spectral image is to just be interpreted as a different hash of the entire input featuremap.

The 1x1 filter doesn't actually do anything. Perhaps the real power just comes from applying BN and relu on the spectral image before applying the iFFT. Or, perhaps its inappropriate to perform the BN/relu since it limits what the post-iFFT transform image looks like.

Great idea！I agree with you. Maybe I can do some experiments.

Ellohiye · 2022-11-12T07:46:33Z

Interesting, I would expect positional encoding (possibly a different encoding than a simple linear mesh) would have helped.
So, this suggests a few possible outcomes (1x1 filter/conv here will always refer to the conv in the frequency domain inside the Spectral Transform block):

The 1x1 filter doesn't take frequency into account/is frequency agnostic (this also applies to the original FFC paper).

Some sort of spatial information is latently encoded in the featuremaps. In this case, the 1x1 convolution takes frequency into account, but the positional encoding is redundant.

The frequency/phase domain isn't really important; each pixel in the spectral image is to just be interpreted as a different hash of the entire input featuremap.

The 1x1 filter doesn't actually do anything. Perhaps the real power just comes from applying BN and relu on the spectral image before applying the iFFT. Or, perhaps its inappropriate to perform the BN/relu since it limits what the post-iFFT transform image looks like.

Great idea！I agree with you. Maybe I can do some experiments.

Hi, have you done an experiment? What was the result?

senya-ashukha closed this as completed Nov 18, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Spectral Positional Encoding #45

Spectral Positional Encoding #45

BrianPugh commented Nov 10, 2021

windj007 commented Nov 10, 2021

BrianPugh commented Nov 10, 2021

Marcelo5444 commented Mar 31, 2022

BrianPugh commented Mar 31, 2022

Mengmengbai commented Nov 1, 2022

Ellohiye commented Nov 12, 2022

Spectral Positional Encoding #45

Spectral Positional Encoding #45

Comments

BrianPugh commented Nov 10, 2021

windj007 commented Nov 10, 2021

BrianPugh commented Nov 10, 2021

Marcelo5444 commented Mar 31, 2022

BrianPugh commented Mar 31, 2022

Mengmengbai commented Nov 1, 2022

Ellohiye commented Nov 12, 2022