Relative positonal encoding for continuous positions #161

cswinter · 2022-01-25T02:50:06Z

The current implementation of relative positional encoding (#139) only supports discrete positions (technically, you can give already it continuous positions, which then get truncated by tensor.long). The most basic version of relative positional encoding for vector spaces would apply a function that discretizes all positions, e.g. by dividing by a specified width and then rounding to the nearest integer. There are various extensions to this that might improve performance:

Discretize positions in a way that has a higher resolution for nearby positions and lower resolution for distant positions (only skimmed this very briefly, but https://arxiv.org/pdf/2107.14222.pdf seems to be doing something similar). The idea is that you might care a lot more about small changes in position for nearby entities, but only care about the rough location for more distant entities. Lots of different possibilities for the discretization function.
When an entity position doesn't fall exactly onto one of the discrete locations, interpolate between nearby positions.

This could be prototyped with the Minefield task (#50), on which relative positional encoding should achieve the same performance as translation.

The text was updated successfully, but these errors were encountered:

cswinter · 2022-01-25T02:57:18Z

One alternative to discretizing positions to a square grid would be to go with a radial grid.

cswinter · 2022-04-09T16:08:47Z

Interpolation implemented in #203 and appears to work quite well.

cswinter added the research label Jan 25, 2022

cswinter mentioned this issue Mar 14, 2022

Initial release #192

Closed

16 tasks

cswinter closed this as completed Apr 9, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Relative positonal encoding for continuous positions #161

Relative positonal encoding for continuous positions #161

cswinter commented Jan 25, 2022

cswinter commented Jan 25, 2022

cswinter commented Apr 9, 2022

Relative positonal encoding for continuous positions #161

Relative positonal encoding for continuous positions #161

Comments

cswinter commented Jan 25, 2022

cswinter commented Jan 25, 2022

cswinter commented Apr 9, 2022