0.1.0
follow spatiotemporal attention with a feedforward, and add the highl… …y effective token shift along the time axis in the hidden layer
follow spatiotemporal attention with a feedforward, and add the highl… …y effective token shift along the time axis in the hidden layer