You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In the code for many of the decoder models, you have self.agvpool = nn.AdaptiveAvgPool2d((1,1)) (for example, in the builder/models/detector_models/resnet_dilation_lstm.py file at Line 119), which, if I understand it correctly, averages out the output from the CNN module at channel and time dimension so that the output is 1-by-1 at those two dimensions. I can understand that for the channel dimension, but not for the time-series dimension. As this output will then be sent to the LSTM module, the whole point of which is to process time-series signals. If it is of length 1 at time dimension, then why does it need an LSTM? Am I missing anything?
The text was updated successfully, but these errors were encountered:
In the code for many of the decoder models, you have
self.agvpool = nn.AdaptiveAvgPool2d((1,1))
(for example, in thebuilder/models/detector_models/resnet_dilation_lstm.py
file at Line 119), which, if I understand it correctly, averages out the output from the CNN module at channel and time dimension so that the output is 1-by-1 at those two dimensions. I can understand that for the channel dimension, but not for the time-series dimension. As this output will then be sent to the LSTM module, the whole point of which is to process time-series signals. If it is of length 1 at time dimension, then why does it need an LSTM? Am I missing anything?The text was updated successfully, but these errors were encountered: