Visualizing memorization in RNNs
Visualizing gradient magnitudes in context can be a powerful way to see when recurrent units rely on short-term or long-term memorization.
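To make the idea concrete, here is a minimal sketch of the quantity being visualized: the magnitude of the gradient of the final hidden state with respect to each earlier input. This is not the code used in this repository. It assumes a toy scalar tanh RNN, and the names `connectivity`, `w`, and `u` are hypothetical, chosen only for illustration.

```python
import math


def connectivity(xs, w=0.9, u=1.0):
    """Return |d h_T / d x_t| for every timestep t of a toy scalar RNN.

    Assumed model (illustrative only): h_t = tanh(w * h_{t-1} + u * x_t),
    with the hidden state initialised to 0 before the first input.
    """
    # Forward pass: keep every hidden state for the backward pass.
    hs = []
    h = 0.0
    for x in xs:
        h = math.tanh(w * h + u * x)
        hs.append(h)

    # Backward pass: walk from the last step to the first,
    # carrying d h_T / d h_t along the way.
    grads = [0.0] * len(xs)
    dh = 1.0  # d h_T / d h_T
    for t in reversed(range(len(xs))):
        da = dh * (1.0 - hs[t] ** 2)  # back through tanh
        grads[t] = abs(da * u)        # gradient magnitude w.r.t. x_t
        dh = da * w                   # propagate to h_{t-1}
    return grads


if __name__ == "__main__":
    for t, g in enumerate(connectivity([0.5, -0.3, 0.8, 0.1])):
        print(f"t={t}: {g:.4f}")
```

Large values at distant timesteps suggest the unit is drawing on long-term memory, while values concentrated near the final step suggest short-term memory. In this toy model with `|w| < 1` the magnitudes can only shrink going back in time; trained LSTM or GRU units are not constrained that way.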
- Train the models with `make train`. This will take a very long time :)
- Build the data for the article with
- Run the article server with
Alternatively, if you just want to render and show the article: