RWKV, in easy-to-read code
Commented code you can read to understand how the newer versions of RWKV work.
Disclaimer: This code was designed for expository purposes, not for training or inference. It has not been thoroughly tested, may contain mistakes, and purposely trades optimality for readability.