Skip to content
#

rwkv

Here are 20 public repositories matching this topic...

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

  • Updated Oct 31, 2024
  • Python

Improve this page

Add a description, image, and links to the rwkv topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the rwkv topic, visit your repo's landing page and select "manage topics."

Learn more