feat: add rwkv support #158

mudler · 2023-05-02T22:56:58Z

This PR adds initial support to LocalAI for https://github.com/BlinkDL/RWKV-LM with https://github.com/saharNooby/rwkv.cpp and https://github.com/donomii/go-rwkv.cpp. (acknowledgement goes to @BlinkDL, @saharNooby and @donomii for their respective projects ❤️ , and of course @ggerganov for ggml ) . RWKV's models are extremely fast on CPU!

The Makefile points to my fork just waiting for donomii/go-rwkv.cpp#1 to be merged.

It supports fully the endpoint already exposed, and also token stream. It also updates the README with relevant projects involved.

In a glance you can drop the ggml-quantized model in the models folder with a tokenizer file next to it:

36464540 -rw-r--r--  1 mudler mudler 1.2G May  3 10:51 rwkv_small
36464543 -rw-r--r--  1 mudler mudler 2.4M May  3 10:51 rwkv_small.tokenizer.json

and start to use it with the API. It is strongly adviced to configure it with a yaml file, as models needs specific prompts.

Signed-off-by: mudler <mudler@mocaccino.org>

mudler · 2023-05-03T09:43:19Z

I'll have a round-up of PRs to update docs until we consolidate features on master, and then will cut a new release

mudler force-pushed the rwkv branch from 5d8ca06 to 10607b8 Compare May 2, 2023 23:00

feat: add rwkv support

e74f60b

Signed-off-by: mudler <mudler@mocaccino.org>

mudler force-pushed the rwkv branch from 10607b8 to e74f60b Compare May 3, 2023 07:17

mudler added 6 commits May 3, 2023 09:49

adapt tests

38d60f5

feat: support streams in rwkv

51d82e7

add(makefile): add go-rwkv to the clean target

83fc19f

makefile: reorder

2f07d62

fixup: make tokenizer suffix a constant

59c83dd

readme: update

1052e0f

mudler marked this pull request as ready for review May 3, 2023 09:17

mudler mentioned this pull request May 3, 2023

Add LocalAI in the Community projects README.md BlinkDL/RWKV-LM#102

Open

mudler merged commit 751b7ec into master May 3, 2023

mudler deleted the rwkv branch May 3, 2023 09:45

Provide feedback