caveman_tokenstransfer.2.0 — a fork that adds an input-side peer (100% local LLMLingua-2) — courtesy ping #454
vfalbor
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hey @JuliusBrussee and team — first, thanks for caveman. The reframing from "be concise" to a dialect was a real insight and the numbers are clean.
After watching the discourse split into "input compression vs output compression," I built and just released caveman_tokenstransfer.2.0 — a fork that keeps caveman exactly as it is and adds two peer skills on the input side:
/tokenstransfer— LLMLingua-2 input compression, 100% local by default (pip install llmlingua torch tiktoken), no API key needed. Same fully-local philosophy caveman has./tokenstranslation— multilingual prompt → English (cheap-tokens language) for non-English UX./caveman-fullstack— runs/cavemanoutput dialect +/tokenstransferinput compression in one shot.It's MIT, it's a real fork (linked to this repo), it ships your installer rebranded (all 30+ agents work). Your output skill is unchanged — credit goes to you.
why a fork instead of a PR
The new pieces add a Python dependency (
llmlingua,torch) and a hosted-fallback service. Both feel like things you'd want to keep out of upstream caveman's "zero deps, pure prompts" surface. So I shipped it next door instead of trying to merge it in.benchmark on your own 10 prompts
For your workload (codegen short-Q long-A) caveman alone is enough. For long-context apps the input side is where the bill is. The two compose cleanly.
Independent input-side head-to-head vs
caveman-compressskill (your rule-based version) on your own benchmark: -52.9% LLMLingua-2 vs -8.6% rules. Reproduction repo: vfalbor/llm-language-token-tax · vs-caveman.what I want from you
caveman-transfer-liteor similar that wrapsllmlinguaas a soft dep inside upstream, happy to PR.Not asking for stars — caveman is already at 64k and crushing. Just doing the polite thing of telling you next door.
Why use many token when few token do trick. 🪨🌳
Beta Was this translation helpful? Give feedback.
All reactions