Pinned
Repositories
Showing 10 of 15 repositories
-
- LLMLingua Public Forked from microsoft/LLMLingua
To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
-
-
- bo11-serve Public
- bee11-serve Public
- bee11-dev-docker Public
-
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…