Install & Run

(Long)LLMLingua: Enhancing Large Language Model Inference via Prompt Compression

Install & Run

# install
pip install .
# copy config and edit it.
cp config_template.yaml config.yaml
# run
python main.py

API

path：api/v1/compress_prompt
method：GET
params example:

{
    "ratio": 0.5,
    "context": [""],
    "question": ""
}

response example:

{
    "data": {
        "compressed_prompt": "",
        "origin_tokens": 3701,
        "compressed_tokens": 2778,
        "ratio": "1.3x",
        "saving": ", Saving $0.1 in GPT-4."
    }
}

Example

example_zh

Reference

https://llmlingua.com/

https://github.com/microsoft/LLMLingua

https://zhuanlan.zhihu.com/p/660805821

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
llmlingua_api		llmlingua_api
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
config_template.yaml		config_template.yaml
example_zh.md		example_zh.md
main.py		main.py
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

(Long)LLMLingua: Enhancing Large Language Model Inference via Prompt Compression

Install & Run

API

Example

Reference

About

Releases

Packages

Languages

License

waifuoid/llmlingua-api

Folders and files

Latest commit

History

Repository files navigation

(Long)LLMLingua: Enhancing Large Language Model Inference via Prompt Compression

Install & Run

API

Example

Reference

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages