Enhancing quality - Recovery settings #89

As mentioned in the paper, key concepts might get omitted or corrupted by the compression, to the point that GPT can't process the compressed prompt.

You also mention that there is an approach to mitigate this issue; could you share details on the corresponding configuration options in the Python implementation?

To illustrate, I've measured how GPT confidence degrades under compression on the _qasper_e_ subset of the LongBench benchmark; the results are in the attached image.

![fig_scatter_plots_pcompr_confidence](https://github.com/microsoft/LLMLingua/assets/1834666/c7add3b3-daa8-4296-be0f-407bf0448886)


Share of wrong answers or "no answer possible" responses:

* Regular GPT-4, i.e. without prompt compression: 45.36% (GPT-4 seems to "give up" frequently on longer queries)
* Prompt compressed by LLMLingua, `target_token=200`: 63.93%
* Prompt compressed by LLMLingua, `target_token=400`: 60.66%
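For context, here is a minimal sketch of how a sweep over the two token budgets above could be set up. It assumes the `compress_prompt(context, question=..., target_token=...)` call and the `compressed_prompt` result key as documented in the LLMLingua README. The helper `sweep_target_tokens` and the `TruncatingStub` class are hypothetical names introduced only so the snippet runs without downloading a model; they are not part of the library and not necessarily how the numbers above were produced.

```python
from typing import Any, Dict, Iterable, List


def sweep_target_tokens(
    compressor: Any,
    context: List[str],
    question: str,
    budgets: Iterable[int],
) -> Dict[int, str]:
    """Compress the same context once per token budget.

    `compressor` is assumed to expose `compress_prompt` the way
    llmlingua.PromptCompressor does (assumption based on the README).
    """
    results: Dict[int, str] = {}
    for budget in budgets:
        out = compressor.compress_prompt(
            context,
            question=question,
            target_token=budget,
        )
        results[budget] = out["compressed_prompt"]
    return results


class TruncatingStub:
    """Hypothetical stand-in for a real compressor.

    It simply keeps the first `target_token` whitespace-separated words,
    which is NOT how LLMLingua selects tokens.
    """

    def compress_prompt(
        self,
        context: List[str],
        question: str = "",
        target_token: int = -1,
        **kwargs: Any,
    ) -> Dict[str, Any]:
        words = " ".join(context).split()
        kept = words if target_token < 0 else words[:target_token]
        return {"compressed_prompt": " ".join(kept)}


if __name__ == "__main__":
    # With the real library, the stub would be replaced by something like:
    #   from llmlingua import PromptCompressor
    #   compressor = PromptCompressor()
    compressor = TruncatingStub()
    paper = ["Long paper text goes here and would normally span many tokens."]
    for budget, prompt in sweep_target_tokens(
        compressor, paper, "What dataset is used?", [200, 400]
    ).items():
        print(budget, len(prompt.split()), "words kept")
```

Keeping the compressor behind a plain argument makes it easy to compare the uncompressed baseline and each budget on identical questions, which is what the percentages above rely on.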

