
Text Compression Transform #2225

Merged
merged 43 commits into main on May 6, 2024

Conversation

WaelKarkoub
Collaborator

@WaelKarkoub WaelKarkoub commented Mar 31, 2024

Why are these changes needed?

This PR introduces text compression by leveraging the LLMLingua library. This addition enhances processing efficiency and response speed by reducing token usage in large language models.
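The idea described above — a pluggable compressor applied to messages before they reach the LLM — can be sketched in plain Python. This is a hypothetical, self-contained illustration, not the PR's actual implementation: the `TextCompressor` protocol shape, the `TruncatingCompressor` stand-in, and `compress_messages` are assumed names for illustration (a real compressor would call LLMLingua's model-based compression instead of truncating).

```python
from typing import Any, Dict, List, Protocol


class TextCompressor(Protocol):
    """Anything that can compress a piece of text and report the result."""

    def compress_text(self, text: str, **compression_params: Any) -> Dict[str, Any]:
        ...


class TruncatingCompressor:
    """Stand-in compressor: keeps the first `max_chars` characters.

    A real implementation would delegate to LLMLingua's model-based
    prompt compression rather than naive truncation.
    """

    def __init__(self, max_chars: int = 100):
        self.max_chars = max_chars

    def compress_text(self, text: str, **compression_params: Any) -> Dict[str, Any]:
        compressed = text[: self.max_chars]
        return {
            "compressed_prompt": compressed,
            "origin_tokens": len(text.split()),
            "compressed_tokens": len(compressed.split()),
        }


def compress_messages(
    messages: List[Dict[str, str]], compressor: TextCompressor
) -> List[Dict[str, str]]:
    """Apply the compressor to each message's text content, mirroring a
    pre-processing transform that shrinks context before an LLM call."""
    out = []
    for msg in messages:
        result = compressor.compress_text(msg["content"])
        out.append({**msg, "content": result["compressed_prompt"]})
    return out
```

The protocol-based design means any compressor (LLMLingua-backed or otherwise) can be swapped in without changing the transform itself.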

NOTE: LLMLingua uses locally hosted models, so caching might be important here.
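Because compression runs a locally hosted model, caching the results avoids repeating expensive calls for identical inputs. A minimal sketch, assuming a cache keyed on a hash of the text plus the compression parameters (so changing either invalidates the entry); `compress_with_cache` and `cache_key` are hypothetical names for illustration:

```python
import hashlib
import json
from typing import Any, Callable, Dict

# In-memory cache; a real setup might persist this to disk.
_cache: Dict[str, Dict[str, Any]] = {}


def cache_key(text: str, **params: Any) -> str:
    """Derive a stable key from the text and compression parameters."""
    payload = json.dumps({"text": text, "params": params}, sort_keys=True)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def compress_with_cache(
    text: str,
    compress_fn: Callable[..., Dict[str, Any]],
    **params: Any,
) -> Dict[str, Any]:
    """Only invoke the (expensive, local-model) compressor on a cache miss."""
    key = cache_key(text, **params)
    if key not in _cache:
        _cache[key] = compress_fn(text, **params)
    return _cache[key]
```

Sorting the JSON keys makes the key insensitive to parameter ordering, and hashing keeps keys short even for very long prompts.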

Future work:

  • Image Compression
  • Video Compression

Related issue number

Closes #2538

Checks

@codecov-commenter

codecov-commenter commented Mar 31, 2024

Codecov Report

Attention: Patch coverage is 25.96154%, with 77 lines in your changes missing coverage. Please review.

Project coverage is 45.11%. Comparing base (ded2d61) to head (ec6fe57).
Report is 35 commits behind head on main.

Files Patch % Lines
...togen/agentchat/contrib/capabilities/transforms.py 20.23% 67 Missing ⚠️
...agentchat/contrib/capabilities/text_compressors.py 50.00% 10 Missing ⚠️
Additional details and impacted files
@@             Coverage Diff             @@
##             main    #2225       +/-   ##
===========================================
+ Coverage   33.33%   45.11%   +11.77%     
===========================================
  Files          83       86        +3     
  Lines        8636     9108      +472     
  Branches     1835     2090      +255     
===========================================
+ Hits         2879     4109     +1230     
+ Misses       5516     4651      -865     
- Partials      241      348      +107     
Flag        Coverage Δ
unittest    12.61% <25.96%> (?)
unittests   44.36% <0.00%> (+11.03%) ⬆️


@sonichi sonichi added this pull request to the merge queue May 6, 2024
Merged via the queue into main with commit 372ac1e May 6, 2024
77 of 91 checks passed
@sonichi sonichi deleted the llm-lingua-transform branch May 6, 2024 14:22
jayralencar pushed a commit to jayralencar/autogen that referenced this pull request May 28, 2024
* adds implementation

* handles optional import

* cleanup

* updates github workflows

* skip test if dependencies not installed

* skip test if dependencies not installed

* use cpu

* skip openai

* unskip openai

* adds protocol

* better docstr

* minor fixes

* updates optional dependencies docs

* wip

* update docstrings

* wip

* adds back llmlingua requirement

* finalized protocol

* improve docstr

* guide complete

* improve docstr

* fix FAQ

* added cache support

* improve cache key

* cache key fix + faq fix

* improve docs

* improve guide

* args -> params

* spelling
Labels
enhancement (New feature or request), long context handling (Compression to handle long context)
Development

Successfully merging this pull request may close these issues.

[Issue]: Warning from the def _num_token_from_messages is verbose and hard to silence
7 participants