-
|
Do we have a bench mark for minimized/summarize context vs not? and how did they do based on the responses? |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 4 replies
-
|
hoping for your response @yvgude |
Beta Was this translation helpful? Give feedback.
-
|
Hi @nylla8444, great follow-up — this is indeed an important question that deserves a thorough answer. Short AnswerNo measurable accuracy loss for typical coding tasks. Here's why: The Theory: Why Compression Doesn't Hurt Coding Accuracylean-ctx uses structural compression, not lossy summarization. What we remove is:
The key insight from the research (see the Lost in the Middle paper): LLMs actually perform worse with more irrelevant context. Excessive context causes attention dilution — the model spreads its attention across irrelevant tokens instead of focusing on what matters. What We Do Have:
|
Beta Was this translation helpful? Give feedback.
Hi @nylla8444, great follow-up — this is indeed an important question that deserves a thorough answer.
Short Answer
No measurable accuracy loss for typical coding tasks. Here's why:
The Theory: Why Compression Doesn't Hurt Coding Accuracy
lean-ctx uses structural compression, not lossy summarization. What we remove is:
import React from 'react'in 50 files is noise after the first occurrencemain.rsand it hasn't changed, a 13-token cache stub is enough