core: add a `no_collect: bool = False` param to `RunnableLambda`, enabling per-chunk processing instead of processing the whole collected input
#21413
+187 −141
This PR introduces a new param `no_collect: bool` to `RunnableLambda.__init__`. This flag allows the `_transform` and `_atransform` methods to skip collecting the input before processing, and instead process each individual chunk via `self.func`/`self.afunc`. This is particularly useful when you want to keep the input stream flowing as it gets transformed. One specific problem this implementation has solved for me is having a streamed conversation chain while still being able to save the final chain output in context memory. This is a problem some other people have encountered in issue #11945.

Example usage:
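A rough, self-contained sketch of the proposed semantics (plain Python, not the actual LangChain internals; the function names `transform_collect` and `transform_no_collect` are illustrative only):

```python
from typing import Callable, Iterator


def transform_collect(chunks: Iterator[str], func: Callable[[str], str]) -> Iterator[str]:
    # Default behavior (no_collect=False): collect the whole input
    # stream first, then call func once on the joined result.
    yield func("".join(chunks))


def transform_no_collect(chunks: Iterator[str], func: Callable[[str], str]) -> Iterator[str]:
    # Proposed behavior (no_collect=True): apply func to each chunk
    # as it arrives, so the stream keeps flowing downstream.
    for chunk in chunks:
        yield func(chunk)


stream = ["Hello", ", ", "world"]
print(list(transform_collect(iter(stream), str.upper)))     # ['HELLO, WORLD']
print(list(transform_no_collect(iter(stream), str.upper)))  # ['HELLO', ', ', 'WORLD']
```

With `no_collect=True`, downstream consumers see transformed chunks as soon as each one is produced, instead of waiting for the entire input to be collected.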
twitter: @rhighs_