feature: llm output tokens batching #628

mkopcins · 2025-09-26T15:34:51Z

Description

Added a batching feature to LLMs so that `onTokenCallback` is triggered once per batch of tokens rather than on every token, which reduces the number of re-renders.
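
For illustration only, the batching idea can be sketched in TypeScript as below. This is not the code from this PR (the actual change lives in `LLM.cpp` and `text_token_generator.h`); `createTokenBatcher`, `batchSize`, `push`, and `flush` are hypothetical names, and the sketch assumes a purely count-based batch.

```typescript
// Hypothetical helper, not the library's API: buffers generated tokens and
// invokes the callback once per batch instead of once per token.
type TokenCallback = (text: string) => void;

interface TokenBatcher {
  push(token: string): void; // queue one generated token
  flush(): void; // emit whatever is still buffered (e.g. at end of generation)
}

function createTokenBatcher(
  onTokenCallback: TokenCallback,
  batchSize: number // assumed knob; the real option name may differ
): TokenBatcher {
  if (!Number.isInteger(batchSize) || batchSize < 1) {
    throw new RangeError("batchSize must be a positive integer");
  }

  let buffer: string[] = [];

  const flush = (): void => {
    if (buffer.length === 0) return; // nothing pending, no callback
    const text = buffer.join("");
    buffer = [];
    onTokenCallback(text);
  };

  const push = (token: string): void => {
    buffer.push(token);
    // No token is emitted on its own before the batch is full.
    if (buffer.length >= batchSize) flush();
  };

  return { push, flush };
}
```

With a batch size of 3, four pushed tokens produce one callback after the third token and a second one only when `flush()` is called at the end of generation, so a component that updates state inside the callback re-renders twice instead of four times.
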

Introduces a breaking change?

- [ ] Yes
- [x] No

Type of change

- [ ] Bug fix (change which fixes an issue)
- [x] New feature (change which adds functionality)
- [x] Documentation update (improves or adds clarity to existing documentation)
- [ ] Other (chores, tests, code style improvements etc.)

Tested on

- [x] iOS
- [x] Android

Testing instructions

Screenshots

Related issues

Checklist

- [x] I have performed a self-review of my code
- [x] I have commented my code, particularly in hard-to-understand areas
- [x] I have updated the documentation accordingly
- [x] My changes generate no new warnings

Additional notes

docs/docs/02-hooks/01-natural-language-processing/useLLM.md

docs/docs/03-typescript-api/01-natural-language-processing/LLMModule.md

packages/react-native-executorch/common/rnexecutorch/models/llm/LLM.cpp

packages/react-native-executorch/common/runner/text_token_generator.h

## Description

Added batching feature to llms so that `onTokenCallback` is not triggered on each token, but after every batch to reduce number of rerenders.

### Introduces a breaking change?

- [ ] Yes
- [x] No

### Type of change

- [ ] Bug fix (change which fixes an issue)
- [x] New feature (change which adds functionality)
- [x] Documentation update (improves or adds clarity to existing documentation)
- [ ] Other (chores, tests, code style improvements etc.)

### Tested on

- [x] iOS
- [x] Android

### Testing instructions

### Screenshots

### Related issues

### Checklist

- [x] I have performed a self-review of my code
- [x] I have commented my code, particularly in hard-to-understand areas
- [x] I have updated the documentation accordingly
- [x] My changes generate no new warnings

### Additional notes

---------

Co-authored-by: Mateusz Kopciński <mateusz.kopcinski@swmansnion.com>

mkopcins requested a review from chmjkb September 26, 2025 15:34

Mateusz Kopciński added 5 commits September 26, 2025 17:44

initial draft of token batching

8afb4ee

reused runner.stats for token data

5b015e9

added token batching to llms

5736da8

small refactor, added docs

26c512e

fixed bug where first token was emitted before batch

cd47f96

mkopcins force-pushed the @mkopcins/token-batching branch from 9241815 to cd47f96 Compare September 26, 2025 15:44

mkopcins marked this pull request as ready for review September 26, 2025 15:47

NorbertKlockiewicz requested changes Oct 1, 2025

review changes

85f9328

msluszniak reviewed Oct 1, 2025

packages/react-native-executorch/common/runner/text_token_generator.h (outdated, resolved)

packages/react-native-executorch/common/runner/text_token_generator.h (outdated, resolved)

review changes

a60bb33

NorbertKlockiewicz approved these changes Oct 1, 2025

mkopcins merged commit f80168a into main Oct 1, 2025
3 checks passed

mkopcins deleted the @mkopcins/token-batching branch October 1, 2025 12:04

4 participants
