
feat: port LLMs to C++ #415

Merged
chmjkb merged 34 commits into main from @chmjkb/llm-cpp-port
Jun 30, 2025

Conversation


@chmjkb chmjkb commented Jun 24, 2025

Description

This PR ports the current LLM functionality to C++, getting rid of the Runner within the ExecutorchLib framework. I've also made some changes to the build system.

tokenizers-cpp

  • Previously the library was linked into ExecutorchLib in Xcode via a build script; now it is completely removed from the frameworks.
  • I've prebuilt static libraries from the tokenizers-cpp repo and uploaded them to common/ios/libs/tokenizers-cpp, similarly to the pre-built ExecuTorch binaries.
  • The tokenizers-cpp headers now live at react-native-executorch/third-party/include/tokenizers-cpp/tokenizers_cpp.h.
  • Made some changes to the libs directory structure; please see the podspec for reference.
  • These headers are then included from the llama runner source code.
  • Since tokenizers-cpp for Android comes pre-built with the ExecuTorch aar, I'm not making any changes there. This will need to be updated once we bump the ExecuTorch runtime and can safely get rid of the aar/jitpack setup; we can keep tokenizers-cpp as our submodule and just reference it in Android's CMake.
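
As a rough sketch of what that future Android setup could look like (hypothetical: the submodule path and the assumption that the library exports a `tokenizers_cpp` CMake target are mine, not the current build):

```cmake
# Hypothetical fragment: once the aar/jitpack setup is dropped, the
# tokenizers-cpp submodule could be pulled into the Android build directly.
# Paths and target names here are assumptions.
add_subdirectory(
  ${CMAKE_SOURCE_DIR}/../../third-party/tokenizers-cpp
  ${CMAKE_BINARY_DIR}/tokenizers-cpp
)

target_link_libraries(react-native-executorch PRIVATE tokenizers_cpp)
```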

runner

  • The runner sources and headers were moved from ExecutorchLib to common/runner, similarly to ada, and are compiled on the fly when our library compiles.
  • Given this, I think we will soon be able to get rid of the ET fork and the submodule.
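
To illustrate what "compiled on the fly" means on the iOS side, the podspec can simply glob the shared C++ sources into the pod and vendor the prebuilt tokenizers-cpp archives. This is a hypothetical fragment with assumed paths; see the actual podspec in the repo for the real globs:

```ruby
# Hypothetical podspec fragment; paths are assumptions for illustration only.
s.source_files = "common/**/*.{h,hpp,cpp}"
s.vendored_libraries = "common/ios/libs/tokenizers-cpp/*.a"
s.pod_target_xcconfig = {
  "HEADER_SEARCH_PATHS" => "$(PODS_TARGET_SRCROOT)/third-party/include"
}
```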

Type of change

  • [ ] Bug fix (non-breaking change which fixes an issue)
  • [x] New feature (non-breaking change which adds functionality)
  • [ ] Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • [ ] Documentation update (improves or adds clarity to existing documentation)

Tested on

  • [x] iOS
  • [x] Android

Testing instructions

Screenshots

Related issues

Checklist

  • [ ] I have performed a self-review of my code
  • [ ] I have commented my code, particularly in hard-to-understand areas
  • [ ] I have updated the documentation accordingly
  • [ ] My changes generate no new warnings

Additional notes

@chmjkb chmjkb changed the base branch from main to @chmjkb/text-embeddings-cpp-port June 24, 2025 13:35
@chmjkb chmjkb force-pushed the @chmjkb/text-embeddings-cpp-port branch from 0200048 to 2cfb189 Compare June 25, 2025 08:15
Base automatically changed from @chmjkb/text-embeddings-cpp-port to main June 25, 2025 10:43
@chmjkb chmjkb force-pushed the @chmjkb/llm-cpp-port branch from 46d597e to 45938f3 Compare June 25, 2025 10:55
@chmjkb chmjkb self-assigned this Jun 27, 2025
@chmjkb chmjkb added this to the v0.5.0 milestone Jun 27, 2025
@chmjkb chmjkb linked an issue Jun 27, 2025 that may be closed by this pull request
@chmjkb chmjkb marked this pull request as ready for review June 27, 2025 10:29
@chmjkb chmjkb requested a review from mkopcins June 27, 2025 10:29
Comment thread packages/react-native-executorch/common/rnexecutorch/models/llm/LLM.cpp Outdated
Comment thread packages/react-native-executorch/common/rnexecutorch/models/llm/LLM.cpp Outdated
Comment thread packages/react-native-executorch/common/runner/sampler.h
* SOFTWARE.
*/

// #include <executorch/extension/llm/sampler/sampler.h>
Collaborator


?

Collaborator Author


I'm not exactly sure why that line was there in the first place. A few lines below you can see:

#include "sampler.h"

which includes the same thing. Also, we can't reference extension/llm/sampler/sampler.h since there is no such header in our code, so it wouldn't compile.
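
For context, the runner's sampler is the usual argmax/temperature kind. A minimal self-contained sketch of the greedy path (an illustration only, not the actual common/runner implementation, which also handles temperature and top-p):

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Greedy sampling sketch: return the index of the largest logit.
// This is only the argmax path; real samplers typically also apply
// temperature scaling and top-p filtering before picking a token.
std::size_t greedy_sample(const std::vector<float>& logits) {
    return static_cast<std::size_t>(
        std::max_element(logits.begin(), logits.end()) - logits.begin());
}
```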

@chmjkb chmjkb merged commit 0f4de4e into main Jun 30, 2025
2 checks passed
@chmjkb chmjkb deleted the @chmjkb/llm-cpp-port branch June 30, 2025 13:00
@msluszniak msluszniak mentioned this pull request Jul 14, 2025
mkopcins pushed a commit that referenced this pull request Oct 15, 2025


Development

Successfully merging this pull request may close these issues.

Port LLMs to C++ native code

3 participants