feat: add LFM2.5-VL-450M quantized model #1066
// LFM2.5-VL-1.6B
const LFM2_VL_1_6B_QUANTIZED_MODEL = `${URL_PREFIX}-lfm-2.5/${VERSION_TAG}/lfm2.5-VL-1.6B/quantized/lfm2_5_vl_1_6b_8da4w_xnnpack.pte`;
const LFM2_VL_450M_QUANTIZED_MODEL = `${URL_PREFIX}-lfm2.5-VL-1.6B/${VERSION_TAG}/quantized/lfm2_5_vl_450m_8da4w_xnnpack.pte`;
Let's change it before we merge.
Please also add all analogous changes to this PR: https://github.com/software-mansion/react-native-executorch/pull/1042/changes
We should automate it somehow, it's quite annoying :/
Agreed, I'm open to any suggestions ;)
msluszniak left a comment
We could also add a model list for multimodal LMs, as in the other demo apps, but this can be done in a separate PR.
## Description

Adds the `LFM2_VL_450M_QUANTIZED` model constant for the new 450M quantized LFM2.5-VL variant. Reuses the existing LFM2.5-VL tokenizer and tokenizer config, and registers the model in `MODEL_REGISTRY`.

### Introduces a breaking change?

- [ ] Yes
- [x] No

### Type of change

- [ ] Bug fix (change which fixes an issue)
- [x] New feature (change which adds functionality)
- [ ] Documentation update (improves or adds clarity to existing documentation)
- [ ] Other (chores, tests, code style improvements etc.)

### Tested on

- [ ] iOS
- [ ] Android

### Testing instructions

1. Import `LFM2_VL_450M_QUANTIZED` from `react-native-executorch`.
2. Pass it to `useLLM` and verify the model downloads and runs inference with image input.

### Screenshots

### Related issues

### Checklist

- [x] I have performed a self-review of my code
- [ ] I have commented my code, particularly in hard-to-understand areas
- [ ] I have updated the documentation accordingly
- [x] My changes generate no new warnings

### Additional notes

Tokenizer and tokenizer config are shared with `LFM2_VL_1_6B_QUANTIZED`.
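For readers unfamiliar with the pattern, here is a minimal sketch of how two model constants might share tokenizer files and be registered together. It is not the code from this PR. The URL values, the tokenizer constant names, the `LLMModelEntry` interface and its field names, and the shape of `MODEL_REGISTRY` are all assumptions made for illustration.

```typescript
// Hypothetical placeholders: the real URL_PREFIX and VERSION_TAG live in the library.
const URL_PREFIX = 'https://example.com/models';
const VERSION_TAG = 'v0.0.0';

// Assumed shape of a model entry; the field names are illustrative.
interface LLMModelEntry {
  modelSource: string;
  tokenizerSource: string;
  tokenizerConfigSource: string;
}

// Tokenizer files shared by both LFM2.5-VL variants (hypothetical paths).
const LFM2_VL_TOKENIZER = `${URL_PREFIX}/lfm2.5-VL/${VERSION_TAG}/tokenizer.json`;
const LFM2_VL_TOKENIZER_CONFIG = `${URL_PREFIX}/lfm2.5-VL/${VERSION_TAG}/tokenizer_config.json`;

// Each variant points at its own weights but reuses the same tokenizer files.
const LFM2_VL_1_6B_QUANTIZED: LLMModelEntry = {
  modelSource: `${URL_PREFIX}/lfm2.5-VL/${VERSION_TAG}/lfm2_5_vl_1_6b_8da4w_xnnpack.pte`,
  tokenizerSource: LFM2_VL_TOKENIZER,
  tokenizerConfigSource: LFM2_VL_TOKENIZER_CONFIG,
};

const LFM2_VL_450M_QUANTIZED: LLMModelEntry = {
  modelSource: `${URL_PREFIX}/lfm2.5-VL/${VERSION_TAG}/lfm2_5_vl_450m_8da4w_xnnpack.pte`,
  tokenizerSource: LFM2_VL_TOKENIZER,
  tokenizerConfigSource: LFM2_VL_TOKENIZER_CONFIG,
};

// Assumed registry shape: a plain object keyed by constant name.
const MODEL_REGISTRY = {
  LFM2_VL_1_6B_QUANTIZED,
  LFM2_VL_450M_QUANTIZED,
};
```

Keeping the tokenizer URLs in their own constants means a new variant only has to supply a new `modelSource`, which is the kind of reuse the additional notes describe.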