Skip to content

feat: add LFM2.5-VL-450M quantized model#1066

Merged
NorbertKlockiewicz merged 3 commits intomainfrom
@nk/add-lfm2.5-vl-450m
Apr 9, 2026
Merged

feat: add LFM2.5-VL-450M quantized model#1066
NorbertKlockiewicz merged 3 commits intomainfrom
@nk/add-lfm2.5-vl-450m

Conversation

@NorbertKlockiewicz
Copy link
Copy Markdown
Contributor

@NorbertKlockiewicz NorbertKlockiewicz commented Apr 9, 2026

Description

Adds the LFM2_VL_450M_QUANTIZED model constant for the new 450M quantized LFM2.5-VL variant. Reuses the existing LFM2.5-VL tokenizer and tokenizer config, and registers the model in MODEL_REGISTRY.

Introduces a breaking change?

  • Yes
  • No

Type of change

  • Bug fix (change which fixes an issue)
  • New feature (change which adds functionality)
  • Documentation update (improves or adds clarity to existing documentation)
  • Other (chores, tests, code style improvements etc.)

Tested on

  • iOS
  • Android

Testing instructions

  1. Import LFM2_VL_450M_QUANTIZED from react-native-executorch.
  2. Pass it to useLLM and verify the model downloads and runs inference with image input.

Screenshots

Related issues

Checklist

  • I have performed a self-review of my code
  • I have commented my code, particularly in hard-to-understand areas
  • I have updated the documentation accordingly
  • My changes generate no new warnings

Additional notes

Tokenizer and tokenizer config are shared with LFM2_VL_1_6B_QUANTIZED.


// LFM2.5-VL-1.6B
const LFM2_VL_1_6B_QUANTIZED_MODEL = `${URL_PREFIX}-lfm-2.5/${VERSION_TAG}/lfm2.5-VL-1.6B/quantized/lfm2_5_vl_1_6b_8da4w_xnnpack.pte`;
const LFM2_VL_450M_QUANTIZED_MODEL = `${URL_PREFIX}-lfm2.5-VL-1.6B/${VERSION_TAG}/quantized/lfm2_5_vl_450m_8da4w_xnnpack.pte`;
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lets change it before we merge

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

@msluszniak
Copy link
Copy Markdown
Member

Please also add all analogical changes to this PR: https://github.com/software-mansion/react-native-executorch/pull/1042/changes

Copy link
Copy Markdown
Member

@msluszniak msluszniak left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Reason above

@NorbertKlockiewicz
Copy link
Copy Markdown
Contributor Author

Please also add all analogical changes to this PR: https://github.com/software-mansion/react-native-executorch/pull/1042/changes

We should automate it somehow, it's quite annoying :/

@msluszniak
Copy link
Copy Markdown
Member

Please also add all analogical changes to this PR: https://github.com/software-mansion/react-native-executorch/pull/1042/changes

We should automate it somehow, it's quite annoying :/

Agreed, I'm open to any suggestions ;)

Copy link
Copy Markdown
Member

@msluszniak msluszniak left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We can also add model list as in different demo apps to multimodal LMs, but this can be done in a separate PR

@NorbertKlockiewicz NorbertKlockiewicz merged commit eeaa8a2 into main Apr 9, 2026
5 checks passed
@NorbertKlockiewicz NorbertKlockiewicz deleted the @nk/add-lfm2.5-vl-450m branch April 9, 2026 12:26
mkopcins pushed a commit that referenced this pull request Apr 9, 2026
## Description

Adds the `LFM2_VL_450M_QUANTIZED` model constant for the new 450M
quantized LFM2.5-VL variant. Reuses the existing LFM2.5-VL tokenizer and
tokenizer config, and registers the model in `MODEL_REGISTRY`.

### Introduces a breaking change?

- [ ] Yes
- [x] No

### Type of change

- [ ] Bug fix (change which fixes an issue)
- [x] New feature (change which adds functionality)
- [ ] Documentation update (improves or adds clarity to existing
documentation)
- [ ] Other (chores, tests, code style improvements etc.)

### Tested on

- [ ] iOS
- [ ] Android

### Testing instructions

1. Import `LFM2_VL_450M_QUANTIZED` from `react-native-executorch`.
2. Pass it to `useLLM` and verify the model downloads and runs inference
with image input.

### Screenshots

### Related issues

### Checklist

- [x] I have performed a self-review of my code
- [ ] I have commented my code, particularly in hard-to-understand areas
- [ ] I have updated the documentation accordingly
- [x] My changes generate no new warnings

### Additional notes

Tokenizer and tokenizer config are shared with `LFM2_VL_1_6B_QUANTIZED`.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants