
add support for OpenELM #63

Merged: 3 commits into ml-explore:main on Apr 30, 2024

Conversation

@smdesai (Contributor) commented Apr 29, 2024:

@awni Here's the PR:

public static let openelm270m4bit = ModelConfiguration(
    id: "mlx-community/OpenELM-270M-Instruct"
) { prompt in
    "\(prompt)"
}
@MatthewWaller commented:

First off, phenomenal work! I tested it, and it seems to be doing completion just fine. Do you know how we should format it for instruction? Is it like the phi model at all, or like the others, such as the model above? I didn't see any special tokens for the instruct version thus far.

@smdesai replied:

Thanks very much, Matthew. I've no idea what the chat template format should be; perhaps someone from Apple can comment.

@MatthewWaller replied:

Ah, looks like I found something helpful here, just not sure how to translate it to code: https://github.com/apple/corenet/blob/main/projects/openelm/instruction_tuning/openelm-instruct.yaml
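
A minimal sketch of how a chat template from that yaml might be wired into the ModelConfiguration prompt closure shown at the top of this PR; the template string below is a placeholder assumption, not the one from the yaml:

public static let openelm270m4bit = ModelConfiguration(
    id: "mlx-community/OpenELM-270M-Instruct"
) { prompt in
    // Placeholder chat-role markers (assumed), not the corenet template
    "<|user|>\n\(prompt)\n<|assistant|>\n"
}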

@smdesai replied:

Oh that looks useful, let me grab the relevant content from the chat template and try it out. Thanks for looking into it.

@smdesai replied:

@MatthewWaller I tried the following template without luck.
"<|system|>\nYou are a helpful assistant<|end|>\n<|user|>(prompt)<|end|>\n<|assistant|>"

@smdesai replied Apr 29, 2024:

Using the default template for the Llama tokenizer, <s>[INST]\(prompt)[/INST] seems to work, but I'll leave it to someone who knows better.
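
Plugged into the same ModelConfiguration closure, that would look roughly like this (a sketch of the attempt described above, not necessarily the code as merged):

public static let openelm270m4bit = ModelConfiguration(
    id: "mlx-community/OpenELM-270M-Instruct"
) { prompt in
    // Llama-style [INST] wrapping that appeared to work in testing
    "<s>[INST]\(prompt)[/INST]"
}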

@MatthewWaller replied:

Yeah, overall this is working well for completion. I think for chat there are some things we can do after this PR: there is a bug in swift-transformers, just fixed in main, pertaining to encoding special tokens. Also, the config isn't set up to recognize tokens like “<|user|>” and such, so we would need to adjust that too eventually.

@davidkoski mentioned this pull request on Apr 30, 2024.
@awni (Member) commented Apr 30, 2024:

@smdesai could you run the swift formatting?

pre-commit run --all-files

@smdesai replied Apr 30, 2024:

@awni It's run.

@davidkoski (Collaborator) left a comment:

Thank you for the contribution!!

@davidkoski merged commit 4d20785 into ml-explore:main on Apr 30, 2024
3 checks passed