Support for Phi-3 models #58

Closed
retteghy opened this issue Apr 23, 2024 · 7 comments

Comments

@retteghy

See Hugging Face for the models.

@guinmoon
Owner

Hi. Works normally with this template:


<|user|>
{{prompt}}<|end|>
<|assistant|>

And the BOS option enabled.
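
For anyone unsure how the template is applied: below is a minimal Swift sketch (not LLMFarm's actual code; the function name is made up for illustration) of how the {{prompt}} placeholder expands into the string that gets tokenized.

```swift
import Foundation

// Hypothetical illustration of expanding the Phi-3 chat template above.
// The BOS token (<s>) is normally prepended by the tokenizer when the BOS
// option is enabled, so it does not appear in the template text itself.
func renderPhi3Prompt(userMessage: String) -> String {
    let template = """
    <|user|>
    {{prompt}}<|end|>
    <|assistant|>
    """
    return template.replacingOccurrences(of: "{{prompt}}", with: userMessage)
}

print(renderPhi3Prompt(userMessage: "What is a GGUF file?"))
// <|user|>
// What is a GGUF file?<|end|>
// <|assistant|>
```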

@paulilioaica

Hi. How can I make it generate until EOS? If I select the option, the app crashes.

@retteghy
Author

Hi. Works normally with this template:


<|user|>
{{prompt}}<|end|>
<|assistant|>

And the BOS option enabled.

BOS is enabled and I have set that prompt, but I am getting an error as a reply to every message:
Load Model Error: [Error]
modelLoad Error
Load Model Error: [Done]

@jekriske-lilly

@guinmoon, when you say "works normally", are you referring to the development version or the version in the App Store?

The stable version from the App Store isn't honoring the end token, and the app crashes if you try enabling EOS.

@guinmoon
Owner

development version

@Cimplex

Cimplex commented Apr 24, 2024

Hi. Works normally with this template:


<|user|>
{{prompt}}<|end|>
<|assistant|>

And the BOS option enabled.

BOS is enabled and I have set that prompt, but I am getting an error as a reply to every message:
Load Model Error: [Error]
modelLoad Error
Load Model Error: [Done]

In the TestFlight version I’m using ‘Phi-3-mini-4k-instruct-q4.gguf’

When setting up, I used the “Phi 2” setting template and then wrote the recommended prompt. On my iPhone 14 Pro I’m getting around 2-5 tokens per second.

Sometimes the <|end|> tag isn’t handled correctly, and it just skips over it and starts a new answer.
[screenshot: IMG_8352]
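
That behavior looks like the <|end|> tag not being treated as a stop sequence. Purely as an illustration (a hypothetical helper, not LLMFarm's code), a client could trim the streamed output at the first occurrence of the tag instead of letting the model start a new answer:

```swift
import Foundation

// Hypothetical helper: cut generated text at the first <|end|> stop tag.
func truncateAtStopTag(_ generated: String, stopTag: String = "<|end|>") -> String {
    guard let range = generated.range(of: stopTag) else { return generated }
    return String(generated[..<range.lowerBound])
}

let raw = "Paris is the capital of France.<|end|><|user|>Next question"
print(truncateAtStopTag(raw))  // "Paris is the capital of France."
```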

@savkinavmono

savkinavmono commented Apr 25, 2024

Make sure Metal=on, BOS=on, EOS=off, and try setting context size=1024. I got 8-9 tokens/sec.

Officially, Phi-3 is only supported starting with llama.cpp release b2717. The latest LLMFarm commit uses b2692, and the TestFlight version uses b2135, which officially supports only Phi-2.
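
As a quick summary of those suggested settings (illustrative names only, not LLMFarm's actual settings model):

```swift
// Illustrative only: a hypothetical struct mirroring the suggested settings.
struct Phi3ChatSettings {
    var useMetal = true     // Metal = on
    var addBOS = true       // BOS = on
    var stopAtEOS = false   // EOS = off (enabling it crashes the stable build)
    var contextSize = 1024  // smaller context helps speed on-device
}
```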

guinmoon closed this as completed on May 6, 2024.