Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Messages appear to not be correctly sent to LLM for llama 2 and Mistral/Mixtral + lmstudio / llama.cpp #865

Open
3 tasks done
TyDunn opened this issue Feb 16, 2024 · 1 comment
Assignees
Labels
bug Something isn't working

Comments

@TyDunn
Copy link
Contributor

TyDunn commented Feb 16, 2024

Before submitting your bug report

Relevant environment info

- IDE: VS Code

Description

from this r/LocalLlama thread:

I tried continue on vscode. The problem was that the messages were not sent correctly to the LLM. Other than that, it had some features that copilot didn't. But I still go back to gpt-4 if I can.

thanks! i tried llama and Mistral/Mixtra models. i tried lmstudio and llama.cpp as backends. another problem was that the models would keep writing infinite spaces or letters in comments. this happened only in Continue. I tried adding my own system prompts but that didn't solve it. seems like Continue formats the prompts in a way that doesn't match prompt templates or tokenizers

Yea, in fact, Continue includes 4 demo AIs, and CodeLlama 70B is one of them. It gives good and consistent results. Just not my local model running in llama.cpp. I wonder if Continue has some custom prompt setup for the demo AIs that's hidden from the user config.

To reproduce

No response

Log output

No response

@TyDunn TyDunn added the bug Something isn't working label Feb 16, 2024
@sestinj sestinj self-assigned this Feb 19, 2024
@remixer-dec
Copy link

I faced a similar issue, fixed it by switching provider from llama.cpp to openai, and the problem was that llama.cpp uses completions instead of chat completions and there is no correct format for llama3 yet to handle completions correctly.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants