Skip to content

Conversation

programmylife
Copy link
Contributor

@programmylife programmylife commented Sep 15, 2023

I've updated llm_llama_cpp.py to accept .gguf files instead of .bin files as they are the new standard (per discussion in #10). I updated the README to reflect that change and use .gguf models in place of the .bin files and added an example for the new python model, which may be of particular interest to users.

I removed the instructions for installing via the wheel since that will use the old code which would reject downloads with .gguf. I thought this was a better solution than explaining the difference and potentially having users have an out of date version which won't be able to use new models. I'm happy to discuss alternatives, though.

@simonw
Copy link
Owner

simonw commented Sep 22, 2023

Thanks for this. I'm going to remove the file extension check entirely and merge this - I don't want to get caught out in the future if they change the extension again.

The README updates are excellent.

@simonw simonw merged commit 3657da0 into simonw:main Sep 22, 2023
simonw added a commit that referenced this pull request Sep 22, 2023
simonw added a commit that referenced this pull request Sep 22, 2023
simonw added a commit that referenced this pull request Sep 22, 2023
I'm not confident enough in this yet to take off the alpha/beta labels.

Refs #10, #11, #12, #14, #16
@programmylife programmylife deleted the gguf-file-ext-update branch September 22, 2023 12:05
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants