Closed
Description
Let me first congratulate everyone working on this for:
- Python bindings for llama.cpp
- Making them compatible with openai's api
- Superb documentation!
I was wondering if anyone could help me get this working with BLAS. Right now, when the model loads, I see `BLAS = 0`.
I've been using kobold.cpp, and they have a BLAS flag at compile time which enables BLAS. It cuts down the prompt loading time by 3-4X. This is a major factor in handling longer prompts and chat-style messages.
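A sketch of one way this is commonly done: llama.cpp exposes BLAS support through CMake options, and llama-cpp-python lets you pass those through the `CMAKE_ARGS` environment variable at install time. The flag names below assume an OpenBLAS backend; adjust the vendor (or the option names, which have changed across llama.cpp versions) for your setup.

```shell
# Reinstall llama-cpp-python from source with OpenBLAS enabled.
# -DLLAMA_BLAS=ON / -DLLAMA_BLAS_VENDOR=OpenBLAS are llama.cpp CMake options;
# --force-reinstall and --no-cache-dir force a rebuild instead of reusing a wheel.
CMAKE_ARGS="-DLLAMA_BLAS=ON -DLLAMA_BLAS_VENDOR=OpenBLAS" \
  pip install llama-cpp-python --force-reinstall --upgrade --no-cache-dir
```

If the backend is picked up, the model-load log should then report `BLAS = 1` instead of `BLAS = 0`.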
P.S. - I was also wondering what the difference is between `create_embedding(input)` and `embed(input)`?