Try out METAL and add to compilation instructions

Tip from https://mas.to/@goranmoomin/110820724235904467

>  From my last time trying out llama.cpp & Llama 2, I don't think the text generation should be taking ~20s if you're using the Metal-accelerated implementation.
>
> Sorry if you’ve already tried… but have you tried giving the env vars `CMAKE_ARGS='-DLLAMA_METAL=on' FORCE_CMAKE=1` when installing llama-cpp-python? (ref https://github.com/abetlen/llama-cpp-python )

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Try out METAL and add to compilation instructions #7

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Uh oh!

Try out METAL and add to compilation instructions #7

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions