Add KleidiAI example for llama.cpp #135

gmiodice · 2024-06-14T14:25:21Z

Add patch to apply on top of llama.cpp to enable the KleidiAI Int4 matmul micro-kernels
llama.cpp base hash: 6fcd1331efbfbb89c8c96eba2321bb7b4d0c40e4

Signed-off-by: Gian Marco Iodice <gianmarco.iodice@arm.com>

kshitij-sisodia-arm · 2024-06-14T16:29:12Z

kleidiai-examples/llama_cpp/0001-Use-KleidiAI-Int4-Matmul-micro-kernels-in-llama.cpp.patch

+         if (t->data == NULL && t->view_src == NULL) {
+             this_size = GGML_PAD(ggml_backend_buft_get_alloc_size(buft, t), alignment);
+#if defined(GGML_USE_KLEIDIAI)
+            // Temporary solution to allocate more memore if needed for packing the weights.


Trivial typo: "memore" should be "memory".
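
For context on the hunk quoted above, here is a minimal sketch of the round-up-to-alignment arithmetic that a padding macro of this kind performs. `PAD_TO` is a stand-in written for this illustration and is assumed to behave like `GGML_PAD` for power-of-two alignments. `padded_alloc_size` and its `extra_packing_bytes` parameter are hypothetical and are not taken from the patch, which may account for the packing space differently.

```c
#include <stddef.h>

/* Stand-in for a pad-to-alignment macro.
 * Assumption: n is a power of two, so the mask trick is valid. */
#define PAD_TO(x, n) (((x) + (n) - 1) & ~((size_t)(n) - 1))

/* Hypothetical helper (not from the patch): bytes to reserve for a tensor
 * when some extra room is wanted for a packed copy of the weights.
 * The sum is rounded up to the next multiple of `alignment`. */
static size_t padded_alloc_size(size_t tensor_bytes,
                                size_t extra_packing_bytes,
                                size_t alignment) {
    return PAD_TO(tensor_bytes + extra_packing_bytes, alignment);
}
```

With a 32-byte alignment, a 100-byte tensor with no extra space rounds up to 128 bytes, and the same tensor with 40 extra bytes rounds up to 160 bytes.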


Add KleidiAI example for llama.cpp

142a96e

- Add patch to apply on top of llama.cpp to enable the KleidiAI Int4 matmul micro-kernels
- llama.cpp base hash: 6fcd1331efbfbb89c8c96eba2321bb7b4d0c40e4

Signed-off-by: Gian Marco Iodice <gianmarco.iodice@arm.com>

kshitij-sisodia-arm reviewed Jun 14, 2024


Add README.md file in the kleidiai example for llama.cpp

e90b253

Signed-off-by: Gian Marco Iodice <gianmarco.iodice@arm.com>

phorsman-arm merged commit 739fdf1 into Arm-Examples:main Jun 17, 2024
