Closed
Labels
bug-unconfirmed, medium severity (Used to report medium severity bugs in llama.cpp, e.g. Malfunctioning Features but still useable), stale
Description
What happened?
I was unable to merge the split GGUF model files back into a single file with the following command:
./llama-gguf-split --merge rubra_q4-0000*-of-00006.gguf rubra-q4.gguf
It failed with this error:
gguf_merge: rubra_q4-00001-of-00006.gguf -> rubra_q4-00002-of-00006.gguf
gguf_merge: reading metadata rubra_q4-00001-of-00006.gguf done
gguf_merge: reading metadata rubra_q4-00002-of-00006.gguf ...gguf_init_from_file: invalid magic characters 'U'
gguf_merge: failed to load input GGUF from rubra_q4-00001-of-00006.gguf
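The first log line (`rubra_q4-00001-of-00006.gguf -> rubra_q4-00002-of-00006.gguf`) suggests that the second split was taken as the output path. That would be consistent with the shell expanding the `0000*` glob into six file names before the program starts, so the command receives eight arguments rather than three. A minimal sketch of that expansion, using empty placeholder files in a temporary directory instead of real GGUF data:

```shell
# Sketch: show what the glob expands to before any program sees it.
# The files created here are empty placeholders, not real GGUF splits.
tmpdir=$(mktemp -d)
cd "$tmpdir" || exit 1
for i in 1 2 3 4 5 6; do
  : > "rubra_q4-0000${i}-of-00006.gguf"
done

# Load the same argument list the merge command would receive.
set -- --merge rubra_q4-0000*-of-00006.gguf rubra-q4.gguf
argc=$#
first_positional=$2    # what the tool would read as the input file
second_positional=$3   # what the tool would read as the output file

echo "argument count: $argc"
printf '  %s\n' "$@"
```

Here the second positional argument is `rubra_q4-00002-of-00006.gguf`, which matches the `->` target in the log. If this is the cause, naming only the first split as input (`./llama-gguf-split --merge rubra_q4-00001-of-00006.gguf rubra-q4.gguf`) would avoid the expansion. That assumes the tool locates the remaining splits on its own, which should be confirmed against its usage text.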
Here is how I split the model, which worked:
./llama-gguf-split --split ./rubra-meta-llama-3-70b-instruct.Q4_K_M.gguf ./rubra_q4
n_split: 6
split 00001: n_tensors = 128, total_size = 8030M
split 00002: n_tensors = 128, total_size = 7326M
split 00003: n_tensors = 128, total_size = 7193M
split 00004: n_tensors = 128, total_size = 7044M
split 00005: n_tensors = 128, total_size = 7167M
split 00006: n_tensors = 83, total_size = 5758M
Writing file ./rubra_q4-00001-of-00006.gguf ... done
Writing file ./rubra_q4-00002-of-00006.gguf ... done
Writing file ./rubra_q4-00003-of-00006.gguf ... done
Writing file ./rubra_q4-00004-of-00006.gguf ... done
Writing file ./rubra_q4-00005-of-00006.gguf ... done
Writing file ./rubra_q4-00006-of-00006.gguf ... done
gguf_split: 6 gguf split written with a total of 723 tensors.
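Since the merge error mentions invalid magic characters, it may be worth checking whether each split still begins with the 4-byte `GGUF` magic after the failed merge attempt. The helper below is a hypothetical sketch (the name `check_gguf_magic` is not part of llama.cpp) and assumes a `head` that supports `-c`:

```shell
# Hypothetical helper: report whether each file starts with the ASCII magic "GGUF".
check_gguf_magic() {
  for f in "$@"; do
    # Read only the first four bytes of the file.
    magic=$(head -c 4 "$f" 2>/dev/null)
    if [ "$magic" = "GGUF" ]; then
      echo "ok  $f"
    else
      echo "BAD $f"
    fi
  done
}
```

It could be run as `check_gguf_magic rubra_q4-0000*-of-00006.gguf`. A split reported as `BAD` may have been overwritten and would need to be regenerated from the original model file.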
Am I missing something here?
Name and Version
./llama-cli --version
version: 3285 (a27152b)
built with cc (Debian 12.2.0-14) 12.2.0 for x86_64-linux-gnu
What operating system are you seeing the problem on?
Linux
Relevant log output
No response