Skip to content

Bug: Error when trying to use ./llama-gguf-split --merge to merge split model gguf files back #8264

@tybalex

Description

@tybalex

What happened?

I was unable to merge split gguf model files back with the following command:

./llama-gguf-split --merge rubra_q4-0000*-of-00006.gguf rubra-q4.gguf

got error:

gguf_merge: rubra_q4-00001-of-00006.gguf -> rubra_q4-00002-of-00006.gguf
gguf_merge: reading metadata rubra_q4-00001-of-00006.gguf done
gguf_merge: reading metadata rubra_q4-00002-of-00006.gguf ...gguf_init_from_file: invalid magic characters 'U'

gguf_merge:  failed to load input GGUF from rubra_q4-00001-of-00006.gguf

and here is how I split the model, this worked:

./llama-gguf-split --split ./rubra-meta-llama-3-70b-instruct.Q4_K_M.gguf ./rubra_q4                        
n_split: 6                                                                                                                                                                                                               
split 00001: n_tensors = 128, total_size = 8030M                                                                                                                                                                         
split 00002: n_tensors = 128, total_size = 7326M                                                                                                                                                                         
split 00003: n_tensors = 128, total_size = 7193M                                                                                                                                                                         
split 00004: n_tensors = 128, total_size = 7044M                                                                                                                                                                         
split 00005: n_tensors = 128, total_size = 7167M                                                                                                                                                                         
split 00006: n_tensors = 83, total_size = 5758M                                                                                                                                                                          
Writing file ./rubra_q4-00001-of-00006.gguf ... done                                                                                                                                                                     
Writing file ./rubra_q4-00002-of-00006.gguf ... done                                                                                                                                                                     
Writing file ./rubra_q4-00003-of-00006.gguf ... done                                                                                                                                                                     
Writing file ./rubra_q4-00004-of-00006.gguf ... done                                                                                                                                                                     
Writing file ./rubra_q4-00005-of-00006.gguf ... done                                                                                                                                                                     
Writing file ./rubra_q4-00006-of-00006.gguf ... done                                                                                                                                                                     
gguf_split: 6 gguf split written with a total of 723 tensors.  

Am I missing something here?

Name and Version

./llama-cli --version
version: 3285 (a27152b)
built with cc (Debian 12.2.0-14) 12.2.0 for x86_64-linux-gnu

What operating system are you seeing the problem on?

Linux

Relevant log output

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    bug-unconfirmedmedium severityUsed to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)stale

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions