What is the format of weights for the input model to convert.py? #2571

Closed
RonanKMcGovern opened this issue Aug 9, 2023 · 2 comments

@RonanKMcGovern

Here is a comment from the README regarding usage of convert.py:

# obtain the original LLaMA model weights and place them in ./models

However, it's not clear what format those weights need to be in:

  1. In what format do I need the original weights: consolidated.00.pth or .bin?
  2. If .bin, do the weights need to be float32, or can they be bf16?
  3. If .bin, is there a way to handle sharded weights, e.g. Llama 2 7B sharded into a ~10 GB and a ~3 GB file? (A typical sharded layout is sketched after the snippet below.) Can I specify each shard as a command-line argument here:
# convert the 7B model to ggml FP16 format
python3 convert.py models/7B/
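
For context, a Hugging Face-style sharded checkpoint for Llama 2 7B typically looks like the listing below (file names follow the usual transformers convention; the exact shard count and sizes here are illustrative):

models/7B/
├── config.json
├── pytorch_model-00001-of-00002.bin   # ~10 GB shard
├── pytorch_model-00002-of-00002.bin   # ~3 GB shard
├── pytorch_model.bin.index.json       # maps each tensor to its shard
└── tokenizer.model
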
@RonanKMcGovern (Author)

OK, I should have just looked at convert.py in llama.cpp first.

To answer my own questions:

  • convert.py supports input models in many formats (including .pth and .bin)
  • convert.py supports multiple data types, including float32 and bf16
  • convert.py even supports sharded checkpoints and finds the shards automatically (very nice!); see the example invocation after this list
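
As a concrete illustration, a minimal invocation might look like this (a sketch: the --outtype flag reflects convert.py's options around this era, so check python3 convert.py --help on your checkout):

# point convert.py at the model directory; it auto-detects whether the
# weights are consolidated.*.pth or sharded pytorch_model-*.bin files,
# reading all shards without them being listed individually
python3 convert.py models/7B/ --outtype f16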

@Kushagrapuro

Thanks!
