Which models correspond here #9

iamdrivenbymywonderfullife · 2024-05-23T03:56:50Z

I'm a computer nerd, but very interested in large language models. I'm trying to run this command:
`python3 -m fastchat.model.apply_delta \
--base-model-path /path/to/hf-model/llama-7b \
--target-model-path /path/to/hf-model/character-llm-beethoven-7b \
--delta-path fnlp/character-llm-beethoven-7b-wdiff`

character-llm-beethoven-7b-wdiff is the model that can be downloaded here (it contains a large pytorch_model file).

But which models do llama-7b and character-llm-beethoven-7b correspond to?
Could you be more specific?


choosewhatulike · 2024-05-24T02:36:45Z

Hi, the model was trained on Llama 1, and we release the weight differences (the delta of the weights) instead of the actual weights. To recover the weights for training and inference, first download the Llama 1 base model (e.g. https://huggingface.co/luodian/llama-7b-hf), then run the apply_delta script to recover the model. The arguments are as follows:

--base-model-path (the Llama 1 base model you need to download and save at this dir)
--target-model-path (the output dir for the recovered model)
--delta-path (the character-llm wdiff dir, which you can download from our repo, e.g. fnlp/character-llm-beethoven-7b-wdiff)
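
For intuition, here is a minimal sketch of what "applying a delta" means, assuming the delta is simply the per-parameter difference between the fine-tuned weights and the base weights. It is a toy illustration on plain Python lists, not the actual fastchat.model.apply_delta code; the function names `make_delta_toy` and `apply_delta_toy` and the parameter names are hypothetical.

```python
# Toy illustration only: "state dicts" here map parameter names to lists of floats.
# Assumption: delta = fine-tuned weight - base weight, for every parameter.

def make_delta_toy(base, finetuned):
    """Compute the weight differences that would be released instead of the full weights."""
    return {
        name: [f - b for b, f in zip(base[name], finetuned[name])]
        for name in base
    }

def apply_delta_toy(base, delta):
    """Recover the fine-tuned weights: target = base + delta, parameter by parameter."""
    mismatched = set(base) ^ set(delta)
    if mismatched:
        # The base model and the delta must describe the same set of parameters.
        raise KeyError(f"parameter names differ: {sorted(mismatched)}")
    return {
        name: [b + d for b, d in zip(base[name], delta[name])]
        for name in base
    }

# Hypothetical tiny "models" with two parameters each.
base = {"layer.weight": [0.5, -1.0, 2.0], "layer.bias": [0.0, 0.25]}
finetuned = {"layer.weight": [0.75, -1.5, 2.0], "layer.bias": [0.5, 0.25]}

delta = make_delta_toy(base, finetuned)   # what gets published (the "wdiff")
recovered = apply_delta_toy(base, delta)  # what --target-model-path would hold
print(recovered == finetuned)
```

In these terms, --base-model-path plays the role of `base`, --delta-path the role of `delta`, and --target-model-path is where `recovered` is written. The base model has to match the one the delta was computed against, otherwise the sum does not reproduce the fine-tuned weights.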

PS: We use Llama 1 because, at that time, there were not many choices of good open-source LLMs for fine-tuning. You can always switch to a more powerful LLM (e.g. Llama 3) to train a better character-llm.

choosewhatulike closed this as completed Jun 13, 2024
