
mpt : do not duplicate token_embd.weight on disk #5670

Merged
merged 3 commits into master on Feb 22, 2024

Conversation

cebtenzzre
Collaborator

Previous attempt was #3626.

Should be merged after #5650, which will quantize the token_embd tensor with the same type that was previously used for the output tensor.
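
To illustrate what "do not duplicate token_embd.weight on disk" means in practice, here is a minimal sketch (not the actual llama.cpp conversion code; `plan_gguf_tensors` is a hypothetical helper): because MPT ties its output head to the token embeddings, the converter can write `token_embd.weight` once and omit `output.weight` entirely, and the runtime can reuse the embedding tensor for the output head.

```python
import numpy as np

def plan_gguf_tensors(wte: np.ndarray, lm_head: np.ndarray | None) -> dict[str, np.ndarray]:
    """Hypothetical sketch: decide which tensors to write to the GGUF file.

    MPT ties its LM head to the word embeddings, so when there is no
    separate head only token_embd.weight is written; the loader can reuse
    it for the output head instead of reading a second, identical copy.
    """
    tensors = {"token_embd.weight": wte}
    if lm_head is not None and lm_head is not wte:
        # Only models with a genuinely separate output projection get an
        # output.weight tensor on disk.
        tensors["output.weight"] = lm_head
    return tensors

# Toy usage: a tied model writes one tensor, an untied model writes two.
emb = np.zeros((8, 4), dtype=np.float32)
print(sorted(plan_gguf_tensors(emb, None)))        # ['token_embd.weight']
print(sorted(plan_gguf_tensors(emb, emb.copy())))  # ['output.weight', 'token_embd.weight']
```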

cebtenzzre merged commit 15499eb into master on Feb 22, 2024
51 of 58 checks passed
cebtenzzre deleted the ceb/mpt-tied-output branch on February 22, 2024 at 22:05
jordankanter pushed a commit to jordankanter/llama.cpp that referenced this pull request Mar 13, 2024

nviet commented Mar 18, 2024

This change helps reduce the model file size, but it breaks loading some previously converted models. I got a "wrong number of tensors" error message while trying to load this model; up to b2248 it still loaded fine. Do we really need to convert the models again, or is there a better way?
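
One rough way to check whether a particular MPT GGUF file predates this change is to list its tensor names and look for a separate output.weight. A short sketch, assuming the gguf-py package that ships with llama.cpp (the exact GGUFReader usage is an assumption; the file name is hypothetical):

```python
# Sketch: list tensor names in a GGUF file to see whether it still carries a
# duplicated output.weight (assumes llama.cpp's gguf-py package; treat the
# GGUFReader usage as an assumption and check the package for details).
from gguf import GGUFReader

reader = GGUFReader("mpt-7b.Q4_0.gguf")  # hypothetical file name
names = [t.name for t in reader.tensors]

print("token_embd.weight present:", "token_embd.weight" in names)
print("output.weight present:", "output.weight" in names)
# An MPT file converted before this change contains both tensors, while a
# newer conversion carries only token_embd.weight; the extra tensor in old
# files is what trips the loader's tensor-count check described above.
```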
