About the calculation of overhead. #19

znsoftm · 2023-07-10T02:43:10Z

znsoftm · 2023-07-10T02:44:57Z

or BERT mode, its overhead is calculated as :

model_mem_req += (5 + 16 * n_layer) * 256; // object overhead

Can anyone explain the meaning 5 is extra tensors, 16 means each layer has 16 tensor, and 256 for what?

Is it the sizeof ggml_tensor struct ? The actual size is 208 bytes, so 256 is rounded size?

skeskinen · 2023-07-10T08:50:20Z

My memory is a little hazy on this subject.
Like you said 5 should be the extra model wise tensors not tied to any layer. I think I tried smaller number than 256 for the size but it crashed with OOM.
Probably the real size of C structs is always rounded up to the next power of 2?

znsoftm · 2023-07-10T23:35:58Z

thanks for your answer:)

znsoftm · 2023-07-11T22:35:20Z

I have tested the latest ggml, should alter the 256 to 512. Do not understand why:(

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About the calculation of overhead. #19

About the calculation of overhead. #19

znsoftm commented Jul 10, 2023

znsoftm commented Jul 10, 2023

skeskinen commented Jul 10, 2023

znsoftm commented Jul 10, 2023

znsoftm commented Jul 11, 2023

About the calculation of overhead. #19

About the calculation of overhead. #19

Comments

znsoftm commented Jul 10, 2023

znsoftm commented Jul 10, 2023

skeskinen commented Jul 10, 2023

znsoftm commented Jul 10, 2023

znsoftm commented Jul 11, 2023