Conversation

@earonesty (Contributor) commented Sep 14, 2023

I've found that without some sort of layer and size estimate, it's very hard to choose the right number of layers to offload.

todo:

  • get a size estimate based on the needed context size!

If you think this should be its own repo, I'm cool with that.
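The context-size-dependent part of the estimate in the todo above is the KV cache, which grows linearly with both layer count and context length. A back-of-envelope sketch of that calculation (my own illustration, not code from this PR; assumes an f16 cache and accounts for grouped-query attention, where the KV width is scaled by `n_head_kv / n_head`):

```python
def kv_cache_bytes(n_layer: int, n_ctx: int, n_embd: int,
                   n_head: int, n_head_kv: int,
                   bytes_per_elt: int = 2) -> int:
    """Rough per-model KV-cache size: one K and one V cache per layer,
    each n_ctx tokens wide, f16 (2 bytes) by default."""
    kv_width = n_embd * n_head_kv // n_head  # narrower under grouped-query attention
    return 2 * n_layer * n_ctx * kv_width * bytes_per_elt

# Llama-2-7B-like shapes: 32 layers, 4096 ctx, 4096 embd, full multi-head attention
print(kv_cache_bytes(32, 4096, 4096, 32, 32))  # → 2147483648, i.e. 2 GiB
```

Dividing the total by `n_layer` gives a per-layer KV cost, which is what you'd add to each layer's weight size when deciding how many layers fit on the GPU.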

@earonesty earonesty changed the title gguf reader layer and size estimates gguf reader for layer and size estimates Sep 14, 2023
@abetlen (Owner) commented Sep 30, 2023

Hey @earonesty this makes sense and I do want to integrate gguf more closely into llama-cpp-python. Is it possible to use the pip published gguf package to reduce the amount of maintenance required when that's updated?

@earonesty (Contributor, Author) commented Sep 30, 2023

> Hey @earonesty this makes sense and I do want to integrate gguf more closely into llama-cpp-python. Is it possible to use the pip published gguf package to reduce the amount of maintenance required when that's updated?

Unfortunately that package has no reader support. I used its source to reverse engineer the format and write the reader! Happy to put it in its own repo, but I don't think the llama.cpp team has plans to maintain the reader.

I can try to submit a PR and see if they like it?
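For a sense of what reverse-engineering the format involves: GGUF files begin with a small fixed header (magic `GGUF`, a uint32 version, then uint64 tensor and metadata-KV counts, all little-endian). A minimal pure-Python sketch of parsing just that header (my own illustration of the published format, not the reader from this PR):

```python
import struct

GGUF_MAGIC = b"GGUF"  # first four bytes of every GGUF file

def read_gguf_header(data: bytes) -> dict:
    """Parse the fixed GGUF header (v2+ layout): magic, version,
    tensor count, and metadata key-value count."""
    if data[:4] != GGUF_MAGIC:
        raise ValueError("not a GGUF file")
    # uint32 version, uint64 tensor_count, uint64 metadata_kv_count,
    # all little-endian per the GGUF spec
    version, tensor_count, kv_count = struct.unpack_from("<IQQ", data, 4)
    return {"version": version, "tensor_count": tensor_count, "kv_count": kv_count}

# Example on a synthetic header (not a real model file):
header = GGUF_MAGIC + struct.pack("<IQQ", 3, 291, 24)
print(read_gguf_header(header))  # → {'version': 3, 'tensor_count': 291, 'kv_count': 24}
```

The layer count and per-tensor sizes needed for offload estimates come from the metadata KV pairs and tensor-info records that follow this header.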

@abetlen abetlen force-pushed the main branch 2 times, most recently from 8c93cf8 to cc0fe43 Compare November 14, 2023 20:24