How to use 1.3B OPT weights in `metaseq-api-local` API? #61

ParadoxZW · 2022-05-08T08:49:56Z

I'm trying to use the 1.3B OPT weights in the metaseq-api-local API to test some inference examples, but I got stuck.

There are two reshard files for the 1.3B OPT model, but CHECKPOINT_LOCAL in constants.py can only point to a single reshard.pt file. What is the correct way to load these two reshard files? (P.S. The comment in constants.py says MODEL_SHARED_FOLDER = "/example/175B/reshard_no_os", but line 49 has CHECKPOINT_FOLDER = os.path.join(MODEL_SHARED_FOLDER, "175B", "reshard_no_os"). Is this also a mistake?)
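
To make the P.S. concrete, here is a minimal sketch of what the two quoted settings combine to, assuming both are used exactly as quoted (the rest of constants.py is not reproduced here). It uses `posixpath.join` instead of `os.path.join` only so the result is the same on every platform.

```python
import posixpath

# Value suggested by the comment in constants.py, as quoted in the P.S.
MODEL_SHARED_FOLDER = "/example/175B/reshard_no_os"

# Same join as the one quoted from line 49 of constants.py.
# posixpath.join is used here (an assumption for portability of this sketch);
# on Linux it behaves the same as os.path.join.
CHECKPOINT_FOLDER = posixpath.join(MODEL_SHARED_FOLDER, "175B", "reshard_no_os")

print(CHECKPOINT_FOLDER)
# prints: /example/175B/reshard_no_os/175B/reshard_no_os
```

If the comment's value is taken literally, the "175B/reshard_no_os" suffix appears twice, which is why either the comment or the join looks wrong.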

soloice · 2022-05-08T09:32:45Z

You can modify the paths manually to load different models. Also, I think this script is related.

ParadoxZW · 2022-05-08T13:43:15Z

@soloice I know that we can modify the path to load different models. My question is that CHECKPOINT_LOCAL can only point to a single checkpoint file, not a list of reshard files.

soloice · 2022-05-08T14:54:18Z

I have the same question. I assume this is not a bug: the only thing missing here is a script to merge all sharded checkpoints.
This should be the one needed. So we can just wait for good news.

ParadoxZW · 2022-05-09T13:14:59Z

I guess there is a way to load the two reshard files directly (and run inference with model parallelism) rather than merging them.
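
Before trying a direct load, it can help to check how many shard files are actually present. This is a rough sketch, assuming the shards sit in one folder, match the hypothetical pattern `reshard*.pt`, and each file is one model-parallel part. `find_shards` and `describe_shards` are illustrative helpers, not part of metaseq, and the real file names may differ.

```python
import glob
import os


def find_shards(folder, pattern="reshard*.pt"):
    """Return the sorted paths in `folder` that match `pattern`.

    Hypothetical helper: the default pattern is an assumption about
    how the shard files are named.
    """
    return sorted(glob.glob(os.path.join(folder, pattern)))


def describe_shards(folder):
    """Summarize the shard files found in `folder`."""
    shards = find_shards(folder)
    return {
        "num_parts": len(shards),
        "files": [os.path.basename(path) for path in shards],
    }
```

For the 1.3B download described in this thread, `describe_shards(folder)["num_parts"]` would be expected to be 2 if both files match the pattern.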

suchenzang · 2022-05-12T05:43:05Z

Consolidating all asks for model loading into #88, which will be released soon from HuggingFace.

We also have #78 and #77 open to fix some of these usability issues on our end.

ParadoxZW added the bug (Something isn't working) label May 8, 2022

suchenzang closed this as completed May 12, 2022
