Add support codegen2 #209
Conversation
Oh this is wonderful! I didn't realize CodeGen2 was still a standard GPT-J model! I'll try to test this out as soon as possible and get it merged :) |
Can it be merged? :) |
Great job. Has anyone tried CodeGen-2? Is it worth upgrading? |
Any update? I would really like to try this out. |
Also interested: has anyone tried CodeGen-2, and is it worth upgrading? |
Is it possible to test this PR by pulling the branch, following the build steps, and editing the setup.sh to add the codegen2.5 model as an option? |
CodeGen 2.5 is based on LLama architecture, no longer on CodeGen architecture. |
@michaelfeil Hi, I read the CodeGen 2.5 blog post, and Salesforce does indeed serve it and evaluate its latency on the NVIDIA Triton server. Do you know how to serve CodeGen 2.5 with Triton? I suspect there are other ways to support CodeGen-based models besides converting them to GPT-J. |
I would look for tutorials on how to run llama-2-7b on Triton, and start from there. |
Closed in favor of #230 |
1. General Description
This PR adds support for converting CodeGen-2 models to the GPT-J format.
It does not modify the functionality of the existing converter. The PR looks quite small, but it took hours of debugging to figure out that the CodeGen2 architecture is fully compatible with GPT-J for the large variants (7B and 16B).
The smaller variants (1B and 3.7B) were trained with a different TPU sharding configuration and therefore require a different permutation order when splitting the weights.
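The core of such a conversion is splitting CodeGen's fused qkv projection into the separate q, k, v matrices that GPT-J expects. A minimal sketch of that split is below; it is not the PR's actual code. The `n_shards` parameter is a hypothetical stand-in for the TPU sharding factor mentioned above, which is why the correct value differs between the small and large CodeGen2 variants.

```python
import numpy as np

def split_fused_qkv(qkv_weight: np.ndarray, n_shards: int = 4):
    """Split a CodeGen-style fused qkv weight of shape (3*hidden, hidden)
    into separate q, k, v matrices of shape (hidden, hidden) each.

    Assumes each of the n_shards TPU shards stored its own contiguous
    q, k, v row blocks back to back; n_shards must divide hidden.
    """
    hidden = qkv_weight.shape[-1]
    assert qkv_weight.shape[0] == 3 * hidden
    assert hidden % n_shards == 0
    # View as (shard, {q,k,v}, rows-per-shard, hidden): axis 1 selects
    # which projection a row block belongs to within its shard.
    blocks = qkv_weight.reshape(n_shards, 3, hidden // n_shards, hidden)
    q = blocks[:, 0].reshape(hidden, hidden)
    k = blocks[:, 1].reshape(hidden, hidden)
    v = blocks[:, 2].reshape(hidden, hidden)
    return q, k, v

# Tiny usage example: tag each shard's q/k/v rows with 0/1/2 and
# check that the split recovers them.
hidden, shards = 8, 4
rows_per = hidden // shards
qkv = np.zeros((3 * hidden, hidden))
for g in range(shards):
    base = g * 3 * rows_per
    qkv[base + rows_per: base + 2 * rows_per] = 1.0  # k rows
    qkv[base + 2 * rows_per: base + 3 * rows_per] = 2.0  # v rows
q, k, v = split_fused_qkv(qkv, n_shards=shards)
```

Passing the wrong `n_shards` would still produce tensors of the right shape, just with rows assigned to the wrong projection, which is exactly the kind of silent error that makes this bug hard to debug.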
2. Changes proposed in this PR:
Resolves: #202
3. How to evaluate:
Describe how to evaluate so that the result can be reproduced by the reviewer(s).
1.
Self assessment:
docker-compose build