
Added support for Jais models #3183

Merged
51 commits merged into vllm-project:main on Mar 21, 2024

Conversation

grandiose-pizza
Contributor

@grandiose-pizza grandiose-pizza commented Mar 4, 2024

Jais models are pretrained on a curated Arabic and English text dataset and fine-tuned on prompt-response pair datasets, respectively. They are trained from scratch by Core42 in partnership with MBZUAI and Cerebras Systems on their Condor Galaxy supercomputer. The model architecture is a transformer-based decoder-only (GPT-3) architecture and uses the SwiGLU non-linearity. It implements ALiBi position embeddings, enabling the model to extrapolate to long sequence lengths and providing improved context handling and model precision.
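For readers unfamiliar with ALiBi: instead of learned position embeddings, each attention head gets a fixed slope, and a linear penalty proportional to the query-key distance is added to the attention scores. A minimal sketch of the standard slope and bias computation (function names here are illustrative, not the model's actual code):

```python
import math

def alibi_slopes(num_heads: int) -> list[float]:
    # For a power-of-two head count, ALiBi slopes form a geometric
    # sequence: 2^(-8*i/num_heads) for head i = 1..num_heads.
    start = 2.0 ** (-8.0 / num_heads)
    return [start ** (i + 1) for i in range(num_heads)]

def alibi_bias(slope: float, query_pos: int, key_pos: int) -> float:
    # Bias added to the attention score for a (query, key) pair:
    # slope * -(distance). Keys further in the past are penalized more,
    # which is what lets the model extrapolate to longer sequences.
    return slope * -(query_pos - key_pos)
```

With 8 heads, the first head's slope is 2^-1 = 0.5, the second 0.25, and so on; the bias for a query 3 positions past a key under slope 0.5 is -1.5.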

The research can be studied here: https://arxiv.org/pdf/2308.16149.pdf

Metrics can be found here: https://huggingface.co/core42/jais-30b-chat-v3

These are SoTA Arabic-English bilingual models. The playground can be accessed at https://arabic-gpt.ai/

@grandiose-pizza grandiose-pizza changed the title Added support for Jais amodels Added support for Jais models Mar 4, 2024
@grandiose-pizza
Contributor Author

Not ready to be merged yet. Bugs found; fixes to be done.

@robertgshaw2-neuralmagic
Collaborator

Nice PR! Left a few cosmetic changes + some ideas for how to slightly improve the performance of the models via fusion in the MLP / a fused activation function
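The fusion idea suggested here is to stack the gate and up projections of the SwiGLU MLP into a single matmul, then apply silu(gate) * up in one fused step (in vLLM this is what combining `MergedColumnParallelLinear` with `SiluAndMul` achieves). A dependency-free sketch of the arithmetic, with illustrative names, assuming plain Python lists as weights:

```python
import math

def silu(x: float) -> float:
    # SiLU (a.k.a. swish): x * sigmoid(x)
    return x / (1.0 + math.exp(-x))

def matvec(w: list[list[float]], x: list[float]) -> list[float]:
    # Plain matrix-vector product, one output per weight row.
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]

def swiglu_mlp_fused(w_gate: list[list[float]],
                     w_up: list[list[float]],
                     x: list[float]) -> list[float]:
    # Concatenate gate and up weights row-wise so a single matmul
    # produces both projections, then split and combine. Numerically
    # identical to two separate projections, but one GEMM launch.
    w_merged = w_gate + w_up
    merged = matvec(w_merged, x)
    n = len(w_gate)
    gate, up = merged[:n], merged[n:]
    return [silu(g) * u for g, u in zip(gate, up)]
```

The fused path returns exactly silu(W_gate x) * (W_up x); the win on GPU is one kernel launch and one weight read instead of two.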

@grandiose-pizza grandiose-pizza changed the title Added support for Jais models [WIP] Added support for Jais models Mar 5, 2024
@esmeetu esmeetu added the new model Requests to new models label Mar 5, 2024
@7ossam81

@robertgshaw2-neuralmagic could you please check the status of the PR?

@esmeetu
Collaborator

esmeetu commented Mar 21, 2024

@grandiose-pizza Sorry for the delay of some days. Please merge the latest update; you can now just read the model's scale parameter from the config and pass it to LogitsProcessor, without a custom Sampler.
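The suggestion is to let vLLM's `LogitsProcessor` apply the logit scaling (after #3233 it accepts a `scale` argument) rather than scaling inside a custom Sampler. A minimal self-contained sketch of the idea; `ScaledLogitsProcessor` and `DummyConfig` are stand-ins, not vLLM classes, and the `logit_scale` attribute name is illustrative (Jais-style models derive it from their muP width-scaling hyperparameters):

```python
class ScaledLogitsProcessor:
    # Minimal stand-in for the scaling behavior of vLLM's LogitsProcessor:
    # multiply logits by a fixed scale read from the model config before
    # sampling, so no custom Sampler subclass is needed.
    def __init__(self, vocab_size: int, scale: float = 1.0):
        self.vocab_size = vocab_size
        self.scale = scale

    def __call__(self, logits: list[float]) -> list[float]:
        return [self.scale * l for l in logits]

class DummyConfig:
    # Hypothetical HF-style config carrying the scale parameter.
    vocab_size = 4
    logit_scale = 0.5

config = DummyConfig()
processor = ScaledLogitsProcessor(config.vocab_size,
                                  scale=config.logit_scale)
```

In the real integration, the model constructs the processor once in `__init__` from its config and invokes it in `compute_logits`.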

@grandiose-pizza
Contributor Author

grandiose-pizza commented Mar 21, 2024

Hi @esmeetu ,

I have updated the code to adapt to #3233. Tested in both multi-GPU and single-GPU settings; works as expected.

However, there was a tiny miss in your PR, and I have included that fix in this push as well.

I am running into an error for gpt2: I think you may have missed updating the `next_tokens` sampler call properly just in gpt2.py; the rest is okay. I have included this fix in this PR. Hope this is okay:

Here is the older bug.
The model's `sample` function passes three values to the sampler:

    def sample(
        self,
        logits: torch.Tensor,
        sampling_metadata: SamplingMetadata,
    ) -> Optional[SamplerOutput]:
        next_tokens = self.sampler(self.lm_head_weight, logits,
                                   sampling_metadata)
        return next_tokens

But the sampler's `forward` function itself accepts only two:

    def forward(
        self,
        logits: torch.Tensor,
        sampling_metadata: SamplingMetadata,
    ) -> Optional[SamplerOutput]:
        assert logits is not None
        _, vocab_size = logits.shape

I am getting the following error due to this:
TypeError: forward() takes 3 positional arguments but 4 were given
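The fix is to drop `self.lm_head_weight` from the call, since after #3233 the logits are already computed upstream and the sampler's `forward` takes only the logits and the sampling metadata. A self-contained reproduction of the corrected call; the `Sampler` stub here is hypothetical and only does greedy argmax, whereas the real vLLM Sampler performs full sampling:

```python
from typing import Any

class Sampler:
    # Stand-in for vLLM's Sampler after PR #3233: forward takes only
    # (logits, sampling_metadata), i.e. 3 positional args counting self.
    def forward(self, logits: list[float], sampling_metadata: Any) -> int:
        # Greedy stub: return the index of the highest logit.
        return max(range(len(logits)), key=logits.__getitem__)

    __call__ = forward

class GPT2ForCausalLM:
    def __init__(self) -> None:
        self.sampler = Sampler()

    def sample(self, logits: list[float], sampling_metadata: Any) -> int:
        # Buggy version also passed self.lm_head_weight as a first extra
        # argument, raising:
        #   TypeError: forward() takes 3 positional arguments but 4 were given
        next_tokens = self.sampler(logits, sampling_metadata)  # fixed call
        return next_tokens
```

Calling `GPT2ForCausalLM().sample([0.1, 0.9, 0.2], None)` now succeeds instead of raising the TypeError.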

Collaborator

@esmeetu esmeetu left a comment

Thanks for catching that! LGTM.

@esmeetu esmeetu enabled auto-merge (squash) March 21, 2024 08:31
@esmeetu esmeetu merged commit 4c07dd2 into vllm-project:main Mar 21, 2024
32 checks passed
tjohnson31415 added a commit to tjohnson31415/vllm that referenced this pull request Mar 21, 2024
* upstream/main:
  [Misc] Bump up transformers to v4.39.0 & Remove StarCoder2Config (vllm-project#3551)
  [Misc][Log] Add log for tokenizer length not equal to vocabulary size (vllm-project#3500)
  [🚀 Ready to be merged] Added support for Jais models (vllm-project#3183)
  Fix 1D query issue from `_prune_hidden_states` (vllm-project#3539)
  [PREFIX CACHING FOLLOW UP] OrderedDict-based evictor (vllm-project#3431)
  [BugFix] Hot fix in setup.py for neuron build (vllm-project#3537)
  Migrate `logits` computation and gather to `model_runner` (vllm-project#3233)
  [1/n][Chunked Prefill] Refactor input query shapes (vllm-project#3236)
  [1/n] Triton sampling kernel (vllm-project#3186)
  [Bugfix] Fix ROCm support in CMakeLists.txt (vllm-project#3534)
@grandiose-pizza grandiose-pizza mentioned this pull request Mar 31, 2024
@grandiose-pizza grandiose-pizza changed the title [🚀 Ready to be merged] Added support for Jais models Added support for Jais models Apr 8, 2024
Labels: new model (Requests to new models)