
Conversation

@qihqi (Collaborator) commented on May 10, 2024:

For Gemma 2B we need to change the shardings because the dimension we usually shard, num_kv_heads, happens to be 1 for Gemma 2B, so we pick a different dimension to shard.
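As a rough illustration of the idea (a hedged sketch, not the repo's actual helper: the cache layout and the names pick_kv_shard_axis and shard_on_axis are assumptions), with a KV cache laid out as (batch, num_kv_heads, seq_len, head_dim) the usual target is axis 1, but a size-1 axis cannot be split across devices, so a fallback such as head_dim is used instead:

import jax
import numpy as np
from jax.sharding import Mesh, NamedSharding, PartitionSpec

def pick_kv_shard_axis(num_kv_heads: int, num_devices: int) -> int:
    # Usual case: shard the kv-heads axis (axis 1) when it divides evenly.
    if num_kv_heads > 1 and num_kv_heads % num_devices == 0:
        return 1
    # Gemma 2B: num_kv_heads == 1 cannot be split, so fall back to head_dim.
    return 3

def shard_on_axis(x, axis, mesh):
    # Partition only `axis` across the mesh axis "x"; replicate the rest.
    spec = [None] * x.ndim
    spec[axis] = "x"
    return jax.device_put(x, NamedSharding(mesh, PartitionSpec(*spec)))

mesh = Mesh(np.array(jax.devices()), axis_names=("x",))
# e.g. keys = shard_on_axis(keys, pick_kv_shard_axis(num_kv_heads, len(jax.devices())), mesh)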

@qihqi requested review from @FanhaiLu1 and @lsy323 on May 10, 2024 00:09
@qihqi force-pushed the hanq_add_model branch from e4c6542 to c4679e7 on May 10, 2024 02:04

with jax.named_scope("attn_insert_cache"):
    keys, values = cache.update(xk, xv)
    self.env.apply_sharding(keys, axis=1)
A collaborator commented on this snippet:
Any reason to remove the keys, values sharding?

@FanhaiLu1 (Collaborator) left a comment:

Please fix the lint error:

************* Module benchmarks.analyze_sharegpt
benchmarks/analyze_sharegpt.py:75:19: W0621: Redefining name 'prefill_bucket_size_to_ms' from outer scope (line 56) (redefined-outer-name)
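For reference, W0621 fires when a local binding shadows a module-level name. A hypothetical sketch of the fix (do_simulation is an assumed function name; only prefill_bucket_size_to_ms and the line numbers come from the pylint output):

# Before: the parameter at line 75 shadows the module-level dict from line 56,
# triggering W0621 (redefined-outer-name).
prefill_bucket_size_to_ms = {}  # module-level dict (line 56)

def do_simulation(prefill_bucket_size_to_ms):  # W0621 fires here
    ...

# After: renaming the parameter removes the shadowing.
def do_simulation(bucket_to_ms):
    ...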

@FanhaiLu1 merged commit 48a8a22 into main on May 10, 2024.