[BUG] [DOCS] XPOS #106

evelynmitchell · 2024-01-16T02:35:10Z

There's a dimensional error in the 3rd example of Xpos:
Code

import torch
from zeta import fixed_pos_embedding, apply_rotary_pos_emb

# Generate fixed positional embeddings
scale = torch.randn(10, 256)
sin, cos = fixed_pos_embedding(scale)

# Apply rotary positional embeddings to an input tensor
x = torch.randn(1, 10, 256)
output = apply_rotary_pos_emb(x, sin, cos, scale=0.5)

RuntimeError                              Traceback (most recent call last)
[<ipython-input-18-4d63cb090aa8>](https://localhost:8080/#) in <cell line: 10>()
      8 # Apply rotary positional embeddings to an input tensor
      9 x = torch.randn(1, 10, 256)
---> 10 output = apply_rotary_pos_emb(x, sin, cos, scale=0.5)

[/usr/local/lib/python3.10/dist-packages/zeta/nn/embeddings/xpos_relative_position.py](https://localhost:8080/#) in apply_rotary_pos_emb(x, sin, cos, scale)
     69     """
     70     sin, cos = map(lambda t: duplicate_interleave(t * scale), (sin, cos))
---> 71     return (x * cos) + (rotate_every_two(x) * sin)
     72 
     73 

RuntimeError: The size of tensor a (256) must match the size of tensor b (512) at non-singleton dimension 2

The implementation in the original paper is wrong. This wrong implementation was copied into hf/transformers, and then fixed:
huggingface/transformers@052fa2f
https://github.com/huggingface/transformers/blob/edb170238febf7fc3e3278ed5b9ca0b2c40c70e3/src/transformers/models/gptj/modeling_flax_gptj.py#L122

Upvote & Fund

We're using Polar.sh so you can upvote and help fund this issue.
We receive the funding once the issue is completed & confirmed by you.
Thank you in advance for helping prioritize & fund our backlog.

The text was updated successfully, but these errors were encountered:

kyegomez · 2024-01-19T13:49:44Z

@evelynmitchell the link you posted is not in pytorch, they are very different.

github-actions · 2024-03-20T12:42:10Z

Stale issue message

evelynmitchell added the bug Something isn't working label Jan 16, 2024

evelynmitchell assigned kyegomez Jan 16, 2024

evelynmitchell mentioned this issue Jan 17, 2024

[BUG] SinusoidalEmbedding rotate_every_two , related to #106 #117

Closed

github-actions bot added the no-issue-activity label Mar 20, 2024

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Mar 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] [DOCS] XPOS #106

[BUG] [DOCS] XPOS #106

evelynmitchell commented Jan 16, 2024 •

edited by polar-sh bot

kyegomez commented Jan 19, 2024

github-actions bot commented Mar 20, 2024

[BUG] [DOCS] XPOS #106

[BUG] [DOCS] XPOS #106

Comments

evelynmitchell commented Jan 16, 2024 • edited by polar-sh bot

Upvote & Fund

kyegomez commented Jan 19, 2024

github-actions bot commented Mar 20, 2024

evelynmitchell commented Jan 16, 2024 •

edited by polar-sh bot