
The self-attention forward pass that uses a 5D tensor can be rewritten with 4D tensors #2584

@deeperlearner

Description


qkv = self.qkv(x).reshape(B, N, 3, self.num_heads, self.head_dim).permute(2, 0, 3, 1, 4)  # (3, B, H, N, D)
q, k, v = qkv.unbind(0)  # each (B, H, N, D)

I'm using the Ambarella conversion toolchain, which only supports 4D tensors. This part, which builds a 5D tensor, raises an error every time the self-attention forward is called.
I'm proposing a replacement here that uses only 4D tensors. Because self.qkv projects to 3*H*D channels and the original reshape to (B, N, 3, H, D) lays out q, k, and v as contiguous chunks of H*D along the last dimension, the three can be sliced out directly:

H, D = self.num_heads, self.head_dim
qkv = self.qkv(x)  # (B, N, 3*H*D)
q = qkv[:, :, :H*D].reshape(B, N, H, D).transpose(1, 2)       # (B, H, N, D)
k = qkv[:, :, H*D:2*H*D].reshape(B, N, H, D).transpose(1, 2)  # (B, H, N, D)
v = qkv[:, :, 2*H*D:].reshape(B, N, H, D).transpose(1, 2)     # (B, H, N, D)

I know the 5D tensor version is more readable, but the 4D tensor version solved my problem.
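For reference, here is a minimal standalone sketch (with hypothetical sizes for B, N, H, D and a random tensor standing in for the qkv projection output) that checks the two formulations produce identical q, k, and v:

import torch

# Hypothetical sizes: batch, tokens, heads, head_dim
B, N, H, D = 2, 16, 4, 8
x = torch.randn(B, N, 3 * H * D)  # stand-in for self.qkv(x)

# 5D path (original)
qkv5 = x.reshape(B, N, 3, H, D).permute(2, 0, 3, 1, 4)  # (3, B, H, N, D)
q5, k5, v5 = qkv5.unbind(0)                              # each (B, H, N, D)

# 4D path (proposed)
q4 = x[:, :, :H * D].reshape(B, N, H, D).transpose(1, 2)
k4 = x[:, :, H * D:2 * H * D].reshape(B, N, H, D).transpose(1, 2)
v4 = x[:, :, 2 * H * D:].reshape(B, N, H, D).transpose(1, 2)

assert torch.equal(q5, q4) and torch.equal(k5, k4) and torch.equal(v5, v4)

Since both paths only reorder the same projected values into (B, H, N, D), the downstream attention computation is unchanged.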
