Relevance embedding #42

Closed
taeyeopl opened this issue Jun 15, 2021 · 4 comments
Comments

taeyeopl commented Jun 15, 2021

R_lv3 = torch.bmm(refsr_lv3_unfold, lrsr_lv3_unfold) #[N, Hr*Wr, H*W]
R_lv3_star, R_lv3_star_arg = torch.max(R_lv3, dim=1) #[N, H*W]

As I understand it, according to equation 4 in the main paper, the relevance matrix computes a normalized inner product: r_{i,j} = <q_i / ||q_i||, k_j / ||k_j||>. (The query Q comes from the up-sampled low-resolution image, and the key K comes from the down/up-sampled reference image.)

My understanding corresponds to the code below.
Q1. Can you explain why your code uses the opposite argument order? (Usually, a transformer computes the scores as QK^T.)

R_lv3 = torch.bmm(lrsr_lv3_unfold, refsr_lv3_unfold) #[N, H*W, Hr*Wr] 
R_lv3_star, R_lv3_star_arg = torch.max(R_lv3, dim=2) #[N, H*W]
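For concreteness, here is a minimal sketch of the normalized inner product from equation 4, using made-up shapes and `F.normalize` (the real code unfolds 3x3 patches first; that step is omitted here, and all tensor names/shapes are illustrative assumptions):

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
N, C, HW, HrWr = 2, 8, 5, 7  # assumed toy sizes

# q: query patches from the up-sampled LR image, [N, C, H*W]
# k: key patches from the down/up-sampled reference, [N, C, Hr*Wr]
q = torch.randn(N, C, HW)
k = torch.randn(N, C, HrWr)

# Normalize each patch vector so the inner product becomes a cosine similarity,
# i.e. r_{i,j} = <q_i / ||q_i||, k_j / ||k_j||>
q = F.normalize(q, dim=1)
k = F.normalize(k, dim=1)

# QK^T ordering: relevance matrix is [N, H*W, Hr*Wr], entries in [-1, 1]
R = torch.bmm(q.permute(0, 2, 1), k)
print(R.shape)
```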
@FuzhiYang
Member

It is OK to swap the argument positions of lrsr_lv3_unfold and refsr_lv3_unfold. You just need to permute lrsr_lv3_unfold instead of refsr_lv3_unfold in this line:

refsr_lv3_unfold = refsr_lv3_unfold.permute(0, 2, 1)

Then you can apply your code above and get equal results.


taeyeopl commented Jun 30, 2021

Due to my limited understanding, I do not clearly understand your point.
My understanding is that matrix multiplication is not commutative.
Therefore, those two operations should produce different relevance matrices, even taking the permute into account.
Can you explain why you get equal results in the end?
Am I missing something?

R_lv3 = torch.bmm(lrsr_lv3_unfold, refsr_lv3_unfold) #[N, H*W, Hr*Wr] (Original)
R_lv3 = torch.bmm(refsr_lv3_unfold, lrsr_lv3_unfold) #[N, Hr*Wr, H*W] (My understanding)


FuzhiYang commented Jul 1, 2021

I mean the key point is to get R_lv3_star and R_lv3_star_arg in

R_lv3_star, R_lv3_star_arg = torch.max(R_lv3, dim=1) #[N, H*W]

Therefore, how you permute the tensor is not important: the two orderings produce relevance matrices that are transposes of each other. You just need to adjust the dim parameter of torch.max (dim=1 for a [N, Hr*Wr, H*W] matrix, dim=2 for [N, H*W, Hr*Wr]) to get the same R_lv3_star and R_lv3_star_arg.
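This equivalence can be checked numerically. A minimal sketch with made-up shapes (the unfold step and the real feature extraction are omitted; only the bmm/max logic from the thread is reproduced):

```python
import torch

torch.manual_seed(0)
N, C, HW, HrWr = 2, 8, 5, 7  # assumed toy sizes

# Hypothetical unfolded features, before any permute:
# refsr_lv3_unfold: reference patches, [N, C, Hr*Wr]
# lrsr_lv3_unfold:  LR patches,        [N, C, H*W]
refsr_lv3_unfold = torch.randn(N, C, HrWr)
lrsr_lv3_unfold = torch.randn(N, C, HW)

# Repo ordering: permute the reference side, scores are [N, Hr*Wr, H*W], max over dim=1
R_orig = torch.bmm(refsr_lv3_unfold.permute(0, 2, 1), lrsr_lv3_unfold)
star_orig, arg_orig = torch.max(R_orig, dim=1)  # both [N, H*W]

# QK^T ordering: permute the LR side, scores are [N, H*W, Hr*Wr], max over dim=2
R_alt = torch.bmm(lrsr_lv3_unfold.permute(0, 2, 1), refsr_lv3_unfold)
star_alt, arg_alt = torch.max(R_alt, dim=2)     # both [N, H*W]

# The two relevance matrices are batch-wise transposes of each other,
# so the max values and argmax indices coincide once dim is adjusted.
assert torch.allclose(R_alt, R_orig.transpose(1, 2))
assert torch.allclose(star_orig, star_alt)
assert torch.equal(arg_orig, arg_alt)
print("equal results")
```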


taeyeopl commented Jul 6, 2021

I see, thanks for the detailed explanation!!

@taeyeopl taeyeopl closed this as completed Jul 6, 2021