Some questions about your paper #45

zinuoli · 2024-04-12T07:34:45Z

Hi, sorry for the distrubance again, I got some ambiguous points after reading your paper:

Do you train CLIP Controller and Restoration Model separtely or train they at the same time?
I saw you introduce learnable prompt at this line, which is smart. However, I notice you incorporate prompt_embedding by t = t + prompt_embedding, my question is why you integrate degradation type into time step, instead of by cross attention like this x = attn(x, context=image_context).
For the NAFNet there's no time step, how did you integrate prompt_embedding into NAFNet?

Something I didn't find answer in your paper (or I missed), sorry for interrupting you. Thank you for your great work.

The text was updated successfully, but these errors were encountered:

Algolzw · 2024-04-12T08:58:11Z

Hi,

I train the two models separately.
I directly add the prompt_embedding to time_embedding for model efficiency since we already have cross-attentions for the image_context.
For the modified NAFNet (with time embeddings), you can refer to the Refusion's code here.

zinuoli · 2024-04-12T11:48:53Z

Got it, thank you so much.😄

zinuoli closed this as completed Apr 12, 2024

zinuoli mentioned this issue Apr 13, 2024

How do you integrate DA-CLIP to NAFNet? #44

Closed

Provide feedback