
Role of coefficients #6

Closed
NotNANtoN opened this issue Nov 3, 2021 · 3 comments

Comments

@NotNANtoN

Hello again,

in the lines I've marked you initialize a set of coefficients to optimize over. As far as I can see, these are not mentioned in the paper. The coefficients are multiplied by the direction per source image, so I gather that you want to optimize a different scale of the direction vector for each source image. I have some questions about this:

  1. Did you try it without these coefficients?
  2. What values do the coefficients converge to? Do they stay close to 1?
  3. You re-initialize the Adam optimizer for the coefficients for every step within the optimization, hence drastically changing the behavior of the optimizer. Is this intended or a misplacement? If it is intended, what is it used for?

Thanks again for your work! I hope I am not too picky on this - I'm just curious about the topic of semantics in these latent spaces :-)

coefficients = [None] * NUM_IMAGES
for n in range(NUM_IMAGES):
    # one learnable scale coefficient per source image, initialized to 1
    coefficient = torch.ones(1).to("cuda")
    coefficient.requires_grad = True
    coefficients[n] = coefficient
opt_loss = torch.Tensor([float("Inf")]).to("cuda")
pbar = tqdm(range(args.step))
for i in pbar:
    # calculate the learning rate for the latent optimizer
    t = i / args.step
    lr = get_lr(t, args.lr)
    optimizer.param_groups[0]["lr"] = lr
    # the Adam optimizer for the coefficients is re-created on every step here
    optimizer_coeffs = optim.Adam(coefficients, lr=args.lr, weight_decay=0.01)
    loss = torch.zeros(1).cuda()
    target_semantic = torch.zeros(1).cuda()

@hila-chefer
Owner

Hi @NotNANtoN :)
First, please feel free to ask anything, I'm happy to answer :)
Indeed, we do not mention the optimization of coefficients in our paper, since it's a short 4-page paper (+ no supplementary).
In addition, as you observed, the coefficients are used to allow for finer manipulation of each source. Intuitively, if the source and target are semantically close (say both have a beard), we would want to apply a smaller change to the source to resemble the target.
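
Concretely, each coefficient just scales the shared direction before it is added to the corresponding source latent. A rough illustrative sketch (the names source_latents, direction, generator, clip_loss, and target_image are placeholders, not the exact identifiers from the repo):

# illustrative: each source image gets its own learned scale for the shared direction
for n in range(NUM_IMAGES):
    manipulated_latent = source_latents[n] + coefficients[n] * direction
    edited_image = generator(manipulated_latent)           # synthesize the edited source
    loss = loss + clip_loss(edited_image, target_image)    # CLIP-space distance to the target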

  1. Yes, a few times; overall, the results were very similar.
  2. From what I observed, they usually end up in the range 0.5–1.8, which is also around the range we provide in our notebooks :)
  3. You are absolutely right, this is a misplacement. The optimizer should be initialized once per direction, not once per step. I'll fix this in my next code update (soon), thanks for the catch! In the meantime, to address 3, I tried our joker training with the fix, and as you can see in the image below (results on the training set with the optimized coefficients), the difference in results isn't major. A minimal sketch of the intended fix is included at the end of this comment.

I hope I was able to answer all your questions :)

[image: joker training results on the training set with the optimized coefficients]
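
For reference, a minimal sketch of the intended fix, assuming the same variable names as in the snippet quoted above: the Adam optimizer for the coefficients is created once, before the step loop, instead of inside it.

# intended fix: build the coefficient optimizer once (per direction), not on every step
optimizer_coeffs = optim.Adam(coefficients, lr=args.lr, weight_decay=0.01)
pbar = tqdm(range(args.step))
for i in pbar:
    # learning-rate schedule for the latent optimizer stays unchanged
    t = i / args.step
    lr = get_lr(t, args.lr)
    optimizer.param_groups[0]["lr"] = lr
    # ... compute the loss as before, then step both optimizer and optimizer_coeffs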

@hila-chefer
Owner

Hi @NotNANtoN, I’m closing this issue due to inactivity, but feel free to reopen if necessary.

@NotNANtoN
Author

Thanks a lot, your answers were very insightful :)
