The details of "samples one sub-graph at one training iteration" #12

Closed
kongxz opened this issue May 28, 2019 · 4 comments

Comments

@kongxz

kongxz commented May 28, 2019

Could you explain the details of "samples one sub-graph at one training iteration"?

As far as I know, the result of Gumbel-Softmax may not be a one-hot vector; it could be something like [0.96, 0.01, 0.01, 0.01, 0.01].

When you sample one sub-graph during training, do you simply drop all the connections with weight 0.01?

Thanks.
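
For reference, PyTorch's torch.nn.functional.gumbel_softmax shows both behaviours mentioned above; a small illustration (the 5-way logits are made up for this example):

import torch
import torch.nn.functional as F

logits = torch.randn(5, requires_grad=True)          # 5 candidate connections (hypothetical)
soft = F.gumbel_softmax(logits, tau=1.0)              # soft sample, e.g. [0.96, 0.01, 0.01, 0.01, 0.01]
hard = F.gumbel_softmax(logits, tau=1.0, hard=True)   # straight-through sample, exactly one entry is 1.0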

@D-X-Y
Owner

D-X-Y commented May 28, 2019

Sure, we use the hard mode, so it is a one-hot vector. Something like this in PyTorch:

y_soft = gumbel_softmax( ... )                 # relaxed (soft) sample
y_hard = one_hot( y_soft.argmax() )            # hard one-hot sample
y_hard = y_hard - y_soft.detach() + y_soft     # straight-through: one-hot forward, soft gradients

During the forward pass, you could use:

cals = []
for i, w in enumerate(y_hard):
    if w.item() == 1:                  # the sampled operation
        cals.append( op[i](x) * w )    # multiply by w so gradients reach the architecture logits
    else:
        cals.append( x )               # non-sampled paths contribute the input unchanged
return sum(cals)
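
For readers who want to run this end to end, here is a self-contained sketch of the same idea (the GumbelMixedOp class, its ops/logits names, and the argmax-index comparison are illustrative choices, not the repository's actual code):

import torch
import torch.nn as nn
import torch.nn.functional as F

class GumbelMixedOp(nn.Module):
    def __init__(self, ops):
        super().__init__()
        self.ops = nn.ModuleList(ops)
        self.logits = nn.Parameter(torch.zeros(len(ops)))    # architecture parameters

    def forward(self, x):
        y_soft = F.gumbel_softmax(self.logits, tau=1.0)       # relaxed sample
        index = y_soft.argmax()
        y_hard = F.one_hot(index, len(self.ops)).float()      # hard one-hot sample
        y_hard = y_hard - y_soft.detach() + y_soft            # straight-through trick
        cals = []
        for i, w in enumerate(y_hard):
            if i == index:                                    # only the sampled op is computed
                cals.append(self.ops[i](x) * w)
            else:
                cals.append(x)                                # unselected paths, as in the snippet above
        return sum(cals)

# usage: two candidate operations on a batch of 8-dim features
mixed = GumbelMixedOp([nn.Linear(8, 8), nn.Identity()])
out = mixed(torch.randn(4, 8))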

D-X-Y closed this as completed May 28, 2019
@coolKeen

Sure, we use the hard mode, so it is a one-hot vector. Something like this in PyTorch:

y_soft = gumbel_softmax( ... )                 # relaxed (soft) sample
y_hard = one_hot( y_soft.argmax() )            # hard one-hot sample
y_hard = y_hard - y_soft.detach() + y_soft     # straight-through: one-hot forward, soft gradients

How do you implement the backward pass?

@D-X-Y
Owner

D-X-Y commented Jun 18, 2019

If you implement the forward pass in the above style, PyTorch handles the backward pass automatically: the straight-through construction lets gradients flow through y_soft.
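
A quick way to see this (purely illustrative, with made-up logits and a toy loss): the forward value is one-hot, but gradients still reach the logits through the soft term.

import torch
import torch.nn.functional as F

logits = torch.randn(5, requires_grad=True)
y_soft = F.gumbel_softmax(logits, tau=1.0)
y_hard = F.one_hot(y_soft.argmax(), 5).float()
y_hard = y_hard - y_soft.detach() + y_soft    # forward: one-hot, backward: gradient of y_soft

loss = (y_hard * torch.arange(5.0)).sum()     # any loss that consumes y_hard
loss.backward()
print(logits.grad)                            # non-zero: autograd went through the soft term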

@brdav

brdav commented Nov 4, 2019

Sure, we use the hard mode, so it is a one-hot vector. Something like this in PyTorch:

y_soft = gumbel_softmax( ... )                 # relaxed (soft) sample
y_hard = one_hot( y_soft.argmax() )            # hard one-hot sample
y_hard = y_hard - y_soft.detach() + y_soft     # straight-through: one-hot forward, soft gradients

During the forward pass, you could use:

cals = []
for i, w in enumerate(y_hard):
    if w.item() == 1:                  # the sampled operation
        cals.append( op[i](x) * w )    # multiply by w so gradients reach the architecture logits
    else:
        cals.append( x )               # non-sampled paths contribute the input unchanged
return sum(cals)

Thanks for the code snippet!
Why would you append x for the paths with weight 0? Shouldn't there be no forward propagation?
