Question about the MMN network design #24

laubeke · 2021-03-23T14:26:48Z

Hi, thanks for the nice work and code!

The MMN is a single fully-connected layed defined and used in
https://github.com/microsoft/Cream/blob/main/lib/models/structures/supernet.py#L92
https://github.com/microsoft/Cream/blob/main/lib/models/structures/supernet.py#L127
and then used by the board in
https://github.com/microsoft/Cream/blob/main/lib/models/PrioritizedBoard.py#L32

I do wonder why the MMN size depends on the slice size (i.e. the number of images) and concatenates their class probabilities, instead of using the same weights for each of image?

Z7zuqer · 2021-03-25T03:37:46Z

Hi,

Thanks for your interest in our project!

MMN aims to calculate the matching degree of student and teacher subnets. One image is not abundant enough to reflect their matching degree, thus we use a batch of data containing several images to reflect. So the size of MMN depends on the slice size.

Best,
Hao.

laubeke · 2021-03-25T08:14:13Z

Hi,

thanks for the quick response!
I'm not sure if I'm understood correctly, so let me be more precise:

n = num classes
s = slice size

the MMN net takes a [s, n] shaped vector, reshapes it to [s * n], and then uses a fully connected layer to get an output of size [1]
the alternative is to not reshape (keep [s, n]), but get a [s, 1] sized output, over which one can average to get the [1] sized output

in the second case, neither slice size nor the order of the images matters, and it is also not limited to a single image.

Best,
Kevin

Z7zuqer · 2021-03-26T05:18:11Z

Hi,

Thanks for your interest in our project!

We design the MMN to predict the matching degree of two subnets over mini-batch data. Firstly, it's ok for the second case. However, in the second case, the weight tensor of MMN is only Nx1, theoretically worse than (SxN)x1. Besides, the updating batch size of MMN is fixed during supernet training, so we take the first case.

Best,
Hao.

penghouwen closed this as completed Jul 1, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about the MMN network design #24

Question about the MMN network design #24

laubeke commented Mar 23, 2021 •

edited

Loading

Z7zuqer commented Mar 25, 2021

laubeke commented Mar 25, 2021 •

edited

Loading

Z7zuqer commented Mar 26, 2021

Question about the MMN network design #24

Question about the MMN network design #24

Comments

laubeke commented Mar 23, 2021 • edited Loading

Z7zuqer commented Mar 25, 2021

laubeke commented Mar 25, 2021 • edited Loading

Z7zuqer commented Mar 26, 2021

laubeke commented Mar 23, 2021 •

edited

Loading

laubeke commented Mar 25, 2021 •

edited

Loading