What is AmoebaNet-D? #130
Comments
From one of the authors: "AmoebaNet-D was obtained by evolving on the ImageNet training set, starting with AmoebaNet-B, and then manually extrapolating the evolutionary process and tuning it for low training cost. It was the version submitted to the Stanford DAWNBench competition. More details may follow in subsequent publications."
@sb2nov Sorry to bother you. I am confused by this code in the AmoebaNet implementation (see https://github.com/tensorflow/tpu/blob/master/models/experimental/amoeba_net/network_utils.py#L321):
Why not insert And I want to know why the
@bignamehyp could you answer the above?
Normal cell diagram of amoeba_net_d: https://goo.gl/gKt4fL
@bignamehyp Thanks for the reply. I am confused why From the diagram, the output of the cell is
There are 7 elements in used_hiddenstates. The first two elements are the input hidden states h0 and h1, which are not used for the concat. The last 5 elements indicate which hidden states' outputs are used for the concat.
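The layout described in this comment can be sketched in plain Python. This is only an illustration, not the code from network_utils.py: the function name `states_to_concat`, the `include_inputs` switch, and the example flag values are all hypothetical, and the sketch assumes that a flag of 0 marks a hidden state that is concatenated into the cell output, which the thread does not spell out.

```python
def states_to_concat(hidden_states, used_hiddenstates, include_inputs=False):
    """Pick the hidden states that would be concatenated into the cell output.

    Assumption (not confirmed by the thread): a flag of 0 means the state
    goes into the concat, a flag of 1 means it does not.
    """
    if len(hidden_states) != len(used_hiddenstates):
        raise ValueError("expected one flag per hidden state")

    # Positions 0 and 1 are the cell inputs h0 and h1; the rest are block outputs.
    first = 0 if include_inputs else 2
    return [
        state
        for state, flag in zip(hidden_states[first:], used_hiddenstates[first:])
        if flag == 0
    ]


# Hypothetical flag values, chosen only to show the indexing.
example_states = ["h0", "h1", "h2", "h3", "h4", "h5", "h6"]
example_flags = [1, 0, 1, 0, 0, 1, 0]

skipping_inputs = states_to_concat(example_states, example_flags)
keeping_inputs = states_to_concat(example_states, example_flags, include_inputs=True)
```

With `include_inputs=False` the two input states are never candidates for the concat, which is the behaviour described in the comment. With `include_inputs=True` an input whose flag is 0 is concatenated as well, so comparing the two results shows how much the cell output depends on whether the first two entries are skipped.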
Thanks for your patience! Yes, I got what you said above. But I am more confused actually. I think the first two hidden states are not skipped. Refer to https://github.com/tensorflow/tpu/blob/master/models/experimental/amoeba_net/network_utils.py#L490 Another question: h0 should be the first element in the list of the hidden state, right? But in the code, |
I agree with @xmfbit. The
@karandwivedi42 Actually, I set the first two elements of
Thank you very much for digging into the code. There are actually bugs in the model builder:
Our visualization of the cell architecture was based on what we believed the code did rather than on what it actually produced. We have updated the cell architecture in the paper to match the code output. Please see Figure 2 of https://arxiv.org/pdf/1802.01548.pdf for the latest diagrams.
I checked the paper and searched Google, but I could not find any information about AmoebaNet-D. In the paper I found only AmoebaNet-A, AmoebaNet-B, and AmoebaNet-C. What is AmoebaNet-D?