Hello, Mr. Wu
Could you tell me the structure of the b18c384nbt network?
Is each block like this:
{384 X 1 X 1 X 192, 192 X 3 X 3 X 192, 192 X 3 X 3 X 192, 192 X 1 X 1 X 384} X 18
I guess this network is roughly as strong as 36b384c,
because the depth has doubled and each convolution layer effectively enlarges the receptive field.
So why not make the bottleneck even slimmer relative to the trunk? Would a 768-channel trunk be stronger?
The network structure would be like this:
{768 X 1 X 1 X 192, 192 X 3 X 3 X 192, 192 X 3 X 3 X 192, 192 X 1 X 1 X 768} X 18
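Every position on the board could then hold twice as many features, while the parameter count increases by only (4 - 2) / (9 + 2) = 18% (counting weights in units of 192 X 192: each 1 X 1 convolution goes from 2 to 4, and each 3 X 3 convolution stays at 9).
Another question: is the activation function ReLU?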
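To double-check that figure, here is a small Python sketch of the arithmetic. It assumes the block layout guessed above (1 X 1 down, two 3 X 3, 1 X 1 up), which may not match the real network, and it ignores biases and normalization parameters. The name `block_params` is only for illustration.

```python
def block_params(trunk: int, inner: int) -> int:
    """Convolution weights in one guessed block (biases and norm layers ignored)."""
    down = trunk * 1 * 1 * inner        # 1x1 conv: trunk -> inner channels
    mid = 2 * (inner * 3 * 3 * inner)   # two 3x3 convs at the inner width
    up = inner * 1 * 1 * trunk          # 1x1 conv: inner -> trunk channels
    return down + mid + up


base = block_params(384, 192)   # guessed b18c384nbt block
wide = block_params(768, 192)   # proposed 768-channel trunk
increase = (wide - base) / base

print(base, wide, f"{increase:.1%}")  # 811008 958464 18.2%
```

The block count (18) is the same in both cases, so the per-block ratio is also the ratio for the whole trunk.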
Thanks.