Loss always stays around 9.3 #7

Closed
taoyunuo opened this issue Jul 27, 2017 · 8 comments
@taoyunuo commented Jul 27, 2017

The loss always stays around 9.3 and does not go down.
I set the learning rate to 0.01 and 0.06, but the loss did not converge.
How does the training network need to be modified?
How should the training parameters be set?
Hope to get your help. Thanks!
@wdwen

@zhly0 commented Jul 28, 2017

I have the same issue, and I think many people do as well. Maybe it is due to the parameter settings.

@recordcode

You guys didn't train on MS-Celeb and used another dataset, right?

@zhly0 commented Jul 29, 2017

@recordcode You mean it is related to the dataset?

@taoyunuo (Author)

I set the learning rate to 0.06 and the loss started to converge. I use RGB face images as input; I don't use an LMDB as input.
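For reference, a minimal Caffe solver sketch along these lines (only base_lr reflects what I actually used; the net file name, step schedule, and iteration counts are illustrative assumptions, not the repo's released settings):

# solver.prototxt (sketch) -- everything except base_lr is an illustrative guess
net: "sphereface_train_test.prototxt"   # hypothetical net definition file
base_lr: 0.06          # base learning rate of 0.06, which converged for me
lr_policy: "multistep"
gamma: 0.1
stepvalue: 16000
stepvalue: 24000
max_iter: 28000
momentum: 0.9
weight_decay: 0.0005
snapshot: 2000
snapshot_prefix: "sphereface_model"
solver_mode: GPU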

@wy1iu (Owner) commented Aug 9, 2017

We updated the repo to fix some bugs. The pipeline should now run without modification. The SphereFace-20 model (described in the paper) is also released.

@happynear

The loss computed after the margin_inner_product layer is always bigger than that of a normal inner_product layer. A small trick used in large-margin softmax is to switch to a traditional inner_product layer at testing time. Sample prototxt:

############### A-Softmax Loss ##############
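# TRAIN phase: A-Softmax margin layer (type QUADRUPLE, i.e. m = 4); lambda is annealed from base toward lambda_min as training proceeds.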
layer {
  name: "fc6"
  type: "MarginInnerProduct"
  bottom: "fc5"
  bottom: "label"
  top: "fc6"
  top: "lambda"
  param {
    name: "fc6"
    lr_mult: 1
    decay_mult: 1
  }
  margin_inner_product_param {
    num_output: 10572
    type: QUADRUPLE
    weight_filler {
      type: "xavier"
    }
    base: 1000
    gamma: 0.12
    power: 1
    lambda_min: 5
    iteration: 0
  }
  include {
    phase: TRAIN
  }
}
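# TEST phase: same weights (shared via the param name fc6), but type SINGLE (m = 1) with lambda fixed at 0, which is effectively a plain inner product on the normalized weights (no margin), so the test loss and accuracy are comparable to a standard softmax classifier.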
layer {
  name: "fc6"
  type: "MarginInnerProduct"
  bottom: "fc5"
  bottom: "label"
  top: "fc6"
  top: "lambda"
  param {
    name: "fc6"
    lr_mult: 0
  }
  margin_inner_product_param {
    num_output: 10572
    type: SINGLE
    base: 0
    gamma: 1
    iteration: 0
    lambda_min: 0
    weight_filler {
      type: "msra"
    }
  }
  include {
    phase: TEST
  }
}
layer {
  name: "softmax_loss"
  type: "SoftmaxWithLoss"
  bottom: "fc6"
  bottom: "label"
  top: "softmax_loss"
}
layer {
  name: "accuracy"
  type: "Accuracy"
  bottom: "fc6"
  bottom: "label"
  top: "accuracy"
  include {
    phase: TEST
  }
}

Then you can observe the accuracy and loss in the testing phase to check whether the network is training normally.
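A side note on the TRAIN-phase parameters above, in case it helps with the original question: if I read the margin_inner_product layer correctly, lambda is annealed as lambda = max(lambda_min, base * (1 + gamma * iter)^(-power)), so with base: 1000, gamma: 0.12, power: 1 and lambda_min: 5 the layer starts out behaving almost like plain softmax (lambda ≈ 1000) and only gradually phases in the angular margin as lambda decays toward 5. Also note that a loss stuck around 9.3 is roughly ln(10572) ≈ 9.27, i.e. the loss of a softmax over the 10572 classes in the prototxt above that is still guessing uniformly, which usually just means training never got started (for example, the learning rate is too high or lambda decays too fast).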

@inlmouse commented Aug 9, 2017

@happynear You are right. This trick was also mentioned in the large-margin softmax (L-Softmax) work.

@wy1iu closed this as completed Aug 10, 2017
@wy1iu (Owner) commented Aug 10, 2017

If there is still something wrong with your loss, you can reopen this issue or open a new one for further discussion.
