Upgrading SoftmaxWithLoss layer to also accept spatially varying weights #5828
base: master
Conversation
…to accept spatially varying non-negative weights for the loss; making the WeightedSoftmaxWithLoss layer inherit from the SoftmaxWithLoss layer
@shelhamer please see this updated PR. I made the weighted loss layer inherit from SoftmaxWithLoss. Thanks!
@@ -13,6 +13,8 @@ void SoftmaxWithLossLayer<Dtype>::LayerSetUp(
   LossLayer<Dtype>::LayerSetUp(bottom, top);
   LayerParameter softmax_param(this->layer_param_);
   softmax_param.set_type("Softmax");
+  // no loss weight for the Softmax internal layer.
+  softmax_param.clear_loss_weight();
This change is supposed to resolve issue #2968. Note that with one of the changes proposed in this PR, loss_weight should be of length 2; the second entry is ignored.
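For context, a two-entry loss_weight in a prototxt would look roughly like this — a sketch with made-up layer and blob names, where the second entry would be ignored per the note above:

```
layer {
  name: "loss"          # hypothetical name
  type: "SoftmaxWithLoss"
  bottom: "score"       # hypothetical bottoms: predictions and labels
  bottom: "label"
  top: "loss"
  top: "prob"           # optional second top (softmax probabilities)
  loss_weight: 1        # weight applied to the loss output
  loss_weight: 0        # second entry, ignored per this PR
}
```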
    }
  }
  top[0]->mutable_cpu_data()[0] = loss
      / this->get_normalizer(this->normalization_, agg_weight);
Have you tried normalizing by count instead of by agg_weight? For example, this weighted softmax loss implementation normalizes by count. That said, I think yours makes a bit more sense; otherwise one might need to tune the loss weight a bit.
This is a good point, and it seems like the right way to normalize. However, this is not set in stone.
I tried it a bit on my dataset, setting different weights for different classes, and it looks like normalizing by count is better than by agg_weight. Maybe I would need to tune the learning rate or loss_weight a bit to achieve good results if I choose to use agg_weight. Have you tried the two different ways of normalization?
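To make the two options concrete, here is a rough sketch — not this PR's actual code — of a get_normalizer that divides either by a valid count or by the aggregated weight, with the argument type changed to Dtype as this PR proposes; the -1 sentinel convention is borrowed from upstream Caffe:

```cpp
// Sketch only: get_normalizer with a Dtype argument, so it can receive
// either a valid count or the aggregated spatial weight.
template <typename Dtype>
Dtype SoftmaxWithLossLayer<Dtype>::get_normalizer(
    LossParameter_NormalizationMode normalization_mode, Dtype agg_weight) {
  Dtype normalizer;
  switch (normalization_mode) {
    case LossParameter_NormalizationMode_FULL:
      normalizer = Dtype(outer_num_ * inner_num_);
      break;
    case LossParameter_NormalizationMode_VALID:
      // Divide by the aggregated weight (or valid count); fall back to
      // the FULL count when no aggregate was computed.
      normalizer = (agg_weight == Dtype(-1)) ?
          Dtype(outer_num_ * inner_num_) : agg_weight;
      break;
    case LossParameter_NormalizationMode_BATCH_SIZE:
      normalizer = Dtype(outer_num_);
      break;
    case LossParameter_NormalizationMode_NONE:
      normalizer = Dtype(1);
      break;
    default:
      LOG(FATAL) << "Unknown normalization mode: "
          << LossParameter_NormalizationMode_Name(normalization_mode);
  }
  // Guard against division by zero when all weights are zero.
  return std::max(Dtype(1.0), normalizer);
}
```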
@weiliu89 I use it mainly for semantic segmentation, where each sample has several labels. How about changing caffe.proto and get_normalizer to support an additional normalization method? @shelhamer what do you think about it?
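For illustration, such an extension might look like this in caffe.proto's LossParameter; the WEIGHT_SUM name and semantics are made up here and are not part of this PR:

```protobuf
// Sketch of the NormalizationMode enum in LossParameter;
// WEIGHT_SUM is hypothetical.
enum NormalizationMode {
  FULL = 0;        // divide by the total number of output locations
  VALID = 1;       // divide by the number of non-ignored outputs
  BATCH_SIZE = 2;  // divide by the batch size
  NONE = 3;        // no normalization
  WEIGHT_SUM = 4;  // hypothetical: divide by the sum of spatial weights
}
```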
@shelhamer thank you for "focus"ing on this PR. Is there anything I can do to ease the process of accepting this PR?
@shaibagon Thank you for the post. I want to use this layer in my network. How can I add this layer to an existing Caffe installation? I installed Caffe with Visual Studio 2013 and my operating system is Windows.
adding "WeightedSoftmaxWithloss" layer.
This new loss layer upgrades "SoftmaxWithLoss" layer (and is derived from it) to accept spatially varying non-negative weights for the loss.
(replaces PR #5801 - due to change in implementation)
Usage:
This PR includes GPU implementation and tests.
This PR
"SoftmaxWithLoss"
layer(a) changed type of
get_normalizer
argument otDtype
(b) ignore
loss_weight
for the internal"Softmax"
layercaffe.proto
(!)
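Not from the original description, but as an illustration of the intended usage: a minimal prototxt sketch, assuming the weighted layer takes the per-location weights as a third bottom (all names made up):

```
layer {
  name: "loss"                    # hypothetical name
  type: "WeightedSoftmaxWithLoss"
  bottom: "score"                 # predictions (hypothetical blob names)
  bottom: "label"                 # ground-truth labels
  bottom: "pixel_weights"         # spatially varying non-negative weights
  top: "loss"
}
```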