
Treat network outputs differently depending on their names #193

Closed
lukeyeager opened this issue Aug 4, 2015 · 6 comments

@lukeyeager
Member

Jon just had a good idea.

What if instead of creating multiple paths in DIGITS like "Classification", "Generic Inference", "Bounding Boxes", etc., we had only a single generic path, and the decision about what to do with the network outputs was handled only by looking at the name of the outputs?

| Output name       | Assumed network type | Action                                                          |
|-------------------|----------------------|-----------------------------------------------------------------|
| `classifications` | Classification       | print confidence for top 5 classes                              |
| `bbox`            | Bounding Box         | draw a rectangle on top of the input image                      |
| `segmentation`    | Segmentation         | show each pixel as a color corresponding to its predicted class |
| (other)           | Generic              | just print the numbers                                          |

This is worth giving some more thought ...
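As a rough illustration of the proposal, name-based dispatch could look something like the sketch below. The function name and the shape of the `outputs` dict are hypothetical, not actual DIGITS code; the blob names and actions come from the table above.

```python
# Sketch of dispatching on output-blob names, per the table above.
# `interpret_outputs` and its input format are hypothetical.

def interpret_outputs(outputs):
    """Map each named network output to the action DIGITS would take.

    outputs: dict mapping blob name -> raw output data.
    Returns a dict mapping blob name -> a description of the action.
    """
    actions = {}
    for name in outputs:
        if name == 'classifications':
            actions[name] = 'print confidence for top 5 classes'
        elif name == 'bbox':
            actions[name] = 'draw rectangle on input image'
        elif name == 'segmentation':
            actions[name] = 'color each pixel by predicted class'
        else:
            actions[name] = 'print the raw numbers'
    return actions

print(interpret_outputs({'classifications': [0.9, 0.1], 'score': [1, 2]}))
```

The downside, as discussed below, is exactly this string matching: renaming a layer silently changes DIGITS's behavior.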

@semisight
Contributor

This would have to be very explicit in the docs. I can see someone changing the output layer name to something more meaningful to them, and then not understanding why they broke it.

I do like where the idea is going, however. Can we move the "signal" out of band somewhere? Maybe to a param in the prototxt file? Something like an optional param at the top of the prototxt that tells us to do extra processing (like bbox, for example).

@gheinrich
Contributor

It feels to me like a natural way of doing it would be to extend the model with one (or more) visualisation layers whose meaning would only be of significance to the DL front-end. If Caffe cannot simply ignore the layer, it could be implemented as an identity layer. For example:

layer {
  name: "myVisu"
  type: "Visualisation"
  bottom: "output"
  visualization_param {
    type: "classification" 
    classification_param {
      top_confidence: 5
    }
  }
  include {
    phase: TEST
  }
}

@lukeyeager
Member Author

@semisight, why not just store the network type as a piece of DIGITS metadata? Why does it need to be included in the prototxt? We can still do it in such a way that we can merge the Classification and Generic Inference paths.

if model.output_type == 'classification':
    # do something
elif model.output_type == 'bbox':
    # do something else
else:
    # default - treat as generic inference

@gheinrich, are you suggesting the network should output strings like Dog - 90%? The output of Caffe has to be n-dimensional blobs, not strings. I think it's fine to leave the interpretation of the outputs as a post-processing step external to the network definition.

@semisight
Contributor

@lukeyeager that's fine with me. I know we don't really "own" the prototxt, so it's probably not a good idea to modify it. The in-name solution just looks like a bit too much magic to me.

After thinking about it, I don't think Caffe sits at a high enough level to "care" about whether a network is classification or not. So I think storing it as DIGITS metadata is probably the best way to do it.

@gheinrich
Contributor

My thinking was that, conceptually, the visualisation of results is the final layer in your network. The DL back-end does not need to deal with it (the visualisation layers could be either hidden from the DL back-end, or implemented as an identity layer).
Since the model is defined by way of prototxt files, it would be consistent to define the visualisation in a prototxt file too. You could have a visualisation.prototxt where you specify the various visualisations you would like to see.

This would allow you to specify different visualisations for different layers, so DIGITS does not have to make assumptions about what the user wants to see (for example, I think DIGITS currently assumes that to show top-N predictions you can just look at the penultimate layer; this might not hold for all classification networks). The prototxt format is handy because it is easily extendable, easily shared, and requires no extra UI. Any text format would do, though.
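To make the idea concrete, a hypothetical visualisation.prototxt could attach different visualisations to different blobs. The layer and field names below are invented for illustration, following the format sketched in the earlier comment; none of this is an existing Caffe or DIGITS schema.

```protobuf
# Hypothetical visualisation.prototxt (field names are illustrative only)
layer {
  name: "showPredictions"
  type: "Visualisation"
  bottom: "softmax"          # the blob holding class probabilities
  visualization_param {
    type: "classification"
    classification_param {
      top_confidence: 5      # show the top-5 predicted classes
    }
  }
}
layer {
  name: "showBoxes"
  type: "Visualisation"
  bottom: "bbox_pred"        # the blob holding box coordinates
  visualization_param {
    type: "bbox"
  }
}
```

Because each visualisation names its own `bottom` blob, DIGITS would not need to guess which layer's output to display.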

@lukeyeager
Member Author

Closed by #756 (with a much better implementation than what I had originally proposed).
