
Confusion matrices as measures #168

@c42f


Hi, I'm doing some model building for small data classification using MLJ.

I would like to use the confusion matrix directly for evaluation by summing it over cross-validation folds. (Personally, I often like to use the confusion matrix directly to assess model performance, as it avoids many of the caveats that come from trying to summarize it into a single number.)
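To make "summing over folds" concrete, here is an untested sketch of doing the pooling by hand, outside of evaluate!. The helper MLJBase.train_test_pairs and the internal .mat field of the confusion matrix are assumptions about MLJ internals, and LinearSVC is assumed to be loaded from its providing package:

    using MLJ
    # LinearSVC = @load LinearSVC pkg=LIBSVM  # assumed model loading step

    # Pool raw confusion-matrix counts across leave-one-out folds.
    total = zeros(Int, 2, 2)  # assuming a binary problem
    cv = CV(nfolds=length(labels))
    for (train, test) in MLJBase.train_test_pairs(cv, 1:length(labels))
        mach = machine(LinearSVC(), features, labels)
        fit!(mach, rows=train, verbosity=0)
        cm = confusion_matrix(predict(mach, rows=test), labels[test])
        total .+= cm.mat  # sum the raw count matrices, fold by fold
    end

This is exactly the aggregation I'd like evaluate! to perform for me when measure=confusion_matrix.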

With that in mind, I tried something like

labels, features = unpack(...)

leave_one_out_cv = CV(nfolds=length(labels)) # small data!
evaluate!(machine(LinearSVC(), features, labels),
          resampling=leave_one_out_cv,
          measure=confusion_matrix
          )

However, this fails with an error deep in the aggregation machinery (confusion matrices cannot be summed, and so on).

You seem to almost have the pieces in place for this to work, so for now I worked around this with some type piracy:

# Hacks to allow us to treat the confusion matrix as an MLJBase measure.

MLJBase.aggregation(::Type{typeof(confusion_matrix)}) = MLJBase.Sum()
Base.round(m::MLJBase.ConfusionMatrix; kws...) = m

function Base.:+(m1::MLJBase.ConfusionMatrix, m2::MLJBase.ConfusionMatrix)
    if m1.labels != m2.labels
        throw(ArgumentError("Confusion matrix labels must agree"))
    end
    MLJBase.ConfusionMatrix(m1.mat + m2.mat, m1.labels)
end
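With these (admittedly piratical) definitions in place, the original call appears to go through and produce a single pooled matrix, along the lines of the following untested sketch (the measurement field name is an assumption about the object evaluate! returns):

    e = evaluate!(machine(LinearSVC(), features, labels),
                  resampling=CV(nfolds=length(labels)),
                  measure=confusion_matrix)
    e.measurement[1]  # one ConfusionMatrix, summed over all folds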

Is this questionable? Can it be made to "just work" using something like the above, or am I just abusing the machinery in some way?

(Side note - I'm finding MLJ/MLJBase fantastic so far. In particular I'm greatly enjoying the way that metadata about labels and features is naturally preserved throughout the training pipeline. Bravo, this is the way it should be!)
