Probabilistic losses?

Hello, this is @fkiraly from MLJ.jl across the street, we were wondering whether there are currently any plans for probabilistic losses, such as (strictly) proper scoring rules (with a negative sign)?
https://github.com/alan-turing-institute/MLJ.jl/issues/91
For example: log-loss (aka cross-entropy loss), Brier score (aka squared integrated loss), discrete and continuous rank probability score, etc.

Practically, this would mean comparing an observation with a probability distribution that intends to predictively model the distribution the observation will be coming from.

This would be quite important for:
(i) evaluating machine learning models that are capable of returning such predictions - this actually happens relatively frequently, but isn't always properly supported by well-designed interfaces (cough).
(ii) optimizing/fitting/tuning probabilistic machine learning models w.r.t. empirical (possibly penalized) probabilistic risk/loss

Technically, it would mean comparing a vector of predictions to a vector of Distributions.jl (which we've chosen in MLJ.jl). 
A nice benefit of this interface is that it also disentangles the usual predict_proba etc probabilistic classification interface, see e.g. here for a discussion a while ago.
https://github.com/alan-turing-institute/MLJ.jl/issues/34

Of course, it's also different from (a generalization of) your current interface... hence the question.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Probabilistic losses? #108

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Probabilistic losses? #108

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions