I was surprised that `logloss` for binary classification (or `crossentropy`) is not implemented. There is `LogitDistLoss`, which is `(logloss(w, t))^2`.
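
To be concrete about what I mean by the two names, here is a minimal Python sketch of the usual textbook definitions. It is not this package's API: the function names, the probability-valued inputs, and the `eps` clipping are all assumptions made for illustration.

```python
import math

def logloss(p, t, eps=1e-15):
    """Binary log loss for a predicted probability p and a target t in {0, 1}.

    Hypothetical helper: the signature is an assumption, not a library function.
    """
    # Clip p away from 0 and 1 so that log() never receives 0.
    p = min(max(p, eps), 1.0 - eps)
    return -(t * math.log(p) + (1 - t) * math.log(1.0 - p))

def crossentropy(probs, onehot, eps=1e-15):
    """Cross-entropy between predicted class probabilities and a one-hot target.

    Hypothetical helper: the signature is an assumption, not a library function.
    """
    # Only the entry for the true class contributes, since the other targets are 0.
    return -sum(t * math.log(max(p, eps)) for p, t in zip(probs, onehot))
```

With two classes the two agree: `crossentropy([p, 1 - p], [1, 0])` gives the same value as `logloss(p, 1)`.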