
Fix log_loss and add test for multiclass #28

Closed
wants to merge 1,007 commits into from

Conversation

@glennq commented Aug 28, 2015

In MLPClassifier, the loss is set to 'log_loss' for both the multiclass and the multilabel case.

The log_loss implementation in neural_network/base.py is in fact binary cross-entropy, but in the multiclass case, where self.out_activation_ is set to 'softmax', it does not make sense to use a binary cross-entropy loss. I think categorical cross-entropy, also called negative log-likelihood, should be used instead:

return -np.sum(y_true * np.log(y_prob)) / y_prob.shape[0]
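For reference, here is a minimal NumPy sketch of the two losses being discussed; the function names mirror those in the PR, but the bodies are illustrative rather than the repository code:

```python
import numpy as np

def binary_log_loss(y_true, y_prob):
    # Binary cross-entropy, applied to each output unit independently;
    # suited to sigmoid outputs in the binary/multilabel case.
    return -np.sum(y_true * np.log(y_prob) +
                   (1 - y_true) * np.log(1 - y_prob)) / y_prob.shape[0]

def log_loss(y_true, y_prob):
    # Categorical cross-entropy (negative log-likelihood); the proposed loss
    # when out_activation_ is 'softmax' and each row of y_prob sums to one.
    return -np.sum(y_true * np.log(y_prob)) / y_prob.shape[0]

# Toy 3-class example with one-hot targets
y_true = np.array([[1., 0., 0.], [0., 1., 0.]])
y_prob = np.array([[0.7, 0.2, 0.1], [0.2, 0.5, 0.3]])
print(log_loss(y_true, y_prob))         # only the probability of the true class counts
print(binary_log_loss(y_true, y_prob))  # also penalizes the zero entries
```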


@amueller (Owner) commented on `def binary_log_loss(y_true, y_prob):`
Don't you think calling it multilabel_log_loss would be more informative?

@glennq (Author) replied:
You mean renaming binary_log_loss to multilabel_log_loss? That makes sense; it's just that this loss function is actually used in both the binary and the multilabel case, since I made a minimal change to BaseMultilayerPerceptron._initialize(). I took the 'binary' here from 'binary cross-entropy'.
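For illustration, the selection described here could look roughly like the sketch below; the attribute and function names follow the discussion above, but the control flow is an assumption, not the actual `_initialize()` code:

```python
def _select_loss(out_activation_):
    # Hypothetical sketch of choosing the loss from the output activation
    # inside BaseMultilayerPerceptron._initialize(); not the real source.
    if out_activation_ == 'softmax':
        # multiclass: categorical cross-entropy / negative log-likelihood
        return 'log_loss'
    # 'logistic' output: the binary and multilabel cases share binary cross-entropy
    return 'binary_log_loss'
```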

@amueller (Owner) commented:
I agree, that's a bug.
It would be good to have a non-regression test.
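A non-regression test along those lines might check the multiclass loss against the categorical cross-entropy computed by hand. This is only a sketch; the import path is taken from the PR description (neural_network/base.py) rather than verified against the branch:

```python
import numpy as np
from numpy.testing import assert_almost_equal

from sklearn.neural_network.base import log_loss  # path as described in the PR


def test_log_loss_multiclass():
    # One-hot targets for a 3-class problem; the expected value is the
    # mean negative log-probability assigned to the true class.
    y_true = np.array([[1., 0., 0.], [0., 0., 1.]])
    y_prob = np.array([[0.8, 0.1, 0.1], [0.3, 0.3, 0.4]])
    expected = -np.mean(np.log([0.8, 0.4]))
    assert_almost_equal(log_loss(y_true, y_prob), expected)
```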

@amueller (Owner) commented:
I'm not sure if it is better to work on this branch or on scikit-learn#3204. But we can fix this here for now.

@glennq (Author) commented Aug 28, 2015

I'm also confused about which pull request is going to be merged into master, but I guess the two pull requests, scikit-learn#3939 and scikit-learn#3204, need to be sorted out first.

@amueller (Owner) commented:
Unfortunately that is not entirely decided. scikit-learn#3204 is more "up to date", but I'm not sure whether all the additions there are useful.
It would be good to benchmark and review the implemented learning algorithms before deciding which way to go.

@glennq (Author) commented Aug 28, 2015

I see. I was trying to compare the two pull requests on GitHub but couldn't get an informative comparison.
I guess the main difference is the newly added learning_rate_class, which implements a few learning rate adaptation schemes for SGD (roughly the kind sketched below).
So I think you were talking about benchmarking those algorithms; that makes sense.

I'm just wondering whether it would be better to merge an extensible, minimal working implementation first and consider new features later, since this pull request has been around for nearly two years...
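For context, the kind of schedules usually meant by learning rate adaptation for SGD look roughly like this; the schedule names echo scikit-learn's usual vocabulary, but this is an illustrative sketch, not the learning_rate_class code from scikit-learn#3204:

```python
def learning_rate(schedule, eta0, t, power_t=0.5):
    # Two common SGD schedules: a fixed step size, and one that decays
    # with the iteration count t.
    if schedule == 'constant':
        return eta0
    if schedule == 'invscaling':
        return eta0 / (t + 1) ** power_t
    raise ValueError("unknown schedule: %r" % schedule)
```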

@amueller (Owner) commented:
I mostly agree with your assessment. The problem is that we need working default values; if the defaults just don't work, people might be discouraged from using it.
So we need at least one well-working example. And we probably need to include early stopping.

I just wrote an email to some people who have worked on this and on deep learning in general to ask their opinion.

Apart from benchmarking the learning schedules (these are not really learning rates imho), the other thing would be investigating early stopping.
I think it is not included in my branch yet, but there should be an option to split the provided dataset into a test and validation set, and do early stopping on the validation set.
It would be great if you could provide an example where this is beneficial.
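A minimal sketch of that kind of early stopping, splitting off a validation set from the provided data and stopping when the validation score stops improving; the parameter names (validation_fraction, n_iter_no_change) are illustrative, not the branch's API:

```python
import numpy as np
from sklearn.model_selection import train_test_split

def fit_with_early_stopping(model, X, y, validation_fraction=0.1,
                            n_iter_no_change=10, max_iter=200):
    # Hold out part of the data for validation and stop once the validation
    # score has not improved for n_iter_no_change consecutive epochs.
    X_train, X_val, y_train, y_val = train_test_split(
        X, y, test_size=validation_fraction, random_state=0)
    classes = np.unique(y)
    best_score, stale = -np.inf, 0
    for it in range(max_iter):
        if it == 0:
            model.partial_fit(X_train, y_train, classes=classes)
        else:
            model.partial_fit(X_train, y_train)
        score = model.score(X_val, y_val)
        if score > best_score:
            best_score, stale = score, 0
        else:
            stale += 1
            if stale >= n_iter_no_change:
                break
    return model
```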

yanlend and others added 23 commits October 19, 2015 13:46
…into patch-1

Conflicts:
	doc/whats_new.rst
    Empty "residues_" in "LinearRegression": documented the case when the "residues_" attribute is empty.
…d_stuff

[MRG] MAINT remove deprecated stuff that will no longer be supported in 0.18
[MRG+2] MAINT: deprecation warns from StandardScaler std_
[MRG + 1] Bug fix for unnormalized laplacian
Circle ci to build the documentation (only after merge to master)
…raph_is_connected

[MRG+1] Optimize sklearn.manifold._graph_is_connected
…huffle

[MRG+1] Remove shuffling in LabelKFold
Residues can be empty if the rank of X does not satisfy the
conditions described in scipy.linalg.lstsq documentation.
As residues are not that useful (i.e. we're not doing
stat testing), this property is deprecated and will be
removed in sklearn 0.19.
zermelozf and others added 29 commits October 21, 2015 10:24
[MRG+1] The code was raising an exception while plotting 2nd curve
[MRG + 1] Removed deprecated stuff in 0.18
…-1-row-csr-fix

[MRG + 1] max abs scaler 1 row csr fix
…zed-svd

[MRG + 2] ENH: optimizing power iterations phase for randomized_svd
…age as default multioutput parameter of r2_score function after 0.19)

adding deprecation message

fixing deprecation message
confirm deprecation starting from 0.16 and removal after 0.18
Used to print "('violation:', 1.0)". Now "violation: 1.0".
Empty `residues_` in `LinearRegression`: docstring updated
Changed imports in test to separate testing of API and of internals.

See scikit-learn#5509.
[MRG + 1] Remove deprecated stuff from SVM
…metrics_second_pass

[MRG+2] Use uniform_average as default multioutput parameter of r2_score function
…metrics

[MRG+1] removing deprecated files in metrics
…sionadded

DOC versionadded randomized_svd
@amueller closed this Oct 23, 2015