
Issue 430/all probabilities get predictions #433

Merged: 14 commits merged from issue-430/all-probabilities-get-predictions into master on Dec 3, 2018

Conversation

@Lguyogiro (Contributor) commented Oct 18, 2018

This PR implements the feature suggested in issue #430. Namely, it adds an option in generate_predictions to output the probability for all labels, instead of just the indicated positive label. It also updates generate_predictions to behave more like the Learner.predict method in that it prints out the predictions in TSV format, optionally to a file.

NOTE: This will break backwards compatibility, since we are now adding a header even in the single-label case.
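For illustration, here is a minimal sketch of the kind of all-labels TSV output described above (function and variable names here are hypothetical, not SKLL's actual implementation): a header row naming each label, then one tab-separated probability per label per example.

import sys

def write_all_label_probs(ids, probabilities, labels, outputfh=sys.stdout):
    # Header row: the example ID column followed by one column per label.
    print("id\t{}".format("\t".join(labels)), file=outputfh)
    # One row per example, with a probability for every label.
    for id_, probs in zip(ids, probabilities):
        probs_str = "\t".join(str(p) for p in probs)
        print("{}\t{}".format(id_, probs_str), file=outputfh)

# Hypothetical data: two examples scored over three labels.
write_all_label_probs(["ex1", "ex2"],
                      [[0.1, 0.7, 0.2], [0.5, 0.3, 0.2]],
                      ["cat", "dog", "bird"])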

@coveralls commented Oct 18, 2018

Coverage Status: coverage increased (+36.4%) to 92.288% when pulling 67587d4 on issue-430/all-probabilities-get-predictions into 9f7e962 on master.

@desilinguist (Member) left a comment

The actual algorithm for generating the probabilities is fine, but the naming of some variables and arguments can be improved.

Also, can you please note in the PR description that this will break backwards compatibility, since we are now adding a header even in the single-label case?

Also, we need to figure out why the coverage dropped.

Review threads (outdated, resolved):
skll/utilities/generate_predictions.py (5 comments)
tests/test_utilities.py (4 comments)
@desilinguist (Member) commented

@Lguyogiro what’s happening with this PR? Do you need help with the coverage issue?

@Lguyogiro (Contributor, Author) commented

@desilinguist Sorry, I got a little sidetracked with something else. I will add tests and hopefully have the coverage issues resolved by the end of the week.

Robert Pugh and others added 3 commits on November 28, 2018. Commit note: "We need to do this since otherwise the Travis builds time out."
@Lguyogiro (Contributor, Author) commented

This is now passing and the coverage drop has been fixed. Please feel free to review 👍

@mulhod (Contributor) left a comment

Looks good to me!

@@ -14,6 +14,7 @@
 import argparse
 import logging
 import os
+import sys
A contributor commented:

Nitpick: It seems like this import is never used?

@desilinguist (Member) commented Nov 30, 2018

@Lguyogiro actually you still need to update the PR description per my original comment to say that this will break backwards compatibility.

@jbiggsets (Collaborator) left a comment

Two minor comments.

all_labels: bool
A flag indicating whether to return the probabilities for all
labels in each row instead of just returning the probability of
`positive_label`.
@jbiggsets (Collaborator):

This is a nitpick, but can we add "Defaults to False" here?
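For reference, the amended parameter documentation would read along these lines (a sketch of the suggested change, not the final merged text):

all_labels: bool
    A flag indicating whether to return the probabilities for all
    labels in each row instead of just returning the probability of
    `positive_label`. Defaults to False.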

    probs_str = "\t".join([str(p) for p in probabilities])
    print("{}\t{}".format(id_, probs_str), file=outputfh)
else:
    for i, pred in enumerate(preds):
@jbiggsets (Collaborator):

Can we call this j or something else, since we're using i in the outer loop? Same comment below.
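A minimal sketch of the suggested fix (the data and surrounding loop structure here are hypothetical): give the inner loop its own index so it does not shadow the outer loop's `i`.

preds_per_example = [[0.2, 0.8], [0.6, 0.4]]  # hypothetical predictions
for i, preds in enumerate(preds_per_example):
    # Use `j` for the inner index instead of reusing `i`.
    for j, pred in enumerate(preds):
        print("example {}, label {}: {}".format(i, j, pred))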

@Lguyogiro (Contributor, Author) commented

I've addressed @jbiggsets' comments, and also added a unit test for the case of having multiple input files.

@desilinguist (Member) commented

I'll take a look at the other test you added on Monday.

@desilinguist merged commit 075971a into master on Dec 3, 2018
@desilinguist deleted the issue-430/all-probabilities-get-predictions branch on December 3, 2018