Feature engineering #39

Merged 12 commits on May 17, 2017

bkowshik
Contributor

Working on #37

@bkowshik
Contributor Author

Training dataset

The Area Under the ROC Curve (AUC) measures a model’s ability to discriminate between the positive and negative classes. An AUC of 1.0 means the model ranks every positive example above every negative one. An AUC of 0.5 means the model does no better than random guessing.

  • AUC score: 0.8
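The interpretation of AUC given above can be illustrated with a minimal sketch. It is not the code used in this pull request: the function name `pairwise_auc` and the toy labels and scores are made up for illustration. It computes AUC as the fraction of (positive, negative) pairs in which the positive example receives the higher score, counting ties as half.

```python
from itertools import product


def pairwise_auc(labels, scores):
    """AUC as the share of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y]
    neg = [s for y, s in zip(labels, scores) if not y]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p, n in product(pos, neg)
    )
    return wins / (len(pos) * len(neg))


# Toy, made-up data: two negatives followed by two positives.
labels = [0, 0, 1, 1]

perfect = pairwise_auc(labels, [0.1, 0.2, 0.8, 0.9])   # every positive outranks every negative: 1.0
constant = pairwise_auc(labels, [0.5, 0.5, 0.5, 0.5])  # scores carry no information: 0.5
inverted = pairwise_auc(labels, [0.9, 0.8, 0.2, 0.1])  # ranking is exactly backwards: 0.0
```

The pairwise form is quadratic in the number of examples, so it is only meant to show what the number means. For real datasets a library routine such as scikit-learn's `roc_auc_score` is the usual choice.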

Validation dataset

  • AUC score: 0.7

                  Predicted good  Predicted harmful
Labelled good               1750                151
Labelled harmful             541                140

             precision    recall  f1-score   support

      False       0.76      0.92      0.83      1901
       True       0.48      0.21      0.29       681

avg / total       0.69      0.73      0.69      2582
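As a cross-check, the per-class figures in the report can be recomputed from the four confusion matrix counts. The sketch below assumes that `True` in the report corresponds to the harmful class and `False` to the good class, which matches the support values (681 and 1901). The helper name `precision_recall_f1` is hypothetical.

```python
# Counts from the validation confusion matrix, with "harmful" as the positive class.
tn, fp = 1750, 151   # labelled good:    predicted good, predicted harmful
fn, tp = 541, 140    # labelled harmful: predicted good, predicted harmful


def precision_recall_f1(tp, fp, fn):
    """Precision, recall and F1 for one class, given its own tp/fp/fn counts."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Harmful class (the "True" row of the report).
harmful = precision_recall_f1(tp, fp, fn)   # rounds to (0.48, 0.21, 0.29)

# Good class (the "False" row): swap roles so that good is treated as positive.
good = precision_recall_f1(tn, fn, fp)      # rounds to (0.76, 0.92, 0.83)

# Overall accuracy, which equals the support-weighted average recall.
accuracy = (tp + tn) / (tp + tn + fp + fn)  # rounds to 0.73
```

The low recall on the harmful class (about 0.21) means that roughly four out of five harmful examples in the validation set are predicted as good.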

Testing dataset

  • Predicted good: 25,385
  • Predicted harmful: 219

@bkowshik bkowshik merged commit b9591ab into master May 17, 2017
@bkowshik bkowshik deleted the feature-engineering branch May 17, 2017 06:19