Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature Request: TPOTOutlierDetector #794

Open
Y-oHr-N opened this issue Nov 7, 2018 · 1 comment
Open

Feature Request: TPOTOutlierDetector #794

Y-oHr-N opened this issue Nov 7, 2018 · 1 comment

Comments

@Y-oHr-N
Copy link

Y-oHr-N commented Nov 7, 2018

Hello,

scikit-learn 0.20 provides more consistent outlier detection API.
https://speakerdeck.com/albertcthomas/anomaly-detection-in-scikit-learn-ongoing-work-and-future-developments

  • covariance.EllipticEnvelope
  • svm.OneClassSVM
  • ensemble.IsolationForest
  • neighbors.LocalOutlierFactor

So I want an estimator that fits all outlier detection models like TPOTClassifier.

Thank you.

@joanesplazaola
Copy link

Hi, I have been checking the possibility of adding this to TPOT, but I would like to double check the procedure.

As in Novelty detection, the fit function should get both, the fit data (normal data) and a Xy validation set (mixture of normal and novel data with their labels), in order to score the validity of the evaluated models.
Also, those labels of validation should be determined, as sklearn uses 1/-1, and many people uses 1/0.
The training time of those models can also be a problem, mostly for the One Class SVM, as the time needed may exceed easily the expected limits.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants