Feature Request: AutoSklearnOutlierDetector #578

Y-oHr-N · 2018-11-07T06:53:01Z

Hello,

scikit-learn 0.20 provides more consistent outlier detection API.
https://speakerdeck.com/albertcthomas/anomaly-detection-in-scikit-learn-ongoing-work-and-future-developments

covariance.EllipticEnvelope
svm.OneClassSVM
ensemble.IsolationForest
neighbors.LocalOutlierFactor

So I want an estimator that fits all outlier detection models like AutoSklearnClassifier.

Thank you.

mfeurer · 2018-11-19T12:21:06Z

Just for clarification, do you think that these should be part of the pipeline tuned by Auto-sklearn or that there should be a standalone mode AutoSklearnOutlierDetector?

According to the title you want the second thing. From my understanding, this is an unsupervised learning problem. The central assumption in Auto-sklearn is that there as a loss function which can be used to tune the hyperparameters. What would such a loss function look like for outlier detection?

Y-oHr-N · 2018-11-21T02:50:45Z

Thank you for your reply.
As far as I know, threre are two metrics for outlier function.

One is the square of the geometric mean of precision and recall.

outliers - Metrics for one-class classification - Cross Validated
https://stats.stackexchange.com/questions/192530/metrics-for-one-class-classification
Lee, W. S, and Liu, B., "Learning with positive and unlabeled examples using weighted Logistic Regression," In Proceedings of ICML, pp. 448-455, 2003.
https://www.aaai.org/Papers/ICML/2003/ICML03-060.pdf

The other is the area under the Mass-Volume curve.

Goix, N., "How to evaluate the quality of unsupervised anomaly detection algorithms?" In ICML Anomaly Detection Workshop, 2016.
https://arxiv.org/pdf/1607.01152.pdf
Thomas, A., Clémençon, S., Feuillard, V., and Gramfort, A., "Learning hyperparameters for unsupervised anomaly detection," In ICML Anomaly Detection Workshop, 2016.
https://github.com/albertcthomas/anomaly_tuning

I implemented two scikit-learn compatible metrics.
https://github.com/HazureChi/kenchi/blob/master/kenchi/metrics.py

mfeurer · 2018-11-30T12:47:11Z

I'm afraid that I won't have the time to implement something here. Also, I think this is somewhat out of scope for Auto-sklearn if the metrics are not in scikit-learn yet.

github-actions · 2021-05-05T01:51:56Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs for the next 7 days. Thank you for your contributions.

jmren168 · 2023-02-24T06:51:21Z

Hi @mfeurer,

Is it possible to create a customized one-class SVM as a two-class SVM, and then put it into AutoSklearnClassifier?
What I'm trying to do is

add a customized classifier (input: a one-class SVM, and X_train and pseudo_y_train)
make a customized score
if pseudo_y_train are all 0 (only one class), then the score is 1e-5;
otherwise, give a higher socre if it classifies outliers correctly
put the customized classifier and the customized score into AutoSklearnClassifier

Does it sound reasonable and workable?

Any comments are highly appreciated.

JM

franchuterivera added the enhancement A new improvement or feature label Feb 17, 2021

github-actions bot added the stale label May 5, 2021

mfeurer removed the stale label May 6, 2021

eddiebergman mentioned this issue Jul 21, 2023

What's in store for Auto-Sklearn? -- From the Developers #1677

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature Request: AutoSklearnOutlierDetector #578

Feature Request: AutoSklearnOutlierDetector #578

Y-oHr-N commented Nov 7, 2018

mfeurer commented Nov 19, 2018

Uh oh!

Y-oHr-N commented Nov 21, 2018 •

edited

Loading

Uh oh!

mfeurer commented Nov 30, 2018

Uh oh!

github-actions bot commented May 5, 2021

Uh oh!

jmren168 commented Feb 24, 2023 •

edited

Loading

Uh oh!

Feature Request: AutoSklearnOutlierDetector #578

Feature Request: AutoSklearnOutlierDetector #578

Comments

Y-oHr-N commented Nov 7, 2018

mfeurer commented Nov 19, 2018

Uh oh!

Y-oHr-N commented Nov 21, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mfeurer commented Nov 30, 2018

Uh oh!

github-actions bot commented May 5, 2021

Uh oh!

jmren168 commented Feb 24, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Y-oHr-N commented Nov 21, 2018 •

edited

Loading

jmren168 commented Feb 24, 2023 •

edited

Loading