-
Notifications
You must be signed in to change notification settings - Fork 1.3k
Generate a model file and reuse model to classify new samples (eg streaming big data) #30
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Yes, that would be useful, but so far it can't. What you can do is use show_models(). It outputs something like:
which determines the final ensemble. You can use that in order to retrain your model on the full data and pickle it in your own code. |
Looks like scikit learn uses some external libs |
is there a simple programmatic way to convert the output of show_models() into a string that can be used to construct the classifiers in the code? Currently it comes out as
|
Have a look at this. |
Also show_models() can be very slow and occupies a lot of memory - takes tens of minutes and tens of GB in my case. Instead I am using |
The latest version of auto-sklearn features pickleable classifiers/regressors. If there is still an issue with model persistence, please open a new issue. |
Can autosklearn generate a model file that can be reused for classifying new data? Would be useful for classifying big data streams.
The text was updated successfully, but these errors were encountered: