Skip to content

[Question] Integration with sklearn-evaluation #1640

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
idomic opened this issue Jan 12, 2023 · 3 comments
Open

[Question] Integration with sklearn-evaluation #1640

idomic opened this issue Jan 12, 2023 · 3 comments

Comments

@idomic
Copy link

idomic commented Jan 12, 2023

I think adding an integration/tutorial and the right documentation of the framework can go a long way.

Usually when using auto-sklearn you'd get the final model/estimator, and then you'll check performance and results.
This will allow an easier mechanism for the users to solve the second part of it.

@aron-bram
Copy link
Collaborator

aron-bram commented Jan 13, 2023

Hi,

thank you for your suggestion. I will look into sklearn-evaluation and see how it could be combined with the results of our optimization runs. Hopefully small discrepancies between the implementations of some things like "cv_results_" in sklearn and auto-sklearn won't cause too much of a problem. Otherwise this could take a long time.
As a side note, we are currently in the process of updating our sklearn dependency to one of the more recent versions.

Then I could add a tutorial under the examples folder about how to find the optimal pipeline for some problem with auto-sklearn and analyze that pipeline using sklearn-evaluation. And for instance, one about analyzing the explored hyperparameter space using cv_results_.
Is this what you propose?

What do you think @eddiebergman?

@idomic
Copy link
Author

idomic commented Jan 13, 2023

Is this what you propose?

Yes exactly!
For instance, for hyperparameter analysis, we can take the grid approach and evaluate multiple parameters at once.
I'm also happy to help and collaborate on the PR. Once we have a solid example or two we can also blog about it.
Probably the first step is like you're suggesting, testing the compatibility of some parameters that are different between the two like cv_results_

@aron-bram
Copy link
Collaborator

Great! Thank you for the confirmation and for your willingness to collaborate.

I will let you know once I've completed the preliminary steps.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants