-
Notifications
You must be signed in to change notification settings - Fork 1.9k
NaN metric value handling in AutoML #4663
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
@justinormont You suggested fixing two things in #4648 . The second one is being fixed in ML.NET. Does the first work item still need to be done as part of this bug fix? If so, can you please add some repro steps for this bug? |
@CBrauer has a repro in bug.zip from his original bug report: #4648 (comment)
In this example, he is optimizing towards the
The AutoML code does need to be robust to Another way to reproduce is in a debugger and replacing the model's returned metric w/ |
@CBrauer The attached zip file did not contain the csv files for validation and test. I reduced the training file by 40% and created two new files for validation and test. With that, I have not been able to reproduce the issue you are seeing. Can you please update the zip file with the necessary files that reproduce the issue? |
We don't need to split the file into train, validation, and test. AutoML does the split internally. In this case, the training set is used as the validation set just to evaluate metrics from the best AutoML run. The choice of dataset is unrelated to AutoML training, which is the relevant part of the code for this bug. I can reproduce the error with the data and code provided. It uses I'll look at how |
AutoML API code is not handing
NaN
values for metrics. During the sweep, when a model returns aNaN
value for the metric being optimized, AutoML crashes.See background: #4648 (comment)
The text was updated successfully, but these errors were encountered: