Description
System information
- OS version/distro: Windows 10
- .NET Version (eg., dotnet --info): .NET Core 3.0, ML.NET 1.0 preview
Issue
When working with larger datasets, it becomes important to see progress on what the system is doing.
I am testing the framework against a smaller set of production data (21 days at 23:45:00 per day). The smaller training file is ~300 GB with 35630 columns, depending on which multiclass label is predicted.
There is no documentation regarding sizing, and there is also no progress report or early-stopping method that one can hook into. Or is there?
One can limit the number of iterations, but it would be far better to be able to hook into each iteration through an event that reports the current metric and exposes a "Cancel" property. Several existing events use this kind of hook, for example Form.OnClosing.
Cancelling would ideally leave the model fitted in a state where it can be saved. I would also suggest including "duration of training" and "iterations performed" in the event, so that it hooks nicely into progress-reporting tools.
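To make the request concrete, here is a minimal sketch of the kind of hook described above. It is not ML.NET code: `TrainingIterationEventArgs`, `ToyTrainer`, `IterationCompleted` and `Fit` are hypothetical names invented for illustration, and the "training" is a toy loop that halves a loss value on each iteration. The only real framework type it relies on is `System.ComponentModel.CancelEventArgs`, which supplies the `Cancel` property.

```csharp
using System;
using System.ComponentModel;
using System.Diagnostics;

// Hypothetical event payload (not an ML.NET type). Cancel is inherited
// from CancelEventArgs, the same base class used by closing-style events.
public sealed class TrainingIterationEventArgs : CancelEventArgs
{
    public TrainingIterationEventArgs(int iterationsPerformed, double currentMetric, TimeSpan trainingDuration)
    {
        IterationsPerformed = iterationsPerformed;
        CurrentMetric = currentMetric;
        TrainingDuration = trainingDuration;
    }

    public int IterationsPerformed { get; }
    public double CurrentMetric { get; }
    public TimeSpan TrainingDuration { get; }
}

// Hypothetical stand-in for a trainer; the loop only simulates training.
public sealed class ToyTrainer
{
    public event EventHandler<TrainingIterationEventArgs> IterationCompleted;

    // Runs up to maxIterations and returns how many iterations completed.
    public int Fit(int maxIterations)
    {
        var stopwatch = Stopwatch.StartNew();
        double loss = 1.0;
        int completed = 0;

        for (int i = 1; i <= maxIterations; i++)
        {
            loss *= 0.5; // placeholder for one real training iteration
            completed = i;

            var args = new TrainingIterationEventArgs(i, loss, stopwatch.Elapsed);
            IterationCompleted?.Invoke(this, args);

            // Stop after a finished iteration so the state stays usable.
            if (args.Cancel)
            {
                break;
            }
        }

        return completed;
    }
}

public static class Program
{
    public static void Main()
    {
        var trainer = new ToyTrainer();
        trainer.IterationCompleted += (sender, e) =>
        {
            Console.WriteLine($"Iteration {e.IterationsPerformed}: metric={e.CurrentMetric}, elapsed={e.TrainingDuration}");
            if (e.CurrentMetric < 0.1)
            {
                e.Cancel = true; // early stopping requested by the subscriber
            }
        };

        int ran = trainer.Fit(100);
        Console.WriteLine($"Stopped after {ran} iterations");
    }
}
```

The cancel flag is only checked after an iteration has finished, so a real trainer following this pattern could return the model as fitted so far instead of discarding it.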
I guess it could be part of IProgressChannel?
It would be nice if this could be considered.