Well, not exactly. In the case of ranking, for example, it is critical that the split be stratified on a sort of group ID. But we might decide that offering stratification as an option is enough, and if someone doesn't realize stratification is important for a task like ranking, well... that's just too bad.
But I would at least be aware of this issue.
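To make the ranking concern concrete, here is a minimal sketch of a split that keeps every row with the same group ID on the same side. It is plain C# over in-memory rows, not ML.NET's implementation; `GroupSplitSketch`, `SplitByGroup`, and its parameters are hypothetical names chosen for illustration. It also assumes `testFraction` applies to the number of groups, so the row-level ratio is only approximate.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical helper, not part of ML.NET.
public static class GroupSplitSketch
{
    // Splits rows into (Train, Test) so that all rows sharing a group ID
    // end up in the same set. testFraction is the fraction of *groups*
    // (not rows) assigned to the test set.
    public static (List<T> Train, List<T> Test) SplitByGroup<T, TKey>(
        IEnumerable<T> rows, Func<T, TKey> groupId,
        double testFraction = 0.1, int seed = 0)
    {
        if (testFraction < 0 || testFraction > 1)
            throw new ArgumentOutOfRangeException(nameof(testFraction));

        var rowList = rows.ToList();
        var groups = rowList.Select(groupId).Distinct().ToList();

        // Fisher-Yates shuffle of the distinct group IDs, seeded for repeatability.
        var rng = new Random(seed);
        for (int i = groups.Count - 1; i > 0; i--)
        {
            int j = rng.Next(i + 1);
            (groups[i], groups[j]) = (groups[j], groups[i]);
        }

        // The first testGroupCount shuffled groups become the test groups.
        int testGroupCount = (int)Math.Round(groups.Count * testFraction);
        var testGroups = new HashSet<TKey>(groups.Take(testGroupCount));

        var train = rowList.Where(r => !testGroups.Contains(groupId(r))).ToList();
        var test = rowList.Where(r => testGroups.Contains(groupId(r))).ToList();
        return (train, test);
    }
}
```

A plain row-level random split would scatter the documents of one query across both sets. That leakage is what stratifying on the group ID is meant to prevent.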
Also this does imply a breaking change, so I'm adding it to Project 13.
Currently `TrainTestSplit` can be found in the various training tasks in `MLContext`:

machinelearning/test/Microsoft.ML.Tests/Scenarios/Api/CookbookSamples/CookbookSamples.cs
Lines 604 to 605 in 533171a

Since that is a data operation that only involves splitting data, and since it is independent of the training task, it would make more sense to have it in `mlContext.Data`. The previous line would look like: