MLContext.Transform should be further organized #2361

artidoro · 2019-02-01T06:34:50Z

It seems that MLContext.Transforms can be further organized.
In MLContext.Transforms it is possible to access many transforms directly:

IndicateMissingValues
ReplaceMissingValues
ApplyOnnxModel
Concatenate
LoadImage
Normalize
Resize
ScoreTensorFlowModel
SelectColumns
TensorFlow

There are also subgroups:

Transforms.Text
Transforms.Projection
Transforms.Categorical
Transforms.Conversion
Transforms.FeatureSelection

Suggestions:

It seems that more groupings can be made:
- Transforms.Image
- Transforms.MissingValues
- Transforms.TensorFlow
And maybe Normalize can be moved to Transforms.Projection.

Question:
Does it even make sense to have some transforms in a subgroup while others are directly accessible?

/cc: @rogancarr, @sfilipi, @TomFinley


rogancarr · 2019-02-01T17:32:56Z

I'd like to move schema operations into something like Schema, e.g. mlContext.Transforms.Schema.DropColumns().

TomFinley · 2019-02-01T22:31:14Z

Transforms.MissingValues might be OK, but I'm not sure I see the point in having something if there are only two entries. Honestly I am not too excited about that.

I do not think Transforms.Image or Transforms.TensorFlow are sensible to have. Bear in mind that these extensions are not seen unless someone explicitly chooses to include the relevant NuGet packages. So if we were to have another "subcategory," everyone would have to see those whether they had included the relevant NuGet package or not. (Since there is no such thing as an extension property.) So it would just have to be there, and certainly it would be bad to have these "categories" which are by default empty. And if someone has chosen to explicitly rely on those NuGet packages, that's a pretty solid indication that they want to use that code. So why hide it?
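To illustrate the extension-method point above, here is a minimal sketch. It does not use the real ML.NET types: `TransformsCatalog`, `TextCatalog`, `ImageCatalogExtensions`, and the string-returning `LoadImage` are hypothetical stand-ins, chosen only to show why a method can be contributed by an optional package while a sub-catalog property has to be declared on the core type.

```csharp
using System;

// Hypothetical stand-in for the core catalog type (not the real ML.NET class).
public sealed class TransformsCatalog
{
    // A subgroup is an ordinary property, so it must be declared here, in the
    // core assembly, and is visible to every user whether or not it has members.
    public TextCatalog Text { get; } = new TextCatalog();
}

// Hypothetical subgroup type.
public sealed class TextCatalog { }

// Imagine this static class shipping in a separate, optional package.
public static class ImageCatalogExtensions
{
    // Extension method: it shows up on TransformsCatalog only for code that
    // references the package (and imports the namespace) that defines it.
    public static string LoadImage(this TransformsCatalog catalog, string column)
        => $"LoadImage({column})";
}

public static class Program
{
    public static void Main()
    {
        var transforms = new TransformsCatalog();

        // Reads like a member of the catalog, but is supplied by the optional package.
        Console.WriteLine(transforms.LoadImage("ImagePath"));
    }
}
```

Under these assumptions, a `Transforms.Image` subgroup would have to be a property like `Text` above, so it would appear even for users who never referenced the image package.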

So I don't necessarily agree with the points in this issue? Is the fact that there are some transforms at the root of the hierarchy really so very bad?

artidoro · 2019-07-03T02:09:46Z

This is no longer applicable, since we have now released with the categories currently present in MLContext. When we consider breaking changes, we might think about reorganizing MLContext.

artidoro added the API (Issues pertaining the friendly API) label Feb 1, 2019

sfilipi mentioned this issue Feb 1, 2019

Image analytics documentation, samples, internalization #2372

Merged

artidoro closed this as completed Jul 3, 2019

ghost locked as resolved and limited conversation to collaborators Mar 25, 2022
