-
Notifications
You must be signed in to change notification settings - Fork 1.9k
Towards #3204 - documentation for FeatureContributionCalculatingEstimator #3384
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
/// StochasticGradientDescent (SGD), SymbolicStochasticGradientDescent, GeneralizedAdditiveModels (GAM), | ||
/// FastForest, FastTree, LightGbm | ||
/// Ranking: | ||
/// FastTree, LightGbm |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Unfortunately this does not seem to look good in the xml doc:
https://docs.microsoft.com/en-us/dotnet/api/microsoft.ml.transforms.featurecontributioncalculatingtransformer?view=ml-dotnet
Could you find a way to itemize these points? Or another approach that will make it easier to read? #Resolved
/// | Input column data type | Vector of floats | | ||
/// | Output column data type | Vector of floats | | ||
/// | ||
/// <para> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This won't be processed or will cause an error, as you are inside a markdown block here. #Resolved
/// it can be useful to inspect which features influenced them most significantly. This transformer computes a model-specific | ||
/// list of per-feature contributions to the score for each example. These contributions can be positive (they make the score higher) or negative | ||
/// (they make the score lower). | ||
/// </para> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove #Resolved
/// list of per-feature contributions to the score for each example. These contributions can be positive (they make the score higher) or negative | ||
/// (they make the score lower). | ||
/// </para> | ||
/// <para> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Same #Resolved
/// | | | | ||
/// | -- | -- | | ||
/// | Does this estimator need to look at the data to train its parameters? | No | | ||
/// | Input column data type | Vector of floats | |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@sfilipi @shmoradims were we going to use Single instead of float? #Resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes
Use System.Single instead of 'float'. 'float' is a C# keywork, not a .NET type, and F# uses different terminology.
in xml
xref:System.Single in markdown
Same as above for 'double'
In reply to: 276422052 [](ancestors = 276422052)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/// (they make the score lower). | ||
/// </para> | ||
/// <para> | ||
/// Feature Contribution Calculation is currently supported for the following models: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You could make this a bulleted list in markdown
Regression
- OrdinaryLeastSquares
- etc
Would also be good to get the algorithm names to be exactly the same as the name of the trainer classes #Resolved
/// and the score obtained by taking the opposite decision at the node corresponding to feature F1. This algorithm extends naturally to models with | ||
/// many decision trees. | ||
/// </para> | ||
/// See the See Also section for links to examples of the usage. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not sure this line is necessary #Resolved
/// the feature value. | ||
/// </para> | ||
/// <para> | ||
/// For tree-based models, the calculation of feature contribution essentially consists in determining which splits in the tree have the most impact |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This gives a good description of tree based models. Worth mentioning how it works for the other models?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The description for linear models and GAM is above. Do you think it should be more detailed?
In reply to: 276423670 [](ancestors = 276423670)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/// Estimator producing a FeatureContributionCalculatingTransformer which scores the model on an input dataset and | ||
/// computes model-specific contribution scores for each feature. | ||
/// Computes model-specific per-feature contributions to the score of each input vector. | ||
/// See the list of currently supported models below. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
See the list of currently supported models below. [](start = 8, length = 49)
I would remove this, because this line displays on the IntelliSense, and there won't be an option to see the remarks section there. #Resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Codecov Report
@@ Coverage Diff @@
## master #3384 +/- ##
==========================================
- Coverage 72.71% 72.69% -0.02%
==========================================
Files 807 807
Lines 145172 145172
Branches 16225 16225
==========================================
- Hits 105559 105538 -21
- Misses 35195 35219 +24
+ Partials 4418 4415 -3
|
@natke if this pr looks good to you could you unblock it? |
Adhering to the template in #3204 (comment) for the ColumnCopying estimator extensions, estimator, transformer.