Skip to content

Towards #3204 -FeatureSelection #3424

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 2 commits into from
Apr 19, 2019
Merged

Conversation

sfilipi
Copy link
Member

@sfilipi sfilipi commented Apr 19, 2019

Adhering to the #3204 template for the FeatureSelection estimator and transformer.

@sfilipi sfilipi changed the title Towards #3204 Towards #3204 -FeatureSelection Apr 19, 2019
@@ -18,7 +18,47 @@

namespace Microsoft.ML.Transforms
{
/// <include file='doc.xml' path='doc/members/member[@name="CountFeatureSelection"]' />
/// <summary>
/// Selects the slots for which the count of non-default values is greater than or equal to a threshold.
Copy link

@shmoradims shmoradims Apr 19, 2019

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

default values [](start = 54, length = 14)

create new issue for later: we need to specify for each input type what's the default value. for e.g. it's not clear if for text, default is null, empty string, or whitespaces. same goes with key type. #Resolved

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Created: #3443


In reply to: 277027260 [](ancestors = 277027260)

/// |True |4,0 |
/// |False|0,5 |
/// |True |4,0 |
/// |False|0,5 |
Copy link

@shmoradims shmoradims Apr 19, 2019

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we need these examples? we already have similar samples. #Resolved

Copy link
Member Author

@sfilipi sfilipi Apr 19, 2019

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it is really hard to understand without them, i think.
The samples should be for how to use it, not to understand what is it doing. @natke what do you think?
#Resolved

Copy link
Contributor

@natke natke Apr 19, 2019

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I like the samples. It helped me understand when I read it. #Resolved

/// The count feature selection can remove the features generated by the hash transform that have no data in the examples.
///
/// For example, if we set the count to 3 and fit the estimator, apply the transformer to the following Features column,
/// we would see the second solot, containing: NaN, 5, 5, NaN values being dropped because that slot has only two non-default values, the two 5 values.
Copy link

@shmoradims shmoradims Apr 19, 2019

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

solot [](start = 32, length = 5)

slot

please use spell checker. we won't catch all the typos manually.
https://marketplace.visualstudio.com/items?itemName=EWoodruff.VisualStudioSpellChecker #Resolved

Copy link

@shmoradims shmoradims left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

:shipit:

@sfilipi sfilipi merged commit 20390ec into dotnet:master Apr 19, 2019
@sfilipi sfilipi deleted the featureSelection branch April 19, 2019 23:38
@ghost ghost locked as resolved and limited conversation to collaborators Mar 22, 2022
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants