Created samples for 'FeaturizeText' API. #3120

zeahmed · 2019-03-27T20:36:49Z

Related to #1209.

shmoradims · 2019-03-27T21:13:24Z

docs/samples/Microsoft.ML.Samples/Dynamic/Transforms/Text/FeaturizeText.cs

+            // as well as the source of randomness.
+            var mlContext = new MLContext();
+
+            // Get a small dataset as an IEnumerable.


Get [](start = 15, length = 3)

Create / Define #Resolved

shmoradims · 2019-03-27T21:13:48Z

docs/samples/Microsoft.ML.Samples/Dynamic/Transforms/Text/FeaturizeText.cs

+            // Get a small dataset as an IEnumerable.
+            var samples = new List<TextData>()
+            {
+                new TextData(){ Text ="ML.NET's FeaturizeText API uses a composition of several basic transforms to convert text into numeric features." },


extra space? = " #Resolved

codecov · 2019-03-27T21:17:09Z

Codecov Report

Merging #3120 into master will decrease coverage by <.01%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master    #3120      +/-   ##
==========================================
- Coverage   72.52%   72.51%   -0.01%     
==========================================
  Files         808      808              
  Lines      144665   144665              
  Branches    16198    16198              
==========================================
- Hits       104913   104910       -3     
- Misses      35342    35344       +2     
- Partials     4410     4411       +1

Flag	Coverage Δ
#Debug	`72.51% <ø> (-0.01%)`	⬇️
#production	`68.11% <ø> (-0.01%)`	⬇️
#test	`88.81% <ø> (ø)`	⬆️

Impacted Files	Coverage Δ
src/Microsoft.ML.Transforms/Text/LdaTransform.cs	`89.26% <0%> (-0.63%)`	⬇️
...StandardTrainers/Standard/LinearModelParameters.cs	`60.05% <0%> (-0.27%)`	⬇️
...soft.ML.Data/DataLoadSave/Text/TextLoaderCursor.cs	`84.7% <0%> (-0.21%)`	⬇️
src/Microsoft.ML.Maml/MAML.cs	`26.21% <0%> (+1.45%)`	⬆️

codecov · 2019-03-27T21:17:52Z

Codecov Report

Merging #3120 into master will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master    #3120   +/-   ##
=======================================
  Coverage   72.52%   72.52%           
=======================================
  Files         808      808           
  Lines      144665   144665           
  Branches    16198    16198           
=======================================
  Hits       104913   104913           
+ Misses      35342    35341    -1     
- Partials     4410     4411    +1

Flag	Coverage Δ
#Debug	`72.52% <ø> (ø)`	⬆️
#production	`68.12% <ø> (ø)`	⬆️
#test	`88.81% <ø> (ø)`	⬆️

Impacted Files	Coverage Δ
src/Microsoft.ML.Transforms/Text/TextCatalog.cs	`41.66% <ø> (ø)`	⬆️
...StandardTrainers/Standard/LinearModelParameters.cs	`60.05% <0%> (-0.27%)`	⬇️
...soft.ML.Data/DataLoadSave/Text/TextLoaderCursor.cs	`84.7% <0%> (-0.21%)`	⬇️
...ML.Transforms/Text/StopWordsRemovingTransformer.cs	`86.1% <0%> (-0.16%)`	⬇️
src/Microsoft.ML.Maml/MAML.cs	`26.21% <0%> (+1.45%)`	⬆️

shmoradims · 2019-03-27T21:22:26Z

docs/samples/Microsoft.ML.Samples/Dynamic/Transforms/Text/FeaturizeTextWithOptions.cs

+                // Use ML.NET's built-in stop word remover
+                StopWordsRemoverOptions = new StopWordsRemovingEstimator.Options() { Language = TextFeaturizingEstimator.Language.English },
+                WordFeatureExtractor = new WordBagEstimator.Options() { NgramLength = 1 },
+                CharFeatureExtractor = new WordBagEstimator.Options() { NgramLength = 1 },


1 [](start = 86, length = 1)

is this single char tokenization? it would be just the alphabets. is it ever useful? #Resolved

Yes, can be useful in some cases. but lets change it to 3-gram which is more useful.

In reply to: 269776047 [](ancestors = 269776047)

shmoradims

singlis

zeahmed · 2019-03-28T22:05:28Z

Thanks!

Created samples for 'FeaturizeText' API.

584f365

zeahmed requested review from shmoradims, sfilipi and rogancarr March 27, 2019 20:36

shmoradims reviewed Mar 27, 2019

View reviewed changes

shmoradims approved these changes Mar 27, 2019

View reviewed changes

zeahmed added 2 commits March 27, 2019 15:11

Addressed reviewers' comments.

0591c2b

Reference the sample from the API example section.

44fa39b

sfilipi mentioned this pull request Mar 27, 2019

API reference - Samples for Transforms #1209

Closed

singlis approved these changes Mar 28, 2019

View reviewed changes

zeahmed merged commit 233bc2d into dotnet:master Mar 28, 2019

zeahmed added a commit to zeahmed/machinelearning that referenced this pull request Apr 8, 2019

Created samples for 'FeaturizeText' API. (dotnet#3120)

a9335da

zeahmed mentioned this pull request Apr 8, 2019

Cherry pick for samples (Text) #3240

Closed

ghost locked as resolved and limited conversation to collaborators Mar 23, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Created samples for 'FeaturizeText' API. #3120

Created samples for 'FeaturizeText' API. #3120

Uh oh!

zeahmed commented Mar 27, 2019

Uh oh!

shmoradims Mar 27, 2019 •

edited by zeahmed

Loading

Uh oh!

shmoradims Mar 27, 2019 •

edited by zeahmed

Loading

Uh oh!

codecov bot commented Mar 27, 2019

Uh oh!

codecov bot commented Mar 27, 2019 •

edited

Loading

Uh oh!

shmoradims Mar 27, 2019 •

edited by zeahmed

Loading

Uh oh!

zeahmed Mar 27, 2019

Uh oh!

shmoradims left a comment

Uh oh!

singlis left a comment

Uh oh!

zeahmed commented Mar 28, 2019

Uh oh!

Uh oh!

Created samples for 'FeaturizeText' API. #3120

Created samples for 'FeaturizeText' API. #3120

Uh oh!

Conversation

zeahmed commented Mar 27, 2019

Uh oh!

shmoradims Mar 27, 2019 • edited by zeahmed Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

shmoradims Mar 27, 2019 • edited by zeahmed Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

codecov bot commented Mar 27, 2019

Codecov Report

Uh oh!

codecov bot commented Mar 27, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

shmoradims Mar 27, 2019 • edited by zeahmed Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

zeahmed Mar 27, 2019

Choose a reason for hiding this comment

Uh oh!

shmoradims left a comment

Choose a reason for hiding this comment

Uh oh!

singlis left a comment

Choose a reason for hiding this comment

Uh oh!

zeahmed commented Mar 28, 2019

Uh oh!

Uh oh!

shmoradims Mar 27, 2019 •

edited by zeahmed

Loading

shmoradims Mar 27, 2019 •

edited by zeahmed

Loading

codecov bot commented Mar 27, 2019 •

edited

Loading

shmoradims Mar 27, 2019 •

edited by zeahmed

Loading