Skip to content

Pass weighting value from ProduceWordBags to WordBagEstimator constru… #3135

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Mar 29, 2019

Conversation

MarcinJuraszek
Copy link
Contributor

Fixes #3134

@codecov
Copy link

codecov bot commented Mar 28, 2019

Codecov Report

Merging #3135 into master will decrease coverage by <.01%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master    #3135      +/-   ##
==========================================
- Coverage   72.52%   72.52%   -0.01%     
==========================================
  Files         808      808              
  Lines      144665   144665              
  Branches    16198    16198              
==========================================
- Hits       104914   104912       -2     
- Misses      35340    35343       +3     
+ Partials     4411     4410       -1
Flag Coverage Δ
#Debug 72.52% <ø> (-0.01%) ⬇️
#production 68.12% <ø> (ø) ⬆️
#test 88.81% <ø> (-0.01%) ⬇️
Impacted Files Coverage Δ
src/Microsoft.ML.Transforms/Text/TextCatalog.cs 41.66% <ø> (ø) ⬆️
...soft.ML.TestFramework/DataPipe/TestDataPipeBase.cs 73.7% <0%> (-0.34%) ⬇️
...ML.Transforms/Text/StopWordsRemovingTransformer.cs 86.26% <0%> (+0.15%) ⬆️

Copy link
Contributor

@Ivanidzo4ka Ivanidzo4ka left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

:shipit:

@@ -283,7 +283,7 @@ public static class TextCatalog
int maximumNgramsCount = NgramExtractingEstimator.Defaults.MaximumNgramsCount,
NgramExtractingEstimator.WeightingCriteria weighting = NgramExtractingEstimator.WeightingCriteria.Tf)
=> new WordBagEstimator(Contracts.CheckRef(catalog, nameof(catalog)).GetEnvironment(),
outputColumnName, inputColumnName, ngramLength, skipLength, useAllLengths, maximumNgramsCount);
outputColumnName, inputColumnName, ngramLength, skipLength, useAllLengths, maximumNgramsCount, weighting);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

weighting [](start = 111, length = 9)

Would be nice to add baseline test, something similar to TestWordEmbeddings.
Thank you for catching and fixing it!

@Ivanidzo4ka Ivanidzo4ka requested review from wschin and zeahmed March 28, 2019 23:24
Copy link
Contributor

@zeahmed zeahmed left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! Thanks for the catching it.

@Ivanidzo4ka Ivanidzo4ka merged commit e95f31d into dotnet:master Mar 29, 2019
shauheen pushed a commit to shauheen/machinelearning that referenced this pull request Apr 2, 2019
@ghost ghost locked as resolved and limited conversation to collaborators Mar 23, 2022
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants