WordEmbedding Tests added plus added dimension check for the first row #880

Merged 3 commits on Sep 12, 2018

Anipik (Contributor) commented Sep 10, 2018

Fixes #873
Related PR #820

Please review it after 55fb378.

cc @danmosemsft @sfilipi @eerhardt @Ivanidzo4ka @justinormont

Anipik (Contributor Author) commented Sep 11, 2018

Benchmark results for this test

Toolchain=netcoreapp2.1  MaxIterationCount=20  WarmupCount=1  
BigramAndTrigramBenchmark.CV_Multiclass_WikiDetox_WordEmbeddings_OVAAveragedPerceptron: 
Mean = 291.0139 s, StdErr = 0.3224 s (0.11%); N = 15, StdDev = 1.2487 s
Min = 288.9763 s, Q1 = 289.8307 s, Median = 291.3262 s, Q3 = 292.1041 s, Max = 292.6711 s
IQR = 2.2734 s, LowerFence = 286.4205 s, UpperFence = 295.5143 s
ConfidenceInterval = [289.6789 s; 292.3489 s] (CI 99.9%), Margin = 1.3350 s (0.46% of Mean)
Skewness = -0.19, Kurtosis = 1.39, MValue = 2
-------------------- Histogram --------------------
[288.533 s ; 293.114 s) | @@@@@@@@@@@@@@@
---------------------------------------------------

BigramAndTrigramBenchmark.CV_Multiclass_WikiDetox_WordEmbeddings_SDCAMC:
Mean = 191.1037 s, StdErr = 0.2887 s (0.15%); N = 13, StdDev = 1.0408 s
Min = 190.0461 s, Q1 = 190.2511 s, Median = 190.7204 s, Q3 = 192.2263 s, Max = 193.1013 s
IQR = 1.9752 s, LowerFence = 187.2882 s, UpperFence = 195.1891 s
ConfidenceInterval = [189.8574 s; 192.3501 s] (CI 99.9%), Margin = 1.2464 s (0.65% of Mean)
Skewness = 0.66, Kurtosis = 1.74, MValue = 2
-------------------- Histogram --------------------
[189.659 s ; 193.489 s) | @@@@@@@@@@@@@
---------------------------------------------------

| Method | Mean | Error | StdDev | Extra Metric | Gen 0 | Gen 1 | Gen 2 | Allocated |
|---|---|---|---|---|---|---|---|---|
| CV_Multiclass_WikiDetox_WordEmbeddings_OVAAveragedPerceptron | 291.0 s | 1.335 s | 1.249 s | - | 20626000.0000 | 4697000.0000 | 108000.0000 | 817.77 KB |
| CV_Multiclass_WikiDetox_WordEmbeddings_SDCAMC | 191.1 s | 1.246 s | 1.041 s | - | 15681000.0000 | 3279000.0000 | 73000.0000 | 789.73 KB |
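
For readers cross-checking the summary lines above, the derived figures appear to follow from the raw ones: StdErr is consistent with StdDev / sqrt(N), and the fences are consistent with Q1 - 1.5 * IQR and Q3 + 1.5 * IQR. The sketch below recomputes them from the OVAAveragedPerceptron values. The class and method names (`BenchmarkSummaryCheck`, `StdErr`, `Fences`) are hypothetical helpers written for this illustration and are not part of the PR or of the benchmarking tool; the 1.5 * IQR rule is an assumption inferred from the reported numbers.

```csharp
using System;

// Hypothetical helper for illustration only; not part of the PR.
public static class BenchmarkSummaryCheck
{
    // Standard error of the mean: StdDev / sqrt(N).
    public static double StdErr(double stdDev, int n) => stdDev / Math.Sqrt(n);

    // Fences placed 1.5 * IQR beyond the quartiles
    // (assumed rule; it matches the values reported above).
    public static (double Lower, double Upper) Fences(double q1, double q3)
    {
        double iqr = q3 - q1;
        return (q1 - 1.5 * iqr, q3 + 1.5 * iqr);
    }

    public static void Main()
    {
        // Inputs copied from the OVAAveragedPerceptron summary above.
        const double stdDev = 1.2487, q1 = 289.8307, q3 = 292.1041, margin = 1.3350;
        const int n = 15;

        double stdErr = StdErr(stdDev, n);
        var (lower, upper) = Fences(q1, q3);

        Console.WriteLine($"StdErr     = {stdErr:F4} s (reported 0.3224 s)");
        Console.WriteLine($"LowerFence = {lower:F4} s (reported 286.4205 s)");
        Console.WriteLine($"UpperFence = {upper:F4} s (reported 295.5143 s)");

        // Margin / StdErr recovers the critical value behind the 99.9% interval.
        Console.WriteLine($"Margin / StdErr = {margin / stdErr:F2}");
    }
}
```

The same check can be repeated for the SDCAMC run by substituting its StdDev, N, Q1, and Q3. Small differences in the last digit are expected because the inputs are already rounded to four decimals.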

Zruty0 (Contributor) commented Sep 11, 2018

@sfilipi is added to the review. #Closed

eerhardt (Member) left a comment

:shipit:

justinormont (Contributor) left a comment

LGTM

public void CV_Multiclass_WikiDetox_WordEmbeddings_SDCAMC()
{
    string cmd = @"CV tr=SDCAMC k=5 loader=TextLoader{quote=- sparse=- col=Label:R4:0 col=rev_id:TX:1 col=comment:TX:2 col=logged_in:BL:4 col=ns:TX:5 col=sample:TX:6 col=split:TX:7 col=year:R4:3 header=+} data=" + _dataPath_Wiki + " xf=Convert{col=logged_in type=R4} xf=CategoricalTransform{col=ns} xf=TextTransform{col=FeaturesText:comment tokens=+ wordExtractor={} charExtractor={}} xf=WordEmbeddingsTransform{col=FeaturesWordEmbedding:FeaturesText_TransformedText model=FastTextWikipedia300D} xf=Concat{col=Features:FeaturesWordEmbedding,logged_in,ns}";
    using (var tlc = new TlcEnvironment(verbose: false, sensitivity: MessageSensitivity.None, outWriter: EmptyWriter.Instance))

sfilipi (Member) commented Sep 12, 2018

indent those, maybe, for readability

Anipik (Contributor Author) replied

Indent the cmd or the tlc environment?

Member replied

The cmd



justinormont merged commit 4e0800c into dotnet:master on Sep 12, 2018
Anipik deleted the wordEmbed3 branch on October 10, 2018 18:23
ghost locked the conversation as resolved and limited it to collaborators on Mar 29, 2022
5 participants