Sweep Range of L2RegularizerWeight in AveragedPerceptron #579

SolyarA · 2018-07-24T18:18:20Z

Fixes #567

dnfclas · 2018-07-24T18:18:30Z

All CLA requirements met.

TomFinley

Thank you @SolyarA ... this looks OK to me @justinormont is it along the lines of what you wanted in #567?

justinormont · 2018-07-25T06:52:15Z

@TomFinley, @Zruty0 : What's the implications of missing the range of 0.4 to 0.5 in L2RegularizerWeight for AveragedPerceptron?

A better fix for this would be to add a param to the sweep range to note if the range boundaries are inclusive vs. exclusive.

Zruty0

Zruty0 · 2018-07-25T15:17:22Z

I cannot say what the implications are, I don't have an intuition on how the optimizer would behave close to 0.5 boundary.

As for inclusive vs. exclusive boundaries: I believe they were originally included in the code (as Min/Max and also Inf/Lim, or something along these lines). But I assume it was deemed unnecessary complexity and removed? I don't find any trace of this anymore.

In reply to: 407652667 [](ancestors = 407652667)

TomFinley · 2018-07-25T20:59:54Z

I'm not sure "close" to 0.5 is actually a completely sensible value. See here:

machinelearning/src/Microsoft.ML.StandardLearners/Standard/Online/AveragedLinear.cs

Lines 82 to 83 in 5e08fa1

    
           // Weights are scaled down by 2 * L2 regularization on each update step, so 0.5 would scale all weights to 0, which is not sensible. 
        
           Contracts.CheckUserArg(0 <= args.L2RegularizerWeight && args.L2RegularizerWeight < 0.5, nameof(args.L2RegularizerWeight), "must be in range [0, 0.5)");

and here:

machinelearning/src/Microsoft.ML.StandardLearners/Standard/Online/AveragedLinear.cs

Line 209 in 5e08fa1

WeightsScale *= 1 - 2 * Args.L2RegularizerWeight; // L2 regularization.

Indeed I feel like this is all somewhat haphazard, and whoever introduced this sweep range was making the mistake of confusing sweep range with defining valid values... which is not the point at all. Anyway, I'm inclined to just accept @justinormont if that is all right. It seems though like if we are going to have continuous values that the notion of inclusive vs. exclusive bounds needs to be accounted for somehow, not sure why such a concept would be removed. 😦

justinormont · 2018-07-26T00:02:13Z

Thanks for pushing in. And thanks @SolyarA for your PR.

@TomFinley: Agreed; we will want to reduce the range of the sweep params from the valid to the useful ranges. This will speed up the hyperparameter optimization. The only reason I see to keep the ranges as wide as the valid is if we can find examples where the extreme values led to good scores. We have further ideas on how to focus the sweeper's energy towards useful ranges of hyperparameters, so perhaps the work of figuring out the useful ranges won't be needed.

* Changed range of L2RegularizerWeight parameter in AveragedPerceptron

SolyarA added 3 commits July 24, 2018 19:10

Changed range of L2RegularizerWeight parameter

bcc2298

Regenerated CSharpApi file

fd33df7

Updated core_manifest.json

ff96473

TomFinley requested review from justinormont and sfilipi July 24, 2018 20:04

TomFinley approved these changes Jul 24, 2018

View reviewed changes

Zruty0 approved these changes Jul 25, 2018

View reviewed changes

TomFinley merged commit 7fea0af into dotnet:master Jul 25, 2018

eerhardt pushed a commit to eerhardt/machinelearning that referenced this pull request Jul 27, 2018

Sweep Range of L2RegularizerWeight in AveragedPerceptron (dotnet#579)

efad32c

* Changed range of L2RegularizerWeight parameter in AveragedPerceptron

codemzs pushed a commit to codemzs/machinelearning that referenced this pull request Aug 1, 2018

Sweep Range of L2RegularizerWeight in AveragedPerceptron (dotnet#579)

cfd8caa

* Changed range of L2RegularizerWeight parameter in AveragedPerceptron

justinormont mentioned this pull request Nov 15, 2019

ML.NET Builder Get Started sample throw excepion when Train time longer than 60 sec dotnet/machinelearning-modelbuilder#238

Closed

ghost locked as resolved and limited conversation to collaborators Mar 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sweep Range of L2RegularizerWeight in AveragedPerceptron #579

Sweep Range of L2RegularizerWeight in AveragedPerceptron #579

SolyarA commented Jul 24, 2018

dnfclas commented Jul 24, 2018 •

edited

Loading

TomFinley left a comment

justinormont commented Jul 25, 2018

Zruty0 left a comment

Zruty0 commented Jul 25, 2018

TomFinley commented Jul 25, 2018

justinormont commented Jul 26, 2018

Sweep Range of L2RegularizerWeight in AveragedPerceptron #579

Sweep Range of L2RegularizerWeight in AveragedPerceptron #579

Conversation

SolyarA commented Jul 24, 2018

dnfclas commented Jul 24, 2018 • edited Loading

TomFinley left a comment

Choose a reason for hiding this comment

justinormont commented Jul 25, 2018

Zruty0 left a comment

Choose a reason for hiding this comment

Zruty0 commented Jul 25, 2018

TomFinley commented Jul 25, 2018

justinormont commented Jul 26, 2018

dnfclas commented Jul 24, 2018 •

edited

Loading