Add a note that NDCG returns values between 0 and 100 #2637

rogancarr · 2019-02-19T22:35:23Z

NDCG, the normalized discounted cumulative gain metric that we often use in ranking, ranges in value between 0 and 100 in ML.NET. In the wide world, NDCG is often considered to be between 0 and 1 (e.g. see Wikipedia). We should add a note to the ranking metrics object that the values range between 0 and 100 so that we don't confuse people who are new to the toolkit.
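To make the two conventions concrete, here is a minimal sketch of NDCG on the conventional 0 to 1 scale, next to the same value expressed as a percentage. The gain and discount forms used here (`2^rel - 1` and `log2(rank + 1)`) are a common textbook choice and an assumption; they are not taken from the ML.NET implementation, and the function names `dcg` and `ndcg` are hypothetical.

```python
import math


def dcg(relevances, k=None):
    """Discounted cumulative gain over the top-k items (all items if k is None).

    Assumes the common (2^rel - 1) / log2(rank + 1) form, with ranks starting at 1.
    """
    rels = relevances if k is None else relevances[:k]
    # enumerate() is 0-based, so rank + 1 in the formula becomes index + 2 here.
    return sum((2 ** rel - 1) / math.log2(index + 2) for index, rel in enumerate(rels))


def ndcg(relevances, k=None):
    """DCG of the given ordering divided by the DCG of the ideal ordering."""
    ideal = dcg(sorted(relevances, reverse=True), k)
    if ideal == 0:
        return 0.0  # no relevant items at all: define NDCG as 0
    return dcg(relevances, k) / ideal


# Relevance labels in the order the ranker returned the items.
ranked = [3, 2, 3, 0, 1, 2]

score = ndcg(ranked)    # conventional scale, between 0 and 1
percent = 100 * score   # the same value expressed as a percentage
```

Because the ideal ordering maximizes DCG, the ratio cannot exceed 1 for non-negative relevance labels, so the percentage form is only a rescaling of the same quantity.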

justinormont · 2019-02-20T01:23:43Z

Would it be possible to move the range to 0 to 1.0? This would be in line with most of our other metrics, and with the industry standard.

rogancarr · 2019-02-20T01:33:55Z

It's definitely possible, but it's a breaking change for a lot of folks out there.

@TomFinley what's your thought?

TomFinley · 2019-02-20T04:45:55Z

It would be more consistent to do as @justinormont suggests -- we phrase no other metric as a percentage, as far as I am aware. At the same time I know that the people who use this metric are more accustomed to having it expressed as a percentage. (Which I assume is what you mean by a breaking change.) But of course, the custom of some people can be adjusted and changed, whereas if we have an inconsistency now we are probably going to commit to it forever. This strikes me as unattractive.

This leads me to prefer the 0 to 1 range. It seems logically defensible and consistent, though I understand it is more work than the documentation change, and might annoy some people in the short term.

If you decide you agree @rogancarr please change this to be under API and project 13 appropriately, since it will then represent a breaking change.

rogancarr · 2019-02-20T16:09:53Z

@TomFinley @justinormont Great! Let's move it from 0 to 1 and add a note to the API docs.

I'll self-assign and send this in shortly.

Maybe we could write up a doc for internal users with a summary of small changes.

rogancarr · 2019-02-21T22:17:01Z

I found two other measures being scaled to percentages: the relative information gains for Binary Classification LogLossReduction and Multiclass Classification Reduction.
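As an illustration of what a relative information gain looks like when it is left unscaled, here is a minimal sketch for the binary case. It assumes the usual definition, the reduction in log-loss relative to a baseline that always predicts the positive-class prior. The function names are hypothetical and the details may differ from what ML.NET computes.

```python
import math


def log_loss(labels, probs, eps=1e-15):
    """Mean binary log-loss; probabilities are clipped away from 0 and 1."""
    total = 0.0
    for y, p in zip(labels, probs):
        p = min(max(p, eps), 1 - eps)
        total += -(y * math.log(p) + (1 - y) * math.log(1 - p))
    return total / len(labels)


def log_loss_reduction(labels, probs):
    """Relative reduction in log-loss versus always predicting the label prior.

    Returned as a fraction (1.0 would be a perfect model), not a percentage.
    It can be negative when the model is worse than the prior baseline.
    """
    prior = sum(labels) / len(labels)
    baseline = log_loss(labels, [prior] * len(labels))
    return (baseline - log_loss(labels, probs)) / baseline


labels = [1, 0, 1, 1]
probs = [0.9, 0.2, 0.8, 0.7]

reduction = log_loss_reduction(labels, probs)  # fraction
reduction_percent = 100 * reduction            # the same value as a percentage
```

Unlike NDCG, this quantity has no lower bound of 0, so moving it off the percentage scale changes only the upper end of its range (from 100 to 1).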

rogancarr · 2019-02-21T23:29:00Z

Just a quick note. I implemented the change, and about 41 tests fail. We'll have to remake baselines for a bunch of tests, so it'll be a rather big PR.

rogancarr added the usability label Feb 19, 2019

rogancarr mentioned this issue Feb 19, 2019

Cleaning up metric classes #2624

Closed

TomFinley added the documentation label Feb 19, 2019

rogancarr self-assigned this Feb 20, 2019

rogancarr mentioned this issue Feb 22, 2019

Move metrics from percentages to [0,1] #2697

Merged

rogancarr closed this as completed in #2697 Mar 21, 2019

justinormont mentioned this issue May 7, 2019

Improvements to definitions of metrics.md dotnet/docs#12220

Merged

ghost locked as resolved and limited conversation to collaborators Mar 24, 2022
