Reduce variance of classification references evaluation #4609


Merged · 6 commits · Oct 13, 2021

Conversation

NicolasHug (Member) commented Oct 13, 2021

Closes #4559
Closes #4600

This is a follow up to #4559 (comment). This PR does a few things:

  • Remove the cudnn auto-benchmarking when test-only is True. This removes the variance observed in #4559 (Evaluation code of references is slightly off).
  • Set shuffle=False for the test_dataloader.
  • Add a --use-deterministic-algorithms flag to the scripts.
  • Add a warning when the number of samples processed during validation differs from len(dataset).

Out of these, I think only the first one is really important. I don't feel strongly about the rest, LMK what you think.

cc @datumbox

# See FIXME above
num_processed_samples = utils.reduce_across_processes(num_processed_samples)
if hasattr(data_loader.dataset, "__len__") and len(data_loader.dataset) != num_processed_samples:
warnings.warn(
NicolasHug (Member, Author):

We might want to only warn if rank == 0, to avoid emitting world_size copies of the warning? It's not really a problem when running with submitit, but it can be a bit too much when running with torchrun directly.

Contributor:
I agree with warning only when rank == 0.
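
The rank-0-only warning discussed above could be sketched as follows. This is a minimal illustration, not the PR's actual code: `warn_on_main_process` and the explicit `rank` argument are hypothetical names chosen to keep the sketch self-contained.

```python
import warnings


def warn_on_main_process(message, rank):
    """Emit `message` only on the main process (rank 0), so a torchrun
    launch with world_size workers prints one warning instead of
    world_size copies."""
    if rank == 0:
        warnings.warn(message)
```

In the actual references, the rank would come from the distributed setup (e.g. a helper like `utils.is_main_process()`); the plain integer argument here just avoids depending on `torch.distributed` in the sketch.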

@@ -277,6 +297,10 @@ def main(args):
model_ema.load_state_dict(checkpoint["model_ema"])

if args.test_only:
# We disable the cudnn benchmarking because it can noticeably affect the accuracy
torch.backends.cudnn.benchmark = False
torch.backends.cudnn.deterministic = True
NicolasHug (Member, Author):
I deliberately chose not to set torch.use_deterministic_algorithms(True) here because:

  • Just removing the cudnn benchmarking is enough to get constant results, at least for resnet18 (maybe not for others).
  • Using torch.use_deterministic_algorithms(True) forces the user to set some env variables like CUBLAS_WORKSPACE_CONFIG=:4096:8, otherwise the script would crash.
  • Users can always set the new --use-deterministic-algorithms flag if they really want to.
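
A minimal sketch of how the opt-in flag might be wired, assuming an argparse-based script like the references. The function names, the `setdefault` of CUBLAS_WORKSPACE_CONFIG, and the parser description are illustrative, not the PR's exact code:

```python
import argparse
import os


def get_args_parser():
    # Minimal parser carrying only the new opt-in flag.
    parser = argparse.ArgumentParser(description="classification reference (sketch)")
    parser.add_argument(
        "--use-deterministic-algorithms",
        action="store_true",
        help="Forces the use of deterministic algorithms only.",
    )
    return parser


def configure_determinism(args):
    # torch.use_deterministic_algorithms(True) makes deterministic cuBLAS
    # ops raise at runtime unless CUBLAS_WORKSPACE_CONFIG is set, hence
    # the env var. Defaulting it here is a design choice of this sketch;
    # the PR leaves setting it to the user.
    if args.use_deterministic_algorithms:
        os.environ.setdefault("CUBLAS_WORKSPACE_CONFIG", ":4096:8")
        # torch.use_deterministic_algorithms(True)  # left commented: torch not imported here
```

With `action="store_true"` the flag defaults to False, matching the PR's intent that determinism stays opt-in.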

datumbox (Contributor) left a review:

LGTM, thanks Nicolas.

I left minor comments which are non-blocking and purely optional.

Edit: Do you plan to bring similar updates on the rest of the references in separate PRs?


NicolasHug (Member, Author):

Thanks for the review!

Edit: Do you plan to bring similar updates on the rest of the references in separate PRs?

Yes, I just wanted to test the waters with this one first :)

@NicolasHug NicolasHug merged commit 5b81c05 into pytorch:main Oct 13, 2021
facebook-github-bot pushed a commit that referenced this pull request Oct 14, 2021
Reviewed By: fmassa

Differential Revision: D31649961

fbshipit-source-id: 34d72840adcc84a50d8b96c2cf4776e5450343e8
mszhanyi pushed a commit to mszhanyi/vision that referenced this pull request Oct 19, 2021
cyyever pushed a commit to cyyever/vision that referenced this pull request Nov 16, 2021
Successfully merging this pull request may close these issues.

Evaluation code of references is slightly off