Make test precision stricter for Classification #6380
Conversation
LGTM, thanks
@@ -841,7 +841,7 @@ def test_video_model(model_fn, dev):
     # RNG always on CPU, to ensure x in cuda tests is bitwise identical to x in cpu tests
     x = torch.rand(input_shape).to(device=dev)
     out = model(x)
-    _assert_expected(out.cpu(), model_name, prec=0.1)
+    _assert_expected(out.cpu(), model_name, prec=1e-5)
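For reference, prec is the tolerance used when comparing the model output against the stored expected tensor. A minimal sketch of what such a check amounts to, assuming the helper delegates to a standard element-wise tolerance comparison (assert_expected_sketch is a hypothetical name; the real _assert_expected helper in test_models.py may differ in detail):

import torch

def assert_expected_sketch(actual: torch.Tensor, expected: torch.Tensor, prec: float) -> None:
    # Hypothetical stand-in for _assert_expected: fail if any element of
    # `actual` deviates from the stored `expected` tensor by more than the
    # given tolerance (applied here as both the relative and absolute bound).
    torch.testing.assert_close(actual, expected, rtol=prec, atol=prec)

# Ordinary floating-point noise (well below 1e-5) still passes:
expected = torch.tensor([0.10, 0.25, 0.65])
assert_expected_sketch(expected + 1e-7, expected, prec=1e-5)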
FYI @datumbox I suspect that this change might cause flakiness in our internal tests.
D38824237 shows that pytorch_vision_gpu-buck_2 is failing: https://www.internalfb.com/intern/testinfra/diagnostics/844425187765605.562950021904293.1660819094/
I don't see that on the other diffs (yet?), so I assume it's flaky.
Thanks for the heads up. I would recommend turning off the internal test and relying on GitHub for verifying the models. Historically, test_models.py at Meta has been a source of flakiness, so feel free to turn off anything that breaks. Let me know if you need any help from me.
Edit: Hmm, actually that specific model is internal-only, and I wonder whether its expected file is produced properly. The previous precision value was so high that it was effectively doing nothing. Looping in @jdsgomes in case he wants to have a look at this.
Looking into it; it could be that the expected files are wrong.
Summary:
* Make test precision stricter for Classification
* Update classification threshold.
* Update quantized classification threshold.

Reviewed By: datumbox
Differential Revision: D38824223
fbshipit-source-id: aa5adbf9fa7d55c0343c97cbe162c40a7ca0f984
Currently we are using extremely loose precision thresholds on some of our classification tests (image, quantized, and video). Especially on video, the value is so high that it doesn't capture breaking changes in the code. This PR updates the values to stricter thresholds without causing flakiness.
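To illustrate the motivation with made-up numbers (not outputs from the actual test suite): model outputs are typically of order one, so a tolerance of 0.1 lets sizeable regressions slip through, whereas 1e-5 catches them while still leaving headroom for ordinary floating-point noise.

import torch

expected = torch.tensor([1.23, -0.47, 0.05])  # illustrative model outputs
regressed = expected * 1.05                   # every element off by 5%

# The old, loose tolerance accepts the clearly changed output...
print(torch.allclose(regressed, expected, rtol=0.0, atol=0.1))   # True
# ...while the stricter 1e-5 tolerance rejects it.
print(torch.allclose(regressed, expected, rtol=0.0, atol=1e-5))  # False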