Prevent tests from leaking their respective RNG #4497

Merged
8 commits merged into pytorch:main on Sep 29, 2021

Conversation

NicolasHug
Member

@NicolasHug NicolasHug commented Sep 28, 2021

This PR adds a new autouse pytest fixture to prevent each test from leaking its RNG consumption to other tests.

We've had strange errors and major headaches because of this in the past #3032 (comment)

After some thought and discussion with @pmeier and @vfdev-5, I don't think there's a better alternative than relying on an autouse fixture. It's a bit implicit, but that's the best we can do considering that pytorch only offers manual_seed() to control the RNG, which leaks to the entire program execution.

cc @pmeier
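To illustrate the kind of leak this fixture guards against, here is a minimal sketch using Python's stdlib `random` module as a stand-in for torch's global RNG (`torch.manual_seed()` behaves analogously; the test names are invented for illustration):

```python
import random

def test_that_seeds():
    # This test pins the global RNG for its own purposes.
    random.seed(12)
    random.random()

def test_that_consumes_randomness():
    # Without a save/restore fixture, this test observes whatever
    # state the previous test left in the global RNG.
    return random.random()

test_that_seeds()
leaked = test_that_consumes_randomness()

# Replaying the first test's seeding and consumption reproduces the
# exact value the second test saw: its "random" data was leaked.
random.seed(12)
random.random()
assert leaked == random.random()
```

The second test silently depends on the first one having run before it, which is exactly the cross-test coupling the autouse fixture removes.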

@@ -714,6 +714,7 @@ def test_random_apply(device):
@pytest.mark.parametrize('channels', [1, 3])
def test_gaussian_blur(device, channels, meth_kwargs):
tol = 1.0 + 1e-10
torch.manual_seed(12)
Member Author

I had to add this to prevent the test from failing (see https://app.circleci.com/pipelines/github/pytorch/vision/10830/workflows/946973f0-bad1-48ed-aaf3-f3720ab5f56a/jobs/828462).

If anything, this shows that the PR is working as expected and that this test is a bit flaky and sensitive to the _create_data() call.

To confirm I parametrized it over 100 random seeds, and I got 8 failures over 1000+ test instances. Each time, just 1 pixel was off. Considering the low failure rate, I think it's fine to keep the manual seeding here.
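The seed sweep described above can be sketched roughly as follows. This is a hypothetical stand-in: `_op_under_test`, the data size, and the tolerance are invented for illustration (the real test compared gaussian_blur outputs against tol), and the stdlib `random` module stands in for torch's RNG:

```python
import random

def _op_under_test(data):
    # Invented stand-in for the real comparison: returns True when
    # the result is within a (made-up) tolerance.
    return abs(sum(data) / len(data) - 0.5) < 0.45

# With pytest this would be @pytest.mark.parametrize('seed', range(100)).
failures = 0
for seed in range(100):
    random.seed(seed)
    data = [random.random() for _ in range(50)]  # stand-in for _create_data()
    if not _op_under_test(data):
        failures += 1

# Counting failures across many seeds measures how flaky the comparison
# is; a low failure rate justified pinning a single known-good seed.
print(failures)
```

In the real run, 8 failures out of 1000+ instances (each off by a single pixel) was judged low enough to keep the manual seeding.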

Comment on lines +87 to +88
@pytest.fixture(autouse=True)
def prevent_leaking_rng():
Member Author

This is pretty much the same as freeze_rng_state() https://github.com/pytorch/vision/blob/main/test/common_utils.py#L86

autouse=True means that this fixture will be used by every single test automatically.

@NicolasHug NicolasHug changed the title NOMRG Add autouse fixture to save and reset RNG in tests Add autouse fixture to save and reset RNG in tests Sep 29, 2021
Collaborator

@vfdev-5 vfdev-5 left a comment

Cool ! LGTM

@NicolasHug NicolasHug changed the title Add autouse fixture to save and reset RNG in tests Prevent tests from leaking their respective RNG Sep 29, 2021
@pmeier
Collaborator

pmeier commented Sep 29, 2021

Nicolas and I had an offline discussion about this a while back and I wanted to also give my opinion here:

IMO, this is not needed. There should only be two types of tests here:

  1. The test uses random data.
  2. The test uses fixed data, which we obtain by setting the random seed and then drawing "random" data.

In case 1. we don't need a fixed RNG state, because it shouldn't matter. If it does, the test actually belongs to category 2. and should be fixed.

In case 2. the RNG state doesn't matter since we overwrite the seed anyway.

@NicolasHug Is there an actual issue that this will fix?

Member

@fmassa fmassa left a comment

LGTM, thanks!

PyTorch does provide a generator that we can pass to (some) functions, but last time I checked it didn't work for CUDA (though that was a long time ago). If we had better support for passing a generator in PyTorch (as numpy has), we could use it, but IIRC that's not the case yet.

Your solution seems ok with me, thanks!

@fmassa fmassa merged commit 13bd09d into pytorch:main Sep 29, 2021
@NicolasHug
Member Author

NicolasHug commented Sep 29, 2021

@pmeier I think that #3032 (comment) and the very fact that test_gaussian_blur() broke clearly show that this is in fact needed.

IMO, this is not needed. There should be only two types of tests for this purpose

While you're right in principle, the reality is unfortunately much less simple. We have tests whose RNG is not entirely controlled and in fact depends on other tests; test_gaussian_blur() was one of them. There are likely others (and if they're flaky they'll pop up soon, which is a good thing).

@NicolasHug
Member Author

Also, and possibly most importantly:

  1. The test uses random data.

Before this PR, we had no such tests: every test was using a pre-defined RNG state leaked by whatever tests ran before it. Almost no test relied on a global pytorch RNG that would change across executions; the only ones that did were those run before the first test that called manual_seed().

@pmeier
Collaborator

pmeier commented Sep 29, 2021

Maybe I misunderstood the purpose. From your comments I get that you want this to detect flaky tests that we don't realize are there, because they pass until the order of execution changes and they get different "random" data. Is that correct?

@NicolasHug
Member Author

detect flaky tests that we don't realize are there, because they pass until the order of execution changes and they get different "random" data. Is that correct?

This is indeed a positive consequence of the changes, but that's not the main intent.

The main intent is that no test x should ever depend on any other test y. I agree with you that in an ideal world we would never write such tests, but in reality we often do, whether we know it or not, mainly because it's not always obvious where randomness is involved.

facebook-github-bot pushed a commit that referenced this pull request Sep 30, 2021
Summary:
* Add autouse fixture to save and reset RNG in tests

* Add other RNG generators

* delete freeze_rng_state

* Hopefully fix GaussianBlur test

* Alternative fix, probably better

* revert changes to test_models

Reviewed By: datumbox

Differential Revision: D31270915

fbshipit-source-id: e60273bc90985aaac5e0aaa20cfbfdb86639b8b8

Co-authored-by: Francisco Massa <[email protected]>
cyyever pushed a commit to cyyever/vision that referenced this pull request Nov 16, 2021