-
Notifications
You must be signed in to change notification settings - Fork 7.1k
Randomness in reference scripts with --test-only #4587
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Have you tried setting batch_size to 1 to see if it's related to the padding discussed at #4559? The alternative faster way to confirm this might be to |
ah, yes, thanks. With
Which confirms that we can't get exact results unless we have Just setting the number of GPUs to 1 isn't enough. |
BTW looks like the actual accuracy of resnet18 is a fair bit higher than what we report on the docs: 69.494 | 88.882 EDIT: I was looking at an old doc version, the difference isn't that big (69.758 | 89.078) |
Yep, looks like I was wrong here, my bad! |
While running some experiments related to #4559, I tried the following:
I was hoping to get reproducible results across executions, i.e. always the same accuracy, but it seems like I still get a bit of variations across a few runs:
Does anyone know where this randomness might come from?
The text was updated successfully, but these errors were encountered: