Commit 1e1e3d0

authored Nov 18, 2024
Merge branch 'site' into 11-13
2 parents 2a6304d + 713337a commit 1e1e3d0

1 file changed: +1 -1 lines changed

‎_posts/2020-08-18-pytorch-1.6-now-includes-stochastic-weight-averaging.md

@@ -1,7 +1,7 @@
 ---
 layout: blog_detail
 title: 'PyTorch 1.6 now includes Stochastic Weight Averaging'
-author: Pavel Izmailov, Andrew Gordon Wilson and Vincent Queneneville-Belair
+author: Pavel Izmailov, Andrew Gordon Wilson and Vincent Quenneville-Belair
 ---
 
 Do you use stochastic gradient descent (SGD) or Adam? Regardless of the procedure you use to train your neural network, you can likely achieve significantly better generalization at virtually no additional cost with a simple new technique now natively supported in PyTorch 1.6, Stochastic Weight Averaging (SWA) [1]. Even if you have already trained your model, it’s easy to realize the benefits of SWA by running SWA for a small number of epochs starting with a pre-trained model. [Again](https://twitter.com/MilesCranmer/status/1282140440892932096) and [again](https://twitter.com/leopd/status/1285969855062192129), researchers are discovering that SWA improves the performance of well-tuned models in a wide array of practical applications with little cost or effort!
