MovingAverage: add dynamic decay and swap weights #1726
Conversation
def test_swap_weights(sequential_update):
    for sequential_update in [True, False]:
        ...
        strategy = tf.distribute.MirroredStrategy()
We can't use distributed strategies in the test suite yet. It's a work in progress; a first step is here: #1713
`swap_weights` has to be called in a cross-replica context.
Can I use OneDeviceStrategy here?
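(Context for readers: per the comment above, `swap_weights` must be invoked where TensorFlow reports a cross-replica context, i.e. not from a function passed to `strategy.run`. A small, self-contained probe of the two contexts using only standard `tf.distribute` APIs, unrelated to the PR's own code:)

```python
import tensorflow as tf

strategy = tf.distribute.MirroredStrategy()

def replica_fn():
    # Inside strategy.run we are in the replica context.
    tf.print("inside strategy.run, cross-replica?",
             tf.distribute.in_cross_replica_context())  # False

with strategy.scope():
    # Under the scope but outside strategy.run we are in the
    # cross-replica context, which is where swap_weights belongs.
    tf.print("under scope, cross-replica?",
             tf.distribute.in_cross_replica_context())  # True
    strategy.run(replica_fn)
```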
We're going to have distributed capabilities in the tests very soon. Could you take a look at #1770 and tell me if it would work for you?
It will enable true multi-GPU (virtual, at least) strategy testing, even with multiple pytest workers.
That should work for this test case. Do you know when those patches will be merged?
Two pull requests need to be reviewed to get there, so I'd expect a week or so. If it isn't merged by next week, I'll ping the other maintainers about it.
Since the blocking pull requests are still under review, would it be possible to merge this with either the test commented out or with the test using OneDeviceStrategy? I could immediately open a follow-up pull request that fixes the test so it can be merged once the blocking pull requests land.
Would either of these options work for you? We would like to get these changes in soon because they fix real problems with MovingAverage.
Force-pushed from 775c98d to addaa4a.
Adding dynamic decay to MovingAverage improves early accuracy. When using dynamic decay, decay starts at 0.1 and gradually increases up to `average_decay`.
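A minimal sketch of that kind of schedule, assuming the classic `(1 + step) / (10 + step)` ramp (which equals 0.1 at step 0); the exact expression used by the patch may differ:

```python
import tensorflow as tf

def dynamic_decay(step, average_decay):
    """Ramp the decay from 0.1 at step 0 up towards `average_decay`."""
    step = tf.cast(step, tf.float32)
    return tf.minimum(average_decay, (1.0 + step) / (10.0 + step))

# dynamic_decay(0, 0.999)    -> 0.1
# dynamic_decay(1000, 0.999) -> ~0.991, capped at 0.999 as step grows
```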
This patch makes it easier to swap the model weights and the MovingAverage weights before eval and swap them back after eval.
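In other words, the intended pattern around evaluation looks roughly like the helper below. `eval_fn` is a hypothetical callable, and `opt` is assumed to be a MovingAverage wrapper whose averaged copies already exist (i.e. some training has happened); only `swap_weights` itself comes from this PR:

```python
def evaluate_with_averaged_weights(model, opt, eval_fn):
    opt.swap_weights()        # model variables now hold the averaged weights
    results = eval_fn(model)  # run the usual eval loop on the averaged model
    opt.swap_weights()        # swap back: restore the raw training weights
    return results
```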
Force-pushed from addaa4a to d9629a0.
Since #1770 has been merged, I have updated the patches to use `@pytest.mark.with_device([tf.distribute.MirroredStrategy])`. Please take a look and let me know if there is anything else I should do.
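(A hypothetical shape of the updated test; only the marker itself is taken from the comment above, and the body is elided:)

```python
import pytest
import tensorflow as tf


@pytest.mark.with_device([tf.distribute.MirroredStrategy])
def test_swap_weights():
    ...  # body elided; the test infrastructure runs it under MirroredStrategy
```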
@Squadrick Is there anything you need from me before you can review this pull request? We would like to submit it soon. Thanks!
LGTM, thanks for the contribution!
@gabrieldemarmiesse Can you please merge this?
    a.assign_sub(b)
    return a


def swap(strategy, a, b):
Love this trick for swapping.
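(For readers skimming the diff: the trick being praised appears to be the classic in-place arithmetic swap, which exchanges two variables with add/subtract assignments and no temporary buffer. A single-variable sketch, without the per-replica plumbing of the actual implementation:)

```python
import tensorflow as tf

a = tf.Variable(1.0)
b = tf.Variable(2.0)

a.assign_add(b)   # a = a + b
b.assign(a - b)   # b = (a + b) - b   -> old a
a.assign_sub(b)   # a = (a + b) - old a -> old b

print(a.numpy(), b.numpy())  # 2.0 1.0
```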
Thanks @tf-marissaw for the pull request and thanks @Squadrick for the review!
* Add dynamic decay to MovingAverage. Adding dynamic decay to MovingAverage improves early accuracy. When using dynamic decay, decay starts at 0.1 and gradually increases up to `average_decay`.
* Add ability to swap weights to MovingAverage. This patch makes it easier to swap the model weights and the MovingAverage weights before eval and swap them back after eval.
Adds two new features to MovingAverage (a rough constructor sketch follows the list):
- Dynamic decay: helps the early accuracy of MovingAverage by starting the decay at 0.1 and gradually increasing it to `average_decay`.
- Swap weights: makes it easier to temporarily swap the model weights and the average weights during eval and to swap them back afterwards.
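A rough constructor sketch. The `dynamic_decay` argument name is an assumption based on the description above (the PR may expose the feature differently); `average_decay` and `swap_weights` are as discussed in this thread:

```python
import tensorflow as tf
import tensorflow_addons as tfa

opt = tfa.optimizers.MovingAverage(
    tf.keras.optimizers.SGD(0.01),
    average_decay=0.999,   # ceiling the decay ramps up to
    dynamic_decay=True,    # assumed flag name: start at 0.1, grow to average_decay
)
# After training, opt.swap_weights() exchanges model and averaged weights for eval.
```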