Skip to content

Conversation

vbourgin
Copy link

@vbourgin vbourgin commented Oct 11, 2024

Summary:
Update torchtnt auto_unit to use self.device for the EMA / SWA model, which may be set from environment in the superclass init. This enables model evaluation in GPU.

Differential Revision: D64206735

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D64206735

Summary:

Add EMA to the recognizer:
- Separate out learning rate scheduler updates and EMA model updates: in d2go, the EMA weights were updated every step, while the scheduler was updated every epoch. We separate them to implement the same functionality in Vizard and override `on_train_step_end` to update the EMA weights every step (irrespective of other parameters).
- Update torchtnt auto_unit to use self.device for the EMA / SWA model, which may be set from environment in the superclass init. This enables model evaluation in GPU.

Differential Revision: D64206735
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D64206735

@vbourgin vbourgin changed the title Recognizer: Incorporate EMA Use AutoUnit.device for SWA Model Oct 11, 2024
@vbourgin vbourgin closed this Oct 11, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants