Skip to content

Commit 7d10fd7

Browse files
author
Ervin Teng
committed
Use tanh squash in PPO trainer
1 parent 60c2ae5 commit 7d10fd7

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

ml-agents/mlagents/trainers/ppo/trainer.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -221,6 +221,7 @@ def create_policy(self, brain_parameters: BrainParameters) -> TFPolicy:
221221
self.load,
222222
condition_sigma_on_obs=False, # Faster training for PPO
223223
create_tf_graph=False, # We will create the TF graph in the Optimizer
224+
tanh_squash=True,
224225
)
225226

226227
return policy

0 commit comments

Comments
 (0)