File tree Expand file tree Collapse file tree 1 file changed +19
-19
lines changed Expand file tree Collapse file tree 1 file changed +19
-19
lines changed Original file line number Diff line number Diff line change @@ -404,27 +404,27 @@ and include the following hyperparameter values:
404
404
``` yml
405
405
behaviors :
406
406
RollerBall :
407
- trainer : ppo
408
- batch_size : 10
409
- beta : 5.0e-3
410
- buffer_size : 100
411
- epsilon : 0.2
412
- hidden_units : 128
413
- lambd : 0.95
414
- learning_rate : 3.0e-4
415
- learning_rate_schedule : linear
416
- max_steps : 5.0e4
417
- memory_size : 128
418
- normalize : false
419
- num_epoch : 3
420
- num_layers : 2
407
+ trainer_type : ppo
408
+ hyperparameters :
409
+ batch_size : 10
410
+ buffer_size : 100
411
+ learning_rate : 3.0e-4
412
+ beta : 5.0e-4
413
+ epsilon : 0.2
414
+ lambd : 0.99
415
+ num_epoch : 3
416
+ learning_rate_schedule : linear
417
+ network_settings :
418
+ normalize : false
419
+ hidden_units : 128
420
+ num_layers : 2
421
+ reward_signals :
422
+ extrinsic :
423
+ gamma : 0.99
424
+ strength : 1.0
425
+ max_steps : 500000
421
426
time_horizon : 64
422
427
summary_freq : 10000
423
- use_recurrent : false
424
- reward_signals :
425
- extrinsic :
426
- strength : 1.0
427
- gamma : 0.99
428
428
` ` `
429
429
430
430
Since this example creates a very simple training environment with only a few
You can’t perform that action at this time.
0 commit comments