Skip to content
Merged
Show file tree
Hide file tree
Changes from 26 commits
Commits
Show all changes
45 commits
Select commit Hold shift + click to select a range
a2b4fb7
Move sampler config into main YAML
Apr 14, 2020
94c684c
Make CLI override YAML
Apr 14, 2020
54e9914
Bring back default functionality, curriculum loader
Apr 15, 2020
df4a358
Load curriculum from same YAML
Apr 15, 2020
3a84c13
Example WallJump curriculum
Apr 15, 2020
92c9682
New-format YAML files
Apr 15, 2020
9dcf38d
Fix walljump curriculum
Apr 16, 2020
a926d4c
Commit SAC parameters
Apr 16, 2020
419a156
Delete old configs and add gail
Apr 16, 2020
c80c359
Change some of the documentation
Apr 17, 2020
f020ecc
Merge master into develop-single-config
Apr 17, 2020
0fa8f8b
More doc updates
Apr 17, 2020
72b39f0
Fix Yamato test
Apr 17, 2020
0c89258
Fix learn.py test
Apr 17, 2020
b84396f
More docs updates
Apr 17, 2020
756a75f
Update migrating.md file
Apr 17, 2020
cb97315
Update changelog and improve migrating
Apr 17, 2020
7bb6366
Don't hard break trying to get curriculum out of bad config
Apr 17, 2020
e0b8c9c
Use behavior name instead of brain
Apr 17, 2020
8d37045
Fix yamato_utils
Apr 17, 2020
b20ab5d
Merge branch 'master' of github.com:Unity-Technologies/ml-agents into…
Apr 17, 2020
50eafc2
Delete curricula
Apr 17, 2020
cf920b6
Merge branch 'master' of github.com:Unity-Technologies/ml-agents into…
Apr 17, 2020
eb3df94
Make RunOptions and YAML compatible
Apr 20, 2020
690da4a
Use entire YAML as run options, and override via CLI
Apr 20, 2020
3c26e57
Add test
Apr 21, 2020
4171565
Rename walljump yaml SAC
Apr 21, 2020
4330c02
Fix newline formatting
Apr 21, 2020
41dd3f7
Merge branch 'master' into develop-single-config
Apr 22, 2020
75ad833
Update SAC configurations
Apr 22, 2020
5a75d7f
Edit Changelog
Apr 22, 2020
9ba2ef3
Fix learn.py tests
Apr 22, 2020
36c9591
Update strikers vs goalie and add Worm
Apr 23, 2020
79c8a6c
Merge branch 'master' into develop-single-config
Apr 23, 2020
850be42
Merge branch 'develop-single-config' into develop-default-overrides
Apr 23, 2020
7c0b901
Change name to run_options_dict
Apr 29, 2020
54929d5
use custom action to store config file
Apr 29, 2020
3568c2e
Merge branch 'master' into develop-single-config
Apr 29, 2020
4214745
remove leftover debug
Apr 29, 2020
4d27ed5
Use hard links in Migrating.md
Apr 29, 2020
d3c611a
Merge branch 'develop-single-config' into develop-default-overrides
Apr 29, 2020
27d2f10
Add capture framerate back into CLI options
Apr 29, 2020
d9757b4
Merge branch 'master' into develop-default-overrides
Apr 29, 2020
4ab60ce
Move CLI args back into learn.py
Apr 29, 2020
20b664f
Update CHANGELOG
Apr 29, 2020
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Binary file added Project/Recordings/movie.mp4
Binary file not shown.
5 changes: 4 additions & 1 deletion com.unity.ml-agents/CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -36,7 +36,10 @@ and this project adheres to
communication between Unity and the Python process.
- The obsolete `Agent` methods `GiveModel`, `Done`, `InitializeAgent`,
`AgentAction` and `AgentReset` have been removed.
- The GhostTrainer has been extended to support asymmetric games and the asymmetric example environment Strikers Vs. Goalie has been added.
- The GhostTrainer has been extended to support asymmetric games and the asymmetric example environment Strikers Vs. Goalie has been added.
- Curriculum and Parameter Randomization configurations have been merged
into the main training configuration file. Note that this means training
configuration files are environment-specific. (#3791)

### Minor Changes

Expand Down
16 changes: 0 additions & 16 deletions config/3dball_randomize.yaml

This file was deleted.

7 changes: 0 additions & 7 deletions config/curricula/soccer.yaml

This file was deleted.

9 changes: 0 additions & 9 deletions config/curricula/test.yaml

This file was deleted.

16 changes: 0 additions & 16 deletions config/curricula/wall_jump.yaml

This file was deleted.

129 changes: 0 additions & 129 deletions config/gail_config.yaml

This file was deleted.

29 changes: 29 additions & 0 deletions config/imitation/CrawlerStatic.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,29 @@
behaviors:
CrawlerStatic:
trainer: ppo
batch_size: 2024
beta: 0.005
buffer_size: 20240
epsilon: 0.2
hidden_units: 512
lambd: 0.95
learning_rate: 0.0003
max_steps: 1e7
memory_size: 256
normalize: true
num_epoch: 3
num_layers: 3
time_horizon: 1000
sequence_length: 64
summary_freq: 30000
use_recurrent: false
reward_signals:
gail:
strength: 1.0
gamma: 0.99
encoding_size: 128
demo_path: Project/Assets/ML-Agents/Examples/Crawler/Demos/ExpertCrawlerSta.demo
behavioral_cloning:
demo_path: Project/Assets/ML-Agents/Examples/Crawler/Demos/ExpertCrawlerSta.demo
strength: 0.5
steps: 50000
29 changes: 29 additions & 0 deletions config/imitation/FoodCollector.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,29 @@
behaviors:
FoodCollector:
trainer: ppo
batch_size: 64
beta: 0.005
buffer_size: 10240
epsilon: 0.2
hidden_units: 128
lambd: 0.95
learning_rate: 0.0003
max_steps: 2.0e6
memory_size: 256
normalize: false
num_epoch: 3
num_layers: 2
time_horizon: 64
sequence_length: 32
summary_freq: 10000
use_recurrent: false
reward_signals:
gail:
strength: 0.1
gamma: 0.99
encoding_size: 128
demo_path: Project/Assets/ML-Agents/Examples/FoodCollector/Demos/ExpertFood.demo
behavioral_cloning:
demo_path: Project/Assets/ML-Agents/Examples/FoodCollector/Demos/ExpertFood.demo
strength: 1.0
steps: 0
28 changes: 28 additions & 0 deletions config/imitation/Hallway.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,28 @@
behaviors:
Hallway:
trainer: ppo
batch_size: 128
beta: 0.01
buffer_size: 1024
epsilon: 0.2
hidden_units: 128
lambd: 0.95
learning_rate: 0.0003
max_steps: 1.0e7
memory_size: 256
normalize: false
num_epoch: 3
num_layers: 2
time_horizon: 64
sequence_length: 64
summary_freq: 10000
use_recurrent: true
reward_signals:
extrinsic:
strength: 1.0
gamma: 0.99
gail:
strength: 0.1
gamma: 0.99
encoding_size: 128
demo_path: Project/Assets/ML-Agents/Examples/Hallway/Demos/ExpertHallway.demo
25 changes: 25 additions & 0 deletions config/imitation/PushBlock.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,25 @@
behaviors:
PushBlock:
trainer: ppo
batch_size: 128
beta: 0.01
buffer_size: 2048
epsilon: 0.2
hidden_units: 256
lambd: 0.95
learning_rate: 0.0003
max_steps: 1.5e7
memory_size: 256
normalize: false
num_epoch: 3
num_layers: 2
time_horizon: 64
sequence_length: 64
summary_freq: 60000
use_recurrent: false
reward_signals:
gail:
strength: 1.0
gamma: 0.99
encoding_size: 128
demo_path: Project/Assets/ML-Agents/Examples/PushBlock/Demos/ExpertPush.demo
36 changes: 36 additions & 0 deletions config/imitation/Pyramids.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,36 @@
behaviors:
Pyramids:
trainer: ppo
batch_size: 128
beta: 0.01
buffer_size: 2048
epsilon: 0.2
hidden_units: 512
lambd: 0.95
learning_rate: 0.0003
max_steps: 1.0e7
memory_size: 256
normalize: false
num_epoch: 3
num_layers: 2
time_horizon: 128
sequence_length: 64
summary_freq: 30000
use_recurrent: false
reward_signals:
extrinsic:
strength: 1.0
gamma: 0.99
curiosity:
strength: 0.02
gamma: 0.99
encoding_size: 256
gail:
strength: 0.01
gamma: 0.99
encoding_size: 128
demo_path: Project/Assets/ML-Agents/Examples/Pyramids/Demos/ExpertPyramid.demo
behavioral_cloning:
demo_path: Project/Assets/ML-Agents/Examples/Pyramids/Demos/ExpertPyramid.demo
strength: 0.5
steps: 150000
25 changes: 25 additions & 0 deletions config/ppo/3DBall.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,25 @@
behaviors:
3DBall:
trainer: ppo
batch_size: 64
beta: 0.001
buffer_size: 12000
epsilon: 0.2
hidden_units: 128
lambd: 0.99
learning_rate: 0.0003
learning_rate_schedule: linear
max_steps: 5.0e5
memory_size: 128
normalize: true
num_epoch: 3
num_layers: 2
time_horizon: 1000
sequence_length: 64
summary_freq: 12000
use_recurrent: false
vis_encode_type: simple
reward_signals:
extrinsic:
strength: 1.0
gamma: 0.99
25 changes: 25 additions & 0 deletions config/ppo/3DBallHard.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,25 @@
behaviors:
3DBallHard:
trainer: ppo
batch_size: 1200
beta: 0.001
buffer_size: 12000
epsilon: 0.2
hidden_units: 128
lambd: 0.95
learning_rate: 0.0003
learning_rate_schedule: linear
max_steps: 5.0e6
memory_size: 128
normalize: true
num_epoch: 3
num_layers: 2
time_horizon: 1000
sequence_length: 64
summary_freq: 12000
use_recurrent: false
vis_encode_type: simple
reward_signals:
extrinsic:
strength: 1.0
gamma: 0.995
Loading