Skip to content

Commit 0f93186

Browse files
Update the basic example documentation (#3281)
1 parent ba18b66 commit 0f93186

File tree

1 file changed

+2
-1
lines changed

1 file changed

+2
-1
lines changed

docs/Learning-Environment-Examples.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -26,6 +26,7 @@ If you would like to contribute environments, please see our
2626
* Goal: Move to the most reward state.
2727
* Agents: The environment contains one agent.
2828
* Agent Reward Function:
29+
* -0.01 at each step
2930
* +0.1 for arriving at suboptimal state.
3031
* +1.0 for arriving at optimal state.
3132
* Behavior Parameters:
@@ -34,7 +35,7 @@ If you would like to contribute environments, please see our
3435
right).
3536
* Visual Observations: None
3637
* Float Properties: None
37-
* Benchmark Mean Reward: 0.94
38+
* Benchmark Mean Reward: 0.93
3839

3940
## [3DBall: 3D Balance Ball](https://youtu.be/dheeCO29-EI)
4041

0 commit comments

Comments
 (0)