Update the basic example documentation (#3281)

vincentpierre · web-flow · commit 0f9318675df2 · 2020-01-23T11:38:39.000-08:00
diff --git a/docs/Learning-Environment-Examples.md b/docs/Learning-Environment-Examples.md
@@ -26,6 +26,7 @@ If you would like to contribute environments, please see our
 * Goal: Move to the most reward state.
 * Agents: The environment contains one agent.
 * Agent Reward Function:
+  * -0.01 at each step
   * +0.1 for arriving at suboptimal state.
   * +1.0 for arriving at optimal state.
 * Behavior Parameters:
@@ -34,7 +35,7 @@ If you would like to contribute environments, please see our
     right).
   * Visual Observations: None
 * Float Properties: None
-* Benchmark Mean Reward: 0.94
+* Benchmark Mean Reward: 0.93
 
 ## [3DBall: 3D Balance Ball](https://youtu.be/dheeCO29-EI)