Add frozen flag to VisualFoodCollector #4511

dongruoping · 2020-09-25T18:57:23Z

Proposed change(s)

The FoodCollector environment does not train with pure visual observation due to insufficient information. The agent lacks of information about the frozen state of itself, and adding a boolean flag indicating the frozen state (which is also used in the vector observation version of FoodCollector) makes the training work.

This will add an example of using both vector and visual observation to our example environments.

Useful links (Github issues, JIRA tickets, ML-Agents forum threads etc.)

Types of change(s)

Checklist

Added tests that prove my fix is effective or that my feature works
Updated the changelog (if applicable)
Updated the documentation (if applicable)
Updated the migration guide (if applicable)

Other comments

dongruoping · 2020-09-25T18:58:17Z

Docs and changelogs WIP

vincentpierre

If you verified that it trains, this is great news!
I left a comment, please address before merging.
do you plan on adding this scene to the daily CI?

vincentpierre · 2020-09-28T17:06:50Z

Project/Assets/ML-Agents/Examples/FoodCollector/Scripts/FoodCollectorAgent.cs

@@ -30,6 +30,7 @@ public class FoodCollectorAgent : Agent
    public GameObject myLaser;
    public bool contribute;
    public bool useVectorObs;
+    public bool useFrozenFlag;


I think this should always be turned on. The user might have issues with the number of observations when unchecking this box. Besides, you mentioned that the Agent will not train without it, so I think it should be always on and un-uncheckable.

My concern was that it would be a bit confusing to use one vector flag by default in a "visual" scene.
Or is it actually not too big a problem? if so I can make it un-uncheckable.

dongruoping · 2020-09-28T18:29:34Z

Yes I think it would be good to be include in CI, so that we can capture it if we change the way combining vector and visual observations. It could take quite some time though.
FYI: Using the current config, the reward bounce between -1~1 in the first 1.5M steps, get to ~20 at around 2M steps and ~50 around 3M steps.

dongruoping · 2020-10-02T01:45:33Z

Added the trained model and updated the docs, so I think it'd be better to do re-review

vincentpierre

Looks, good to me. I still think the useFrozenFlag should not be an option and should always be on. I tried the model and it worked. I did not try to train it.

Project/Assets/ML-Agents/Examples/FoodCollector/Scripts/FoodCollectorAgent.cs

…llectorAgent.cs Co-authored-by: Vincent-Pierre BERGES <[email protected]>

dongruoping · 2020-10-06T17:38:16Z

Looks, good to me. I still think the useFrozenFlag should not be an option and should always be on. I tried the model and it worked. I did not try to train it.

Removed useFrozenFlag as suggested and verified the change does not affect the model.

Ruo-Ping Dong added 4 commits September 24, 2020 15:57

add forzen flag to VisualFoodCollector

4b3300f

add forzen flag to VisualFoodCollector

bbd5f0b

add config file

280e9da

fix typo

0c532f2

dongruoping requested a review from vincentpierre September 25, 2020 18:58

vincentpierre approved these changes Sep 28, 2020

View reviewed changes

Ruo-Ping Dong added 2 commits September 29, 2020 16:51

add trained model file

8de51d7

add documents

e25b6b4

dongruoping requested a review from vincentpierre October 2, 2020 01:45

vincentpierre approved these changes Oct 5, 2020

View reviewed changes

Project/Assets/ML-Agents/Examples/FoodCollector/Scripts/FoodCollectorAgent.cs Outdated Show resolved Hide resolved

Project/Assets/ML-Agents/Examples/FoodCollector/Scripts/FoodCollectorAgent.cs Outdated Show resolved Hide resolved

Ruo-Ping (Rachel) Dong and others added 4 commits October 5, 2020 13:38

Update Project/Assets/ML-Agents/Examples/FoodCollector/Scripts/FoodCo…

a99455d

…llectorAgent.cs Co-authored-by: Vincent-Pierre BERGES <[email protected]>

Update Project/Assets/ML-Agents/Examples/FoodCollector/Scripts/FoodCo…

591731b

…llectorAgent.cs Co-authored-by: Vincent-Pierre BERGES <[email protected]>

fix

ca25688

formatting

5821518

Merge branch 'master' into develop-vis-food

892669c

dongruoping merged commit a261b40 into master Oct 6, 2020

delete-merged-branch bot deleted the develop-vis-food branch October 6, 2020 21:02

dongruoping mentioned this pull request Oct 9, 2020

Add useVectorFrozenFlag option in FoodCollector #4552

Merged

10 tasks

github-actions bot locked as resolved and limited conversation to collaborators Oct 7, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add frozen flag to VisualFoodCollector #4511

Add frozen flag to VisualFoodCollector #4511

Uh oh!

dongruoping commented Sep 25, 2020 •

edited

Loading

Uh oh!

dongruoping commented Sep 25, 2020

Uh oh!

vincentpierre left a comment

Uh oh!

vincentpierre Sep 28, 2020

Uh oh!

dongruoping Sep 28, 2020

Uh oh!

dongruoping commented Sep 28, 2020

Uh oh!

dongruoping commented Oct 2, 2020

Uh oh!

vincentpierre left a comment

Uh oh!

Uh oh!

Uh oh!

dongruoping commented Oct 6, 2020

Uh oh!

Uh oh!

Add frozen flag to VisualFoodCollector #4511

Add frozen flag to VisualFoodCollector #4511

Uh oh!

Conversation

dongruoping commented Sep 25, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Proposed change(s)

Useful links (Github issues, JIRA tickets, ML-Agents forum threads etc.)

Types of change(s)

Checklist

Other comments

Uh oh!

dongruoping commented Sep 25, 2020

Uh oh!

vincentpierre left a comment

Choose a reason for hiding this comment

Uh oh!

vincentpierre Sep 28, 2020

Choose a reason for hiding this comment

Uh oh!

dongruoping Sep 28, 2020

Choose a reason for hiding this comment

Uh oh!

dongruoping commented Sep 28, 2020

Uh oh!

dongruoping commented Oct 2, 2020

Uh oh!

vincentpierre left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

dongruoping commented Oct 6, 2020

Uh oh!

Uh oh!

dongruoping commented Sep 25, 2020 •

edited

Loading