A couple fixes for recording demonstrations #1999

ervteng · 2019-04-30T05:40:34Z

While sanitizing the demo filename, truncate the string if it is too long. A long name may exceed the 32 bytes reserved for metadata and cause corruption.
Store lastActions[] in Agent.cs so that it can be written to the demonstration on Done(). Previously, the last action recorded was always 0.
Fix on-demand actions so that the Agent does not take an additional action after Done() is called. The extra action is a problem for games where the reset leaves the scene in a state in which taking an action causes undesirable behavior.
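
As a rough illustration of the first fix, the truncation step could look like the sketch below. The class name `DemoNameSketch`, the method shape, and the character whitelist are assumptions made for this example and are not taken from the PR; only the limit of 16 comes from the `maxNameLength` value in the diff.

```csharp
using System;
using System.Text.RegularExpressions;

// Hypothetical helper, not the actual DemonstrationRecorder code.
public static class DemoNameSketch
{
    // Limit taken from the maxNameLength value shown in this PR's diff.
    public const int MaxNameLength = 16;

    public static string SanitizeName(string demoName, int maxNameLength)
    {
        // Assumption: keep only letters, digits, spaces, and hyphens.
        string cleaned = Regex.Replace(demoName, "[^a-zA-Z0-9 -]", "");

        // Cut the name so it cannot overflow the space reserved for metadata.
        if (cleaned.Length > maxNameLength)
        {
            cleaned = cleaned.Substring(0, maxNameLength);
        }
        return cleaned;
    }
}
```

Truncating after stripping disallowed characters means the limit applies to the name that is actually written, not to the raw input.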

vincentpierre · 2019-04-30T15:52:29Z

UnitySDK/Assets/ML-Agents/Scripts/Agent.cs

@@ -1011,6 +1021,8 @@ void ResetIfDone()
                            // as it is done
                            _AgentReset();
                            hasAlreadyReset = true;
+                            requestDecision = false;


How can we make sure this does not break anything?

ervteng · 2019-04-30

Do any of our environments (aside from Snoopy Pop, where this does work) use On Demand Decisions?

awjuliani · 2019-04-30

Bouncer uses ODD.

ervteng · 2019-04-30

Training Bouncer seems to work (1.0 reward after 50k steps and climbing).

vincentpierre · 2019-04-30T15:53:00Z

UnitySDK/Assets/ML-Agents/Scripts/DemonstrationRecorder.cs

@@ -14,6 +14,7 @@ public class DemonstrationRecorder : MonoBehaviour
        private Agent recordingAgent;
        private string filePath;
        private DemonstrationStore demoStore;
+        private int maxNameLength = 16;


Make this a const and capitalize the variable name.

vincentpierre · 2019-04-30T15:54:06Z

UnitySDK/Assets/ML-Agents/Scripts/Agent.cs

@@ -584,7 +586,15 @@ void SendInfoToBrain()
            }

            info.memories = action.memories;
-            info.storedVectorActions = action.vectorActions;
+            if(done)


I fail to see how this solves the problem. Could you add some comments explaining what problem it solves and how?

ervteng · 2019-04-30

I added a short comment; my initial description on the pull request itself was wrong. Reset() is called before SendInfoToBrain(), so when a reset happens, action.vectorActions is zeroed out first, and SendInfoToBrain() then stores those zeros in storedVectorActions. This is fine for training, since the Python code doesn't use storedVectorActions, but when recording a demo, the zeros are saved to the file. (Now that I think about it, we might need to do this for the textActions as well.)

I don't necessarily like this fix, as it isn't very clean, but it seems like the only option that doesn't require reordering the Academy step loop.
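
The ordering problem described above can be reproduced with a toy model. The sketch below is not the real Agent.cs: `ToyAgent`, its fields, and the `useCache` switch are hypothetical stand-ins that only mirror the Reset-then-SendInfoToBrain order and the idea of keeping a cached copy of the last actions.

```csharp
using System;

// Toy stand-in for the agent; all names here are hypothetical.
public class ToyAgent
{
    public float[] VectorActions = new float[1];        // plays the role of action.vectorActions
    public float[] StoredVectorActions = new float[1];  // plays the role of info.storedVectorActions
    private float[] lastActions = new float[1];         // cached copy that survives a reset
    public bool Done;

    public void Act(float value)
    {
        VectorActions[0] = value;
        // Remember the action so it is still available after Reset().
        Array.Copy(VectorActions, lastActions, VectorActions.Length);
    }

    public void Reset()
    {
        // Mirrors the behavior described above: actions are zeroed on reset.
        Array.Clear(VectorActions, 0, VectorActions.Length);
    }

    public void SendInfo(bool useCache)
    {
        // On a done step, read from the cache instead of the zeroed array.
        float[] source = (Done && useCache) ? lastActions : VectorActions;
        StoredVectorActions = (float[])source.Clone();
    }
}
```

With `useCache` set to false, a final action of 3 is stored as 0 once Reset() has run; with it set to true, the stored value is 3, which is the value a demonstration recorder would want to write.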

vincentpierre · 2019-04-30

Should we not zero out action.vectorActions at reset?

ervteng · 2019-04-30

Is there a reason we do this? Maybe we can initialize them to 0 on creation but not zero them out afterwards? I guess it's problematic because 0 isn't null; it has an actual meaning when it comes to actions.

Ervin Teng added 3 commits April 29, 2019 22:24

Crop demonstration name to avoid overflow

5bd52ab

Fix last action recorded is always 0

e080503

Fixed issue where extra action taken after Done()

d09d209

ervteng requested a review from vincentpierre April 30, 2019 05:40

Ervin Teng added 2 commits April 29, 2019 22:42

Remove whitespace

35ff235

Append comment

bfc968a

vincentpierre suggested changes Apr 30, 2019

Ervin Teng added 2 commits April 30, 2019 10:41

Make max length a const and add comments

fbfec7e

Fix comment and test for DemonstrationRecorder

81bd12d

Unity-Technologies deleted a comment Apr 30, 2019

vincentpierre approved these changes Apr 30, 2019

Adjust comment

c7a0166

Unity-Technologies deleted a comment May 1, 2019

Ervin Teng added 2 commits May 1, 2019 10:55

Revert "Fixed issue where extra action taken after Done()"

abdb0cb

This reverts commit d09d209.

More elegant fix to 0 action recording

8f23448

ervteng requested a review from vincentpierre May 1, 2019 18:44

Unity-Technologies deleted a comment May 1, 2019

vincentpierre approved these changes May 3, 2019

ervteng merged commit e1d2b69 into develop May 3, 2019

ervteng deleted the develop-demofixes branch July 9, 2019 22:01

github-actions bot locked as resolved and limited conversation to collaborators May 18, 2021
