Release 0.10.0 #2648

ervteng · 2019-09-30T23:23:22Z

Merge Release 0.10.0 into Master. Release notes: https://github.com/Unity-Technologies/ml-agents/releases/tag/untagged-11e16747f20c203bae73

Release 0.9.2 to develop

Fixed small typo in documentation.

* specify dirs, exclude test files * update comments * html coverage in CI artifacts * add destination * ignore coverage files * check gym-unity too

- Reduces memory usage of buffer.

* Fix bug with construct_curr_info * Add more tests

* initialize trainer step count * remove step init from RLTrainer

* Add Soft Actor-Critic model, trainer, and policy and sac_trainer_config.yaml * Add documentation for SAC and tweak PPO documentation to reference the new pages. * Add tests for SAC, change simple_rl test to run both PPO and SAC.

* This addresses #1835. Baselines expects single environments used with their ppo2 algorithm to be wrapped in a DummyVecEnv. The old readme did not instruct the reader to do so and the code failed to run with the latest version of baselines. This imports the correct function from baselines and fixes the make_unity_env function described in the readme.
* Added a line to gym-unity/README.md to note the version of baselines the examples were tested with.

This change was requested for clarity during the async EnvManager PR. It's a simple rename of the StepInfo class.

Merge 0.9.3 changes to develop

* unit test - don't use global random generator * Update test_simple_rl.py

Clean up SAC config

…ne-bc Update the offline_bc_config path

… the test will not randomly fail (#2520)

* initialize random instance correctly * restore threshold (I hope)

Only cosmetic and readability improvements. No functional changes were intended.

Utilities.cs
- Fixed comments across file
- Made class static
- Removed unnecessary imports
- Removed unused method arguments
- Renamed variables as appropriate to make usage clearer
- In AddRangeNoAlloc, disabled (by comment) Rider’s suggestion to revert to use of built-in Range field (Fixed)
- In TextureToTensorProxy, swapped order of first two arguments to be more in-line with convention of input, output

UtilitiesTests.cs
- Removed unnecessary imports
- Simplified array creation commands

GeneratorImp.cs
- Rider automatically deleted spaces on empty lines
- Changed call to TextureToTensorProxy to mirror new argument ordering

* Clean-up to UnityAgentsException.cs
  - Removed unnecessary imports
  - Fixed comment warning
  - Fixed method header
* Improvements to Startup.cs
  - Created const for SCENE_NAME field
  - Fixed string formatting for exception message
  - Added new line at EOF
* Adding team-wide settings file
* Clean up to RandomNormal
  - Removed FillTensor from RandomNormal and moved to TensorUtils
  - Removed the tests in RandomNormalTest that are for FillTensor to TensorUtilsTest
  - Fixed GeneratorImpl to use to TensorUtils.FillTensor method
* Decouples DiscreteActionOutputApplier from Multinomial
  - Add use of var where appropriate
  - Makes variables readonly where appropriate
  - Makes private variables start with “_”
  - Removes DiscreteActionOutputApplier context from Multinomial - class is much simpler now
  - Moves most of the tests in MultinomialTest to a new DiscreteActionOutputApplierTest class
  - Adds a basic Multinomial test to MultinomialTest
* Minor formatting to TensorProxy
  - Limiting code to 100 characters
  - renaming private vars to start with “_”
  - fixing indentation in file

  Couple of other minor fixes to TensorUtilsTest and Discrete…Test.cs
* Adding new line to EOF for TensorProxy
* Adding missing meta files.
* Removing three unused methods from TensorProxy
* (Big) Rename of TensorProxy fields. Changed Name -> name; ValueType -> valueType; Shape -> shape; Data -> data
* Additional cosmetic changes to a number of files previously modified
  - Limit to 100 chars per line
  - Appropriate formatting
  - Removed Rider warnings
* Improvements to Multinomial(Test).cs. Fixed the tests for Multinomial as they didn’t include cumulative mass functions. Also, improved the documentation.

* Add a note on Custom messages about needing trainer changes * move to top (aside doesn't render as expected) * Don't blockquote * Update wording

* [memory] Fix for tensors not being disposed of. * Fix member name.

We have been ignoring unused imports and star imports via flake8. These are both bad practice and grow over time without automated checking. This commit attempts to fix all existing import errors and add back the corresponding flake8 checks.

This wasn't working before because of several remaining partially defined function definitions.

…DKlog Remove UnitySDK.log file

* encapsulate commandline args * fix tests * add tests on cmdline parsing * cleanup * remove docopt * simplify --slow

* TensorFlowSharp is no more * Removed old documents

* Added option to use environment arguments in learn * hook into argparse * add example to readme

Our multi-GPU training had a regression such that freezing the graph was broken. This change fixes that issue by making a few changes: * Removes the top level "tower" variable scope added by multi-GPU so that the output nodes have correct names * Removes the use of "freeze_graph" and replaces it with our own similar functionality. * Adds the "auto reuse" to network layers which require them

* image decompress timer * dont lazy load the image

…cy brain, we will remove the action descriptions from the dictionary of things we need to compare. This is to prevent the case where a user has different descriptions for his actions but still wants to train a brain using expert demonstrations. (#2517)
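The comparison described above can be sketched as follows. This is a minimal illustration, not the project's code: the key name `action_descriptions`, the function `parameters_match`, and the sample dictionaries are all assumptions made for the example.

```python
from typing import Any, Dict, Iterable

# Hypothetical key name; the real parameter dictionary is not shown in this PR.
IGNORED_KEYS = ("action_descriptions",)


def parameters_match(
    policy_params: Dict[str, Any],
    demo_params: Dict[str, Any],
    ignored: Iterable[str] = IGNORED_KEYS,
) -> bool:
    """Compare two parameter dictionaries while skipping the ignored keys."""
    ignored_set = set(ignored)

    def strip(params: Dict[str, Any]) -> Dict[str, Any]:
        # Keep only the entries that should take part in the comparison.
        return {k: v for k, v in params.items() if k not in ignored_set}

    return strip(policy_params) == strip(demo_params)


# Hypothetical data: same action size, different human-readable descriptions.
policy = {"action_size": [2], "action_descriptions": ["move", "turn"]}
demo = {"action_size": [2], "action_descriptions": ["forward", "rotate"]}
same_shape = parameters_match(policy, demo)
```

With this approach, differing descriptions no longer block the match, while a real mismatch such as a different action size is still detected.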

* WIP cleanup loading * better exceptions for parser errors - refer to online lint tools * feedback - rename variable

* new env styles rebased on develop
* added new trained models
* renamed food collector platforms
* reduce training timescale on WallJump from 100 to 10
* uncheck academy control on walljump
* new banner image
* rename banner file
* new example env images
* add foodCollector image
* change Banana to FoodCollector and update image
* change bouncer description to include green cube
* update image
* update gridworld image
* cleanup prefab names and tags
* updated soccer env to reference purple agent instead of red
* remove unused mats
* rename files
* remove more unused tags
* update image
* change platform to agent cube
* update text. change platform to agents head
* cleanup
* cleaned up weird unused meta files
* add new wall jump nn files and rename a prefab
* walker: change stacked states from 5 to 1 (walker collects physics observations, so stacked states are not needed)
* fixed reacher platform and crawlerstatic scene
* add more space between agents
* adjust x pos of camera based on new agent spacing
* adjust colliders & testing new nn models
* change speed on hallway agent and make rb continousDynamic; also update goal collider size in pushblock
* pushblock add new model, crawler decrease spawn radius
* reset cube pos each agent reset
* added new hallway nn model, fixed goal gameobjects
* add new files for 3dball
* added new models
* fixed crawlerStatic target spawn
* Increase bouncer steps
* [format] Format code to be compliant with Unity coding conventions. (#2608)
* Revert Batcher changes

* Add note about using GPU inference for ResNet

* Tick versions for pip packages * Tick API version to 10

* Re-record demos for new envs * Add better Hallway brain * Remove Banana

* Tweak SAC hyperparams * Make network bigger * Properly report entropy * Revert "Properly report entropy" This reverts commit 383a8d8.

vincentpierre

🚢 🇮🇹

DanAmador and others added 30 commits August 20, 2019 22:06

Fixed small typo in documentation.

835c736

Merge pull request #2470 from Unity-Technologies/release-0.9.2

2b64e45

Release 0.9.2 to develop

Merge pull request #2451 from DanAmador/patch-1

982f01e

Fixed small typo in documentation.

Fixed typo in Training-Imitation-Learning.md (#2485)

9cde007

python coverage: specify dirs, exclude test files (#2473)

4b1f1a3

* specify dirs, exclude test files * update comments * html coverage in CI artifacts * add destination * ignore coverage files * check gym-unity too

Change update buffer to float32 instead of float64 (#2461)

fe97df8

- Reduces memory usage of buffer.
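As a rough illustration of why the switch from float64 to float32 reduces memory, the sketch below stores the same values at both precisions and compares the byte counts. It uses the standard-library `array` module for a self-contained example; the buffer size is hypothetical and the project's actual buffer implementation is not shown here.

```python
from array import array

# Hypothetical buffer of 10,000 values; the real update buffer's layout is an assumption.
values = [0.5] * 10_000

buf64 = array("d", values)  # C double (float64), 8 bytes per element on common platforms
buf32 = array("f", values)  # C float (float32), 4 bytes per element on common platforms

# Total payload size is element size times element count.
bytes64 = buf64.itemsize * len(buf64)
bytes32 = buf32.itemsize * len(buf32)
print(bytes64, bytes32)
```

The trade-off is precision: float32 keeps roughly 7 significant decimal digits, which is usually sufficient for observations and rewards.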

Fix bug with construct_curr_info and test

863610b

Add more tests

77f105c

Add 2 visual obs test

2a4724d

Fix bug with construct_curr_info (#2490)

5566760

* Fix bug with construct_curr_info * Add more tests

Minor fix to link to GAIL reward signal doc (#2435)

3d324db

initialize trainer step count (#2498)

d150c51

* initialize trainer step count * remove step init from RLTrainer

Add Soft Actor-Critic as trainer option (#2341)

5cd2118

* Add Soft Actor-Critic model, trainer, and policy and sac_trainer_config.yaml * Add documentation for SAC and tweak PPO documentation to reference the new pages. * Add tests for SAC, change simple_rl test to run both PPO and SAC.

Renamed "StepInfo" to "EnvironmentStep"

5ada924

This change was requested for clarity during the async EnvManager PR. It's a simple rename of the StepInfo class.

Merge pull request #2516 from Unity-Technologies/master

035045e

Merge 0.9.3 changes to develop

Delete VisualBanana

8892581

unit test - don't use global random generator (#2521)

31529b8

* unit test - don't use global random generator * Update test_simple_rl.py
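The idea of avoiding the global random generator in a test can be sketched like this. The helper `sample_noise` and the seed value are hypothetical; the point is only that a dedicated `random.Random` instance is unaffected by other code that draws from the module-level generator.

```python
import random
from typing import List


def sample_noise(rng: random.Random, n: int) -> List[float]:
    """Draw n values from the given generator instead of the module-level one."""
    return [rng.random() for _ in range(n)]


# Two generators seeded identically produce identical sequences.
rng_a = random.Random(1337)
rng_b = random.Random(1337)

first = sample_noise(rng_a, 5)
random.random()  # unrelated use of the global generator does not disturb rng_b
second = sample_noise(rng_b, 5)
```

Passing the generator in explicitly keeps a test reproducible even when other tests or libraries consume random numbers in the same process.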

Merge pull request #2522 from Unity-Technologies/develop-cleanupconfig

bb82962

Clean up SAC config

Use numpy for random sample in buffer (#2524)

ec913ba

Update the offline_bc_config path

ebe7e7c

Merge pull request #2526 from Unity-Technologies/develop-update-offli…

7deade9

…ne-bc Update the offline_bc_config path

Changing Training-RewardSignals.md --> Reward-Signals.md (#2525)

7208853

Made the _check_environment_trains test a little more easy to pass so…

56ea0a8

… the test will not randomly fail (#2520)

Fix determinism in unit test (#2530)

77c83cc

* initialize random instance correctly * restore threshold (I hope)

Add a note on Custom messages about needing trainer changes (#2534)

d79df88

* Add a note on Custom messages about needing trainer changes * move to top (aside doesn't render as expected) * Don't blockquote * Update wording

[memory] Fix for tensors not being disposed of. (#2541)

27a4629

* [memory] Fix for tensors not being disposed of. * Fix member name.

Fix run_id typing in trainer.py (#2537)

a737e83

Fixes missing camera resolution info in demos (#2523)

7fc5f54

Jonathan Harper and others added 23 commits September 18, 2019 14:11

Fix flake8 import warnings (#2584)

e594ba8

We have been ignoring unused imports and star imports via flake8. These are both bad practice and grow over time without automated checking. This commit attempts to fix all existing import errors and add back the corresponding flake8 checks.

Allow mypy to reject incomplete defs for mlagents-envs (#2585)

b787e81

This wasn't working before because of several remaining partially defined function definitions.

Merge pull request #2580 from Unity-Technologies/develop-removeUnityS…

88caccf

…DKlog Remove UnitySDK.log file

Use argparse for arg parsing (#2586)

e618318

* encapsulate commandline args * fix tests * add tests on cmdline parsing * cleanup * remove docopt * simplify --slow
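A minimal sketch of encapsulating command-line parsing behind a single function with `argparse` might look like the following. Only the `--slow` flag name comes from the commit message; the function name `parse_command_line`, the positional `config_path` argument, and the help strings are assumptions for illustration and do not describe the real CLI.

```python
import argparse
from typing import List, Optional


def parse_command_line(argv: Optional[List[str]] = None) -> argparse.Namespace:
    """Parse arguments in one place so tests can pass an explicit argv list."""
    parser = argparse.ArgumentParser(description="Hypothetical training entry point")
    # Hypothetical positional argument, included only for illustration.
    parser.add_argument("config_path", help="Path to a trainer configuration file")
    # A plain boolean switch: present means True, absent means False.
    parser.add_argument("--slow", action="store_true", help="Hypothetical help text")
    return parser.parse_args(argv)


options = parse_command_line(["trainer_config.yaml", "--slow"])
```

Accepting `argv` as a parameter is what makes the parsing easy to unit test, since no test has to modify `sys.argv`.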

TensorFlowSharp is no more (#2590)

f3042b5

* TensorFlowSharp is no more * Removed old documents

Added option to use environment arguments in learn (#2594)

aa65274

* Added option to use environment arguments in learn * hook into argparse * add example to readme

Fix crash with VAIL + GAIL (#2598)

8e580ff

image decompress timer (#2596)

d5ea5af

* image decompress timer * dont lazy load the image

fix hang with multiple envs (#2600)

97b9951

Develop yaml json loading errors (#2601)

efc9d84

* WIP cleanup loading * better exceptions for parser errors - refer to online lint tools * feedback - rename variable

Add note about using GPU inference for ResNet (#2607)

e557dab

* Add note about using GPU inference for ResNet

Tick version of API and pypi packages to 10 (#2610)

23404c0

* Tick versions for pip packages * Tick API version to 10

Update project version to 2017.4.32 (#2613)

4d3f2e5

Update Bouncer learning NN file (#2614)

03453e1

Record new demos for new envs (#2622)

6502feb

* Re-record demos for new envs * Add better Hallway brain * Remove Banana

Update Migrating.md with note about environments (#2624)

75b149f

Remove Soccer .nn files (#2615)

62e8fb1

Improved SAC hyperparameters for Crawler, Walker (#2635)

f5b98ca

* Tweak SAC hyperparams * Make network bigger * Properly report entropy * Revert "Properly report entropy" This reverts commit 383a8d8.

Fix spelling error in documentation (#2636)

83e3924

Fix visual hallway and visual pushblock brains and scenes. (#2645)

600d94c

ervteng requested review from chriselion, vincentpierre and harperj September 30, 2019 23:23

harperj approved these changes Sep 30, 2019


vincentpierre approved these changes Sep 30, 2019


ervteng merged commit a7c1fcc into master Sep 30, 2019

github-actions bot locked as resolved and limited conversation to collaborators May 17, 2021
