Release v0.9.0 #2380
Merged
Conversation
- Fix slash direction on Windows in the COMPILER definition - Fix missing COMPILER variables when calling protoc - Fix the call to use "python3" instead of "python"
- Note to use the Windows batch file on Windows
- Put `rem` back in make_for_win.bat
- Notes for where to enter commands to start with - Select a particular version of grpcio-tools - Note how to get nuget if needed - Directory-independent nuget install - Remove the instruction to download protoc, since it comes with Grpc.Tools - Add instructions for Windows in the "Running" section, plus directories for clarification
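These build fixes revolve around generating the gRPC bindings with protoc from the grpcio-tools package rather than a separately downloaded protoc binary. A minimal sketch of that invocation from Python; the paths and the .proto file name are placeholders, not the repo's real ones:

```python
# Run protoc via grpcio-tools instead of a standalone protoc binary.
# Paths and the .proto file name below are placeholders.
from grpc_tools import protoc

exit_code = protoc.main([
    "grpc_tools.protoc",                   # argv[0], expected by protoc.main
    "--proto_path=protobuf-definitions",
    "--python_out=generated",
    "--grpc_python_out=generated",
    "protobuf-definitions/example.proto",
])
if exit_code != 0:
    raise RuntimeError(f"protoc failed with exit code {exit_code}")
```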
- Update protobuf folder ignores
- Revert version to what is currently in develop branch to clarify what changed.
- Ignore grpc installation on any platform
- make_for_win.bat now has the same comment as make.bat - The instructions for editing those files now both reference line 7.
- Add required version for grpc tools. Newer versions cause UnityToExternalGrpc.cs to fail to compile inside Unity due to a new function in the file
- Include grpc required versions - Clarify which steps are install and which are run every time
- Fix re-install directions to include the -e modifier - Move re-install directions from creating-custom... to the protobuf readme - Add how to see confirmation that the install worked
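The re-install directions amount to running `pip install -e ./ml-agents` from the repo root so the package is installed in editable mode. One hedged way to confirm the install worked, assuming the pip distribution is named `mlagents`:

```python
# Verify that the (editable) install is visible to this interpreter.
# "mlagents" is the assumed distribution name; adjust if it differs.
import pkg_resources

dist = pkg_resources.get_distribution("mlagents")
print(dist.project_name, dist.version, dist.location)
```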
GitHub renders the Markdown better with that change
Merge from release 0.8.2 to develop
* WIP precommit on top level * update CI * circleci fixes * intentionally fail black * use --show-diff-on-failure in CI * fix command order * rebreak a file * apply black * run on whole repo
* WIP precommit on top level * update CI * circleci fixes * intentionally fail black * use --show-diff-on-failure in CI * fix command order * rebreak a file * apply black * WIP enable mypy * run mypy on each package * fix trainer_metrics mypy errors * more mypy errors * more mypy * Fix some partially typed functions * types for take_action_outputs * fix formatting * cleanup * generate stubs for proto objects * fix ml-agents-env mypy errors * disallow-incomplete-defs for gym-unity * Add CI notes to CONTRIBUTING.md
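For readers unfamiliar with the mypy flags above, `--disallow-incomplete-defs` rejects functions that annotate only some of their parameters. An illustrative example, not code from this PR:

```python
# Flagged by mypy --disallow-incomplete-defs: "reward" has no annotation
# even though the rest of the signature is typed.
def scale_reward(reward, factor: float) -> float:
    return reward * factor

# Fully annotated version that passes the check.
def scale_reward_typed(reward: float, factor: float) -> float:
    return reward * factor
```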
…point to clarify is the behaviour of returning a '1' on a **no hit**.
Fixed minor typos (sh for console)
Develop rayperception docs
Fix Protobuf Install Instructions
* Create new class (RewardSignal) that represents a reward signal. * Add value heads for each reward signal in the PPO model. * Make summaries agnostic to the type of reward signals, and log weighted rewards per reward signal. * Move extrinsic and curiosity rewards into this new structure. * Allow defining multiple reward signals in YAML file. Add documentation for this new structure.
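A minimal sketch of what such a reward-signal abstraction could look like; the class and method names here are hypothetical, not the PR's actual API:

```python
from abc import ABC, abstractmethod


class RewardSignal(ABC):
    """Hypothetical base class: one instance per reward stream (extrinsic, curiosity, ...)."""

    def __init__(self, strength: float, gamma: float):
        self.strength = strength  # weight used when mixing reward streams
        self.gamma = gamma        # per-signal discount; each signal gets its own value head

    @abstractmethod
    def evaluate(self, batch):
        """Return a list of raw rewards, one per experience in the batch."""


class ExtrinsicReward(RewardSignal):
    def evaluate(self, batch):
        # The extrinsic signal just passes the environment reward through.
        return [step["env_reward"] for step in batch]


def total_reward(signals, batch):
    # Weighted sum across all configured signals; the real trainer also logs each
    # stream separately so summaries stay agnostic to the signal types.
    per_signal = [sig.evaluate(batch) for sig in signals]
    return [
        sum(sig.strength * r for sig, r in zip(signals, step))
        for step in zip(*per_signal)
    ]


signals = [ExtrinsicReward(strength=1.0, gamma=0.99)]
print(total_reward(signals, [{"env_reward": 1.0}, {"env_reward": 0.0}]))  # [1.0, 0.0]
```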
At each step, an unused `last_reward` variable in the TF graph is updated in our PPO trainer. There are also related unused methods in various places in the codebase. This change removes them.
* fix bug in RandomNormal, add test for distribution * extract epsilon, rename vars
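Extracting epsilon is the standard reparameterization trick: draw the noise separately so the sample is a deterministic function of the distribution parameters and easy to unit-test. A numpy sketch, not the PR's TensorFlow code:

```python
import numpy as np


def sample_normal(mu, log_sigma, rng=None):
    # Reparameterization: sample = mu + sigma * epsilon, with epsilon drawn
    # independently so it can be fixed in tests of the distribution code.
    rng = rng or np.random.default_rng(0)
    epsilon = rng.standard_normal(np.shape(mu))
    return mu + np.exp(log_sigma) * epsilon


print(sample_normal(np.zeros(3), np.log(np.ones(3))))  # three N(0, 1) draws
```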
* WIP doesn't crash * return stats and assert convergence * pass lint checks * rename * fix-reset-params * add time penalty * _get_measure_vals always returns something * fix tests * unused import * single env, fix double step * move LocalEnvManager to ml-agents-envs * move and rename EnvManager * remove obsolete docstring and method * clean up
* Don't 0 value bootstrap for GAIL and Curiosity * Add gradient penalties to GAN to help with stability * Add gail_config.yaml with GAIL examples * Cleaned up trainer_config.yaml and unnecessary gammas * Documentation updates * Code cleanup
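The gradient penalty for the GAIL discriminator presumably follows the WGAN-GP form, penalizing the discriminator's gradient norm on interpolations between expert and policy samples; ml-agents' exact formulation may differ:

```latex
L_{GP} = \lambda \,\mathbb{E}_{\hat{x}}\!\left[\left(\lVert \nabla_{\hat{x}} D(\hat{x}) \rVert_2 - 1\right)^2\right],
\qquad
\hat{x} = \alpha\, x_{\text{expert}} + (1-\alpha)\, x_{\text{policy}},\quad \alpha \sim U(0,1)
```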
* Removes unused SubprocessEnvManager import in trainer_controller * Removes unused `steps` argument to `TrainerController._save_model` * Consolidates unnecessary branching for curricula in `TrainerController.advance` * Moves `reward_buffer` into `TFPolicy` from `PPOPolicy` and adds `BCTrainer` support so that we don't have a broken interface / undefined behavior when BCTrainer is used with curricula.
* Timer proof-of-concept * micro optimizations * add some timers * cleanup, add asserts * Cleanup (no start/end methods) and handle exceptions * unit test and decorator * move output code, add a decorator * cleanup * module docstring * actually write the timings when done with training * use __qualname__ instead * add a few more timers * fix mock import * fix unit test * get timers from worker process (WIP) * clean up timer merging * typo * WIP * cleanup merging code * bad merge * undo accidental change * remove reset command * fix style * fix unit tests * fix unit tests (they got overwritten in merge) * get timer root through a function * timer around communicate
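A minimal sketch of a timing decorator keyed on `__qualname__`, in the spirit of the commits above; the names and storage format are hypothetical:

```python
import functools
import time

_timings = {}  # qualname -> (total seconds, call count)


def timed(func):
    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        start = time.perf_counter()
        try:
            return func(*args, **kwargs)
        finally:  # record the elapsed time even if the call raises
            total, count = _timings.get(func.__qualname__, (0.0, 0))
            _timings[func.__qualname__] = (total + time.perf_counter() - start, count + 1)
    return wrapper


@timed
def advance():
    time.sleep(0.01)  # stand-in for a training step


advance()
print(_timings)  # e.g. {'advance': (0.0101..., 1)}
```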
Brings a bucket of temp memory allocation optimizations: * switched to Barracuda-backed tensors across the board, which helps leverage allocators and reuse internal buffers * added the Barracuda 0.2.4 release, which brings another set of temp memory allocation fixes
refactor vis_encoder_type and add to doc
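For context, the visual encoder type is selected in the trainer YAML. The key and values below are my assumption of the options at the time (the key may be spelled `vis_encode_type` or `vis_encoder_type`):

```python
import yaml  # requires PyYAML

# Hypothetical trainer-config snippet; the brain name and values are illustrative.
config_text = """
VisualBrainLearning:
  vis_encode_type: nature_cnn   # assumed alternatives: simple, resnet
"""
config = yaml.safe_load(config_text)
print(config["VisualBrainLearning"]["vis_encode_type"])  # nature_cnn
```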
* fix azure link * fix imitation learning links
…2258) New environment reset parameters: * Banana Collectors: laser length, agent scale * Bouncer: banana size * Pushblock: dynamic_friction, static_friction, block_drag, block_scale * Reacher: gravity, non-linear goal movement * Walker: gravity, torso mass
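These parameters are supplied when resetting the environment. A hedged sketch against the ml-agents-envs API of that era; the import path was `mlagents.envs` around 0.9, but the exact reset signature may differ:

```python
from mlagents.envs import UnityEnvironment

# "Walker" is a hypothetical build name; reset parameters are plain floats.
env = UnityEnvironment(file_name="Walker")
env.reset(train_mode=True, config={"gravity": 9.81, "torso_mass": 8.0})
env.close()
```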
* profiling docs * clean up debug option, move csv info * Imitation Learning -> Behavioral Cloning
* Add Sampler and SamplerManager * Enable resampling of reset parameters during training * Documentation for Sampler and example YAML configuration file
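A minimal sketch of what the sampler abstraction could look like; the class names mirror the commit message, but the methods and constructor are my assumptions:

```python
import random


class UniformSampler:
    """Hypothetical: draws a reset-parameter value uniformly on each resample."""

    def __init__(self, min_value: float, max_value: float):
        self.min_value, self.max_value = min_value, max_value

    def sample(self) -> float:
        return random.uniform(self.min_value, self.max_value)


class SamplerManager:
    """Hypothetical: maps reset-parameter names to their samplers."""

    def __init__(self, samplers):
        self.samplers = samplers

    def sample_all(self):
        return {name: s.sample() for name, s in self.samplers.items()}


manager = SamplerManager({"gravity": UniformSampler(7.0, 12.0)})
print(manager.sample_all())  # e.g. {'gravity': 9.3}, passed to env.reset each episode
```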
* Tick versions of gym, ml-agents, ml-agents-envs * Tick communication API to 9
* Removed obsolete 'TestDstWrongShape' test as it does not reflect how Barracuda tensors work * Added proper test cleanup, to avoid warning messages from finalizer thread.
* Fix naming conventions for consistency * Add generalization link to ML-Agents Overview * Add generalization to main Readme * Include types of samplers available for use
* Removed obsolete 'TestDstWrongShape' test as it does not reflect how Barracuda tensors work * Added proper test cleanup, to avoid warning messages from the finalizer thread. * Hotfix for recurrent + continuous action nets in ML-Agents
* add Korean version of README.md plus empty docs and images * add Installation.md translated to Korean * fix main readme docs and move all the English documents into the docs folder * modify contents of 'Installation.md' and add Korean version 'Installation-Windows.md' (not completed) with related images * complete 1st translation of 'Installation-Windows.md' and add related images for the Korean docs * add Korean version 'Using-Docker.md' (not completed) * translate Training-PPO.md to Korean * change wording about epsilon in Training-PPO.md * fix Training-PPO section about epsilon * complete Korean translation of 'Using-Docker.md' * finish the Korean translation of Training Imitation Learning; information about the translators is also added * change all 'blogs.unity3d.com/' links to 'blogs.unity3d.com/kr' * remove all non-translated docs * add translator information
* Included explicit version # for ZN * added explicit version for KR docs * minor fix in installation doc * Consistency with numbers for reset parameters * Removed extra verbiage. minor consistency * minor consistency * Cleaned up IL language * moved parameter sampling above in list * Cleaned up language in Env Parameter sampling * Cleaned up migrating content * updated consistency of Reset Parameter Sampling * Rename Training-Generalization-Learning.md to Training-Generalization-Reinforcement-Learning-Agents.md * Updated doc link for generalization * Rename Training-Generalization-Reinforcement-Learning-Agents.md to Training-Generalized-Reinforcement-Learning-Agents.md * Re-wrote the intro paragraph for generalization * add titles, cleaned up language for reset params * Update Training-Generalized-Reinforcement-Learning-Agents.md * cleanup of generalization doc * More cleanup in generalization * Fixed title * Clean up included sampler type section * cleaned up defining new sampler type in generalization * cleaned up training section of generalization * final cleanup for generalization * Clean up of Training w Imitation Learning doc * updated link for generalization, reordered * consistency fix * cleaned up training ml agents doc * Update and rename Profiling.md to Profiling-Python.md * Updated Python profiling link * minor clean up in profiling doc * Rename Training-BehavioralCloning.md to Training-Behavioral-Cloning.md * Updated link to BC * Rename Training-RewardSignals.md to Reward-Signals.md * fix reward links to new * cleaned up reward signal language * fixed broken links to reward signals * consistency fix * Updated readme with generalization * Added example for GAIL reward signal * minor fixes and consistency to Reward Signals * referencing GAIL in the recording demonstration * consistency * fixed desc of bc and gail * comment fix * comments fix * Fix broken links * Fix grammar in Overview for IL * Add optional params to reward signals comment to GAIL
This fixes an issue where stopping the game when training in the Editor won't end training, due to the new asynchronous SubprocessEnvManager changes. Another minor change was made to move the `env_manager.close()` in TrainerController to the end of `start_learning` so that we are more likely to save the model if something goes wrong during the environment shutdown (this occurs sometimes on Windows machines).
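The ordering concern is the classic save-before-close pattern; a hedged sketch with hypothetical method names, not the PR's actual code:

```python
def start_learning(trainer, env_manager):
    # Shape of the fix: keep environment shutdown at the very end so an error
    # while closing (seen sometimes on Windows) cannot prevent saving the model.
    try:
        while not trainer.done:
            trainer.advance(env_manager)
        trainer.save_model()
    finally:
        env_manager.close()  # runs after the save, even if training raised
```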
harperj approved these changes on Aug 1, 2019.