Skip to content

Release 0.10.0 #2648

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 69 commits into from
Sep 30, 2019
Merged

Release 0.10.0 #2648

merged 69 commits into from
Sep 30, 2019

Conversation

ervteng
Copy link
Contributor

@ervteng ervteng commented Sep 30, 2019

DanAmador and others added 30 commits August 20, 2019 22:06
Fixed small typo in documentation.
* specify dirs, exclude test files

* update comments

* html coverage in CI artifacts

* add destination

* ignore coverage files

* check gym-unity too
* Fix bug with construct_curr_info
* Add more tests
* initialize trainer step count

* remove step init from RLTrainer
* Add Soft Actor-Critic model, trainer, and policy and sac_trainer_config.yaml
* Add documentation for SAC and tweak PPO documentation to reference the new pages.
* Add tests for SAC, change simple_rl test to run both PPO and SAC.
)

* This addresses #1835. Baselines expects single environments used with their ppo2 algorithm to be wrapped in a DummyVecEnv. The old readme did not instruct the reader to do so and the code failed to run with the latest version of baselines. This imports the correct function from baselines and fixes the make_unity_env function described in the readme.

* added line to gym-unity/README.md to note the version of baselines the examples were tested with
This change was requested for clarity during the async EnvManager
PR.  It's a simple rename of the StepInfo class.
Merege 0.9.3 changes to develop
* unit test - don't use global random generator

* Update test_simple_rl.py
* initialize random instance correctly

* restore threshold (I hope)
Only cosmetic and readability improvements. No functional changes were intended.

Utilities.cs
- Fixed comments across file
- Made class static
- Removed unnecessary imports
- Removed unused method arguments
- Renamed variables as appropriate to make usage clearer
- In AddRangeNoAlloc, disabled (by comment) Rider’s suggestion to revert to use of built-in Range field (Fixed)
- In TextureToTensorProxy, swapped order of first two arguments to be more in-line with convention of input, output

UtilitiesTests.cs
- Removed unnecessary imports
- Simplified array creation commands

GeneratorImp.cs
- Rider automatically deleted spaces on empty lines
- Changed call to TextureToTensorProxy to mirror new argument ordering

* Clean-up to UnityAgentsException.cs

- Removed unnecessary imports
- Fixed comment warning
- Fixed method header

* Improvements to Startup.cs

- Created const for SCENE_NAME field
- Fixed string formatting for exception message
- Added new line at EOF

* Adding team-wide settings file

* Clean up to RandomNormal

- Removed FillTensor from RandomNormal and moved to TensorUtils
- Removed the tests in RandomNormalTest that are for FillTensor to TensorUtilsTest
- Fixed GeneratorImpl to use to TensorUtils.FillTensor method

* Decouples DiscreteActionOutputApplier from Multinomial

- Add use of var where appropriate
- Makes variables readonly where appropriate
- Makes private variables start with “_”
- Removes DiscreteActionOutputApplier context from Multinomial - class is much simpler now
- Moves most of the tests in MultinomialTest to a new DiscreteActionOutputApplierTest class
- Adds a basic Multinomial test to MultinomialTest

* Minor formatting to TensorProxy

- Limiting code to 100 characters
- renaming private vars to start with “_”
- fixing indentation in file

Couple of other minor fixes to TensorUtilsTest and Discrete…Test.cs

* Adding new line to EOF for TensorProxy

* Adding missing meta files.

* Removing three unused methods from TensorProxy

* (Big) Rename of TensorProxy fields

Changed Name —> name; ValueType —> valueType; Shape —> shape; Data —> data

* Additional cosmetic changes to a number of files previously modified

- Limit to 100 chars per line
- Appropriate formatting
- Removed Rider warnings

* Improvements to Multinomial(Test).cs

Fixed the tests for Multinomial as they didn’t include cumulative mass functions.

Also, improved the documentation.
* Add a note on Custom messages about needing trainer changes

* move to top (aside doesn't render as expected)

* Don't blockquote

* Update wording
* [memory] Fix for tensors not being disposed of.

* Fix member name.
Jonathan Harper and others added 23 commits September 18, 2019 14:11
We have been ignoring unused imports and star imports via flake8. These are
both bad practice and grow over time without automated checking.  This
commit attempts to fix all existing import errors and add back the corresponding
flake8 checks.
This wasn't working before because of several remaining partially defined
function definitions.
* encapsulate commandline args

* fix tests

* add tests on cmdline parsing

* cleanup

* remove docopt

* simplify --slow
* TensorFlowSharp is no more

* Removed old documents
* Added option to use environment arguments in learn

* hook into argparse

* add example to readme
Our multi-GPU training had a regression such that freezing the
graph was broken.  This change fixes that issue by making a few
changes:

* Removes the top level "tower" variable scope added by multi-GPU
  so that the output nodes have correct names
* Removes the use of "freeze_graph" and replaces it with our own similar 
  functionality.
* Adds the "auto reuse" to network layers which require them
* image decompress timer

* dont lazy load the image
…cy brain, we will remove the action descriptions from the dictionary of things we need to compare. This is to prevent the case where a user has different descriptions for his actions but still wants to train a brain using expert demonstrations. (#2517)
* WIP cleanup loading

* better exceptions for parser errors - refer to online lint tools

* feedback - rename variable
* new env styles rebased on develop

* added new trained models

* renamed food collector platforms

* reduce training timescale on WallJump from 100 to 10

* uncheck academy control on walljump

* new banner image

* rename banner file

* new example env images

* add foodCollector image

* change Banana to FoodCollector and update image

* change bouncer description to include green cube

* update image

* update gridworld image

* cleanup prefab names and tags

* updated soccer env to reference purple agent instead of red

* remove unused mats

* rename files

* remove more unused tags

* update image

* change platform to agent cube

* update text. change platform to agents head

* cleanup

* cleaned up weird unused meta files

* add new wall jump nn files and rename a prefab

* walker change stacked states from 5 to 1

walker collects physics observations so stacked states are not needed

* fixed reacher platform and crawlerstatic scene

* add more space between agents

* adjust x pos of camera based on new agent spacing

* adjust colliders & testing new nn models

* change speed on hallway agent and make rb continousDynamic

also update goal collider size in pushblock

* pushblock add new model, crawler decrease spawn radius

* reset cube pos each agent reset

* added new hallway nn model, fixed goal gameobjects

* add new files for 3dball

* added new models

* fixed crawlerStatic target spawn

* Increase bouncer steps

* [format] Format code to be complaint with Unity coding conventions. (#2608)

* Revert Batcher changes
* Add note about using GPU inference for ResNet
* Tick versions for pip packages
* Tick API version to 10
* Re-record demos for new envs
* Add better Hallway brain
* Remove Banana
* Tweak SAC hyperparams

* Make network bigger

* Properly report entropy

* Revert "Properly report entropy"

This reverts commit 383a8d8.
Copy link
Contributor

@vincentpierre vincentpierre left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🚢 🇮🇹

@ervteng ervteng merged commit a7c1fcc into master Sep 30, 2019
@github-actions github-actions bot locked as resolved and limited conversation to collaborators May 17, 2021
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.