Skip to content

Conversation

@awjuliani
Copy link
Contributor

@awjuliani awjuliani commented Jan 19, 2018

  • Add support for stacking past n states to allow network to learn temporal dependencies.
  • Add Banana Collector environment for demonstrating partially observable multi-agent environments.
  • Add 3DBall Hard which lacks velocity information in state representation. Used as test for LSTM and state-stacking features.
  • Rework Tennis environment to be continuous control and trainable in 100k steps.

@awjuliani awjuliani merged commit 36e8197 into development-0.3 Jan 22, 2018
@awjuliani awjuliani deleted the dev-gather branch January 22, 2018 19:43
@github-actions github-actions bot locked as resolved and limited conversation to collaborators May 20, 2021
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants