State Stacking & Banana Environment #262

awjuliani · 2018-01-19T22:00:31Z

Add support for stacking the past n states so the network can learn temporal dependencies.
Add the Banana Collector environment to demonstrate partially observable multi-agent environments.
Add 3DBall Hard, which omits velocity information from the state representation. It serves as a test for the LSTM and state-stacking features.
Rework the Tennis environment to use continuous control and to be trainable in 100k steps.
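
For readers unfamiliar with state stacking, here is a minimal sketch of the idea in plain Python. It is not the code from this PR: the class name `StateStacker` and the parameters `state_size` and `num_stacked` are hypothetical, and padding with zeros at the start of an episode is an assumption. The point is only that the network input becomes the concatenation of the last n state vectors, so an environment such as 3DBall Hard, which reports no velocity, can still expose motion through the difference between consecutive states.

```python
from collections import deque
from typing import Deque, List, Sequence


class StateStacker:
    """Hypothetical helper: keeps the last `num_stacked` states and
    returns them concatenated, oldest first."""

    def __init__(self, state_size: int, num_stacked: int) -> None:
        if state_size < 1 or num_stacked < 1:
            raise ValueError("state_size and num_stacked must be >= 1")
        self.state_size = state_size
        self.num_stacked = num_stacked
        self._frames: Deque[List[float]] = deque(maxlen=num_stacked)
        self.reset()

    def reset(self) -> None:
        """Clear the history at episode start (assumption: pad with zeros)."""
        self._frames.clear()
        for _ in range(self.num_stacked):
            self._frames.append([0.0] * self.state_size)

    def push(self, state: Sequence[float]) -> List[float]:
        """Add the newest state and return the stacked vector.

        The deque has maxlen=num_stacked, so appending drops the oldest state.
        """
        if len(state) != self.state_size:
            raise ValueError(
                f"expected a state of length {self.state_size}, got {len(state)}"
            )
        self._frames.append([float(x) for x in state])
        return [x for frame in self._frames for x in frame]

    @property
    def stacked_size(self) -> int:
        """Length of the vector the network would receive."""
        return self.state_size * self.num_stacked
```

With `StateStacker(state_size=2, num_stacked=3)`, each call to `push` returns a vector of length 6. Any code that validates the state size then has to compare against `stacked_size` rather than the raw `state_size`, which is the kind of mismatch the commit "Fixes to reconcile stacked states with state_size checks in trainers" appears to address.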

# Conflicts: # python/trainers/ppo_models.py

awjuliani and others added 17 commits December 12, 2017 17:34

Add gather environment

0cd5bc4

Allow easily turn respawning on and off

e00f191

made the banana parallel for training

9c79bd1

Use more efficient tag comparison

9fe4bf6

Add bad banana

7848c11

Add bad banana

d77dc2d

Merge remote-tracking branch 'origin/development-0.3' into dev-gather

802d8ab

Add state stacking

0acbcb6

Reorganize 3DBall to accommodate hard version

a957e90

Clean up Banana perception code & use log_sigma = ls(s) + ls_b

2005f24

Add new banana model

b64f529

Add banana environment to docs

b350a7a

Remove old tf models

e30e098

New banana bytes

5518a8d

Fix bug in raycasting

fa5a070

Fix Tennis to train in 100k steps instead of 5m (#250)

5d67a6c

Update description of Tennis environment

1c27302

awjuliani requested a review from vincentpierre January 19, 2018 22:00

awjuliani added 8 commits January 19, 2018 14:05

Scene change

24c54d6

Merge branch 'development-0.3' into dev-gather

d65e172

Merge branch 'development-0.3' into dev-gather

1067f8c

# Conflicts: # python/trainers/ppo_models.py

Fixes to reconcile stacked states with state_size checks in trainers

e87ac6f

Fix bug in buffer

72967ae

Add missing docstrings

07f54d1

Clean up some model code

d0db561

Bug fix

5ada009

vincentpierre approved these changes Jan 22, 2018

Merge branch 'development-0.3' into dev-gather

f832863

awjuliani merged commit 36e8197 into development-0.3 Jan 22, 2018

awjuliani deleted the dev-gather branch January 22, 2018 19:43

github-actions bot locked as resolved and limited conversation to collaborators May 20, 2021
