Speed-up to O(1) from O(N) of the computation of each return in REINFORCE #1083

Chris1nexus · 2022-10-17T03:11:01Z

Fix

The function finish_episode() in the reinforce.py script computes returns by insertion at the beginning of a list.
This is an expensive computation in python as shown by the official docs, since it requires O(N) time to shift all its successor elements.
The same can be achieved but with O(1) time complexity with the python deque data structure, natively supported by python.
To verify the above claims, i tested insertion of 100k elements at the beginning of a list and a deque.
The results are:

list insertion at the beginning -> 1 second
deque insertion at the beginning -> 0.004 seconds

Result

Hence, this is a 250 x speed-up, which becomes relevant when the number of episodes becomes large, as this computation is done at the end of every episode.

Python docs Reference: https://docs.python.org/3/tutorial/datastructures.html#using-lists-as-queues

Test

As per the contribution guidelines, the following tests have been run and successfully completed.
./run_python_examples.sh "install_deps,reinforcement_learning,clean"

…t the beginning of the list of returns

netlify · 2022-10-17T03:11:08Z

✅ Deploy Preview for pytorch-examples-preview canceled.

Name	Link
🔨 Latest commit	`f29a6a8`
🔍 Latest deploy log	https://app.netlify.com/sites/pytorch-examples-preview/deploys/634cc7c7727ad00008c25ab5

hudeven

LGTM. The improvement is impressive!

Chris1nexus · 2022-10-18T13:26:12Z

Thanks for the quick review!

…ORCE (pytorch#1083) Replace list with deque to obtain O(1) time complexity of insertion at the beginning of the list of returns

Replace list with deque to obtain O(1) time complexity of insertion a…

f29a6a8

…t the beginning of the list of returns

hudeven approved these changes Oct 17, 2022

View reviewed changes

hudeven added enhancement reinforcement learning labels Oct 17, 2022

hudeven merged commit 74a70e1 into pytorch:main Oct 17, 2022

helpingstar mentioned this pull request Feb 19, 2023

Replace list with deque. #1115

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Speed-up to O(1) from O(N) of the computation of each return in REINFORCE #1083

Speed-up to O(1) from O(N) of the computation of each return in REINFORCE #1083

Uh oh!

Chris1nexus commented Oct 17, 2022

Uh oh!

netlify bot commented Oct 17, 2022 •

edited

Loading

Uh oh!

hudeven left a comment

Uh oh!

Chris1nexus commented Oct 18, 2022

Uh oh!

Uh oh!

Speed-up to O(1) from O(N) of the computation of each return in REINFORCE #1083

Speed-up to O(1) from O(N) of the computation of each return in REINFORCE #1083

Uh oh!

Conversation

Chris1nexus commented Oct 17, 2022

Fix

Result

Test

Uh oh!

netlify bot commented Oct 17, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for pytorch-examples-preview canceled.

Uh oh!

hudeven left a comment

Choose a reason for hiding this comment

Uh oh!

Chris1nexus commented Oct 18, 2022

Uh oh!

Uh oh!

netlify bot commented Oct 17, 2022 •

edited

Loading