This repository was archived by the owner on Oct 31, 2023. It is now read-only.
Add Efficient Numpy Replay Buffer #17
Open
The default replay buffer requires very high RAM and causes frequent crashes due to PyTorch's data-loader memory-leak issue. I have therefore re-implemented DrQv2's replay buffer entirely in NumPy; it takes only about 20 GB of RAM to store all 1,000,000 transitions. Moreover, with this implementation there is no need to wait for a trajectory to complete before adding a new transition to the memory used for sampling.
FPS of this NumPy implementation appears to be identical (perhaps very slightly higher) on all machines I have tested it on. It could also yield (very minimal) performance gains, since the agent can now sample replay transitions from its latest trajectory.
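For illustration, here is a minimal sketch of a preallocated circular replay buffer in NumPy. The class name, signatures, and stored fields are hypothetical and chosen for clarity; they are not the PR's actual code. It shows the two properties described above: fixed memory footprint (arrays allocated once up front) and immediate availability of new transitions for sampling.

```python
import numpy as np


class NumpyReplayBuffer:
    """Circular replay buffer backed by preallocated NumPy arrays.

    Memory is allocated once at construction, so the footprint is fixed.
    Transitions can be sampled immediately after insertion; there is no
    need to wait for a trajectory to finish.
    """

    def __init__(self, capacity, obs_shape, action_shape, obs_dtype=np.uint8):
        self.capacity = capacity
        # Preallocate all storage up front (uint8 observations keep RAM low
        # for image-based tasks).
        self.obs = np.empty((capacity, *obs_shape), dtype=obs_dtype)
        self.next_obs = np.empty((capacity, *obs_shape), dtype=obs_dtype)
        self.actions = np.empty((capacity, *action_shape), dtype=np.float32)
        self.rewards = np.empty((capacity, 1), dtype=np.float32)
        self.discounts = np.empty((capacity, 1), dtype=np.float32)
        self.idx = 0
        self.full = False

    def __len__(self):
        return self.capacity if self.full else self.idx

    def add(self, obs, action, reward, discount, next_obs):
        """Insert one transition, overwriting the oldest when full."""
        self.obs[self.idx] = obs
        self.actions[self.idx] = action
        self.rewards[self.idx] = reward
        self.discounts[self.idx] = discount
        self.next_obs[self.idx] = next_obs
        self.idx = (self.idx + 1) % self.capacity
        self.full = self.full or self.idx == 0

    def sample(self, batch_size):
        """Uniformly sample a batch of stored transitions."""
        idxs = np.random.randint(0, len(self), size=batch_size)
        return (self.obs[idxs], self.actions[idxs], self.rewards[idxs],
                self.discounts[idxs], self.next_obs[idxs])
```

Because writes go straight into the shared arrays, the most recent transition is sampleable on the very next `sample` call, which is where the potential (minimal) performance gain mentioned above would come from.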
I have kept the original dataloader-based replay buffer as the default. The new NumPy buffer can be enabled by running `train.py` with the `replay_buffer=numpy` option.
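Assuming the repository's usual Hydra-style command-line overrides, the invocation would look like this (the exact override syntax is an assumption based on the option name given above):

```shell
# Train with the NumPy replay buffer instead of the default dataloader one
python train.py replay_buffer=numpy
```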