-
Notifications
You must be signed in to change notification settings - Fork 169
Open
Description
I'm interested as to why you decided not to create a local copy of the variables in the worker threads and sync them with the global network at the end of the rollout. Does that create issues with the global network (being used for inference in the rollout) being updated in the middle of rollout? Is there a reason why you changed your algorithm from the one described in the Async methods for RL paper?
hb128
Metadata
Metadata
Assignees
Labels
No labels