Skip to content
This repository was archived by the owner on Apr 25, 2023. It is now read-only.

Conversation

tfzee
Copy link

@tfzee tfzee commented Jan 17, 2020

It did not converge for me and it was very slow. So i did some changes it also improves performance
it solves cartpole after 30 episodes

Changes to hyperparameters
actual train in batches in the replay method(far better perfomance on big batches)
stop episode when done

It did not converge so i did some changes it also improves performance
it solves cartpole after 30 episodes

Changes to hyperparameters
actual train in batches in the replay method(far better perfomance on big batches)
stop episode when done
def replay(self, batch_size):
minibatch = random.sample(self.memory, batch_size)
stateBatch = np.zeros((batch_size,self.state_size))
targetBatch = np.zeros((batch_size,self.action_size))
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

could you use snake case to match with the rest of the code?

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants