Update ddqn.py #30

tfzee · 2020-01-17T18:34:02Z

It did not converge for me and it was very slow. So i did some changes it also improves performance
it solves cartpole after 30 episodes

Changes to hyperparameters
actual train in batches in the replay method(far better perfomance on big batches)
stop episode when done

It did not converge so i did some changes it also improves performance it solves cartpole after 30 episodes Changes to hyperparameters actual train in batches in the replay method(far better perfomance on big batches) stop episode when done

keon · 2020-01-27T06:19:06Z

ddqn.py

    def replay(self, batch_size):
        minibatch = random.sample(self.memory, batch_size)
+        stateBatch = np.zeros((batch_size,self.state_size))
+        targetBatch = np.zeros((batch_size,self.action_size))


could you use snake case to match with the rest of the code?

Update ddqn.py

85c641e

It did not converge so i did some changes it also improves performance it solves cartpole after 30 episodes Changes to hyperparameters actual train in batches in the replay method(far better perfomance on big batches) stop episode when done

keon reviewed Jan 27, 2020

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update ddqn.py #30

Update ddqn.py #30

Uh oh!

tfzee commented Jan 17, 2020

Uh oh!

keon Jan 27, 2020

Uh oh!

Uh oh!

Update ddqn.py #30

Are you sure you want to change the base?

Update ddqn.py #30

Uh oh!

Conversation

tfzee commented Jan 17, 2020

Uh oh!

keon Jan 27, 2020

Choose a reason for hiding this comment

Uh oh!

Uh oh!