gridworld-RL

This project compares the sum of squared errors (SSE) and steps-to-goal curves of three temporal-difference learning algorithms on a 3x3 grid world.

The algorithms evaluated are:

  • Q-Learning
  • Double Q-Learning
  • SARSA(λ)
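To make the steps-to-goal metric concrete, here is a minimal sketch of tabular Q-Learning on a 3x3 grid. It is not the project's code: the layout (start in the top-left corner, goal in the bottom-right), the reward of -1 per non-terminal step, the hyperparameters, and the names `step`, `greedy`, and `q_learning` are all assumptions chosen for illustration.

```python
import random

# Hypothetical 3x3 grid world (an assumption, not the project's layout):
# states 0..8 in row-major order, start top-left, goal bottom-right.
SIZE = 3
START, GOAL = 0, SIZE * SIZE - 1
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(state, action):
    """Apply an action; a move off the grid leaves the state unchanged."""
    row, col = divmod(state, SIZE)
    d_row, d_col = ACTIONS[action]
    new_row, new_col = row + d_row, col + d_col
    if 0 <= new_row < SIZE and 0 <= new_col < SIZE:
        state = new_row * SIZE + new_col
    done = state == GOAL
    reward = 0.0 if done else -1.0  # assumed reward scheme
    return state, reward, done


def greedy(q_row, rng):
    """Pick an action with the highest value, breaking ties at random."""
    best = max(q_row)
    return rng.choice([a for a, v in enumerate(q_row) if v == best])


def q_learning(episodes=200, alpha=0.5, gamma=0.9, epsilon=0.1,
               seed=0, max_steps=1000):
    """Run epsilon-greedy Q-Learning and record steps to goal per episode."""
    rng = random.Random(seed)
    q = [[0.0] * len(ACTIONS) for _ in range(SIZE * SIZE)]
    steps_per_episode = []
    for _ in range(episodes):
        state, steps, done = START, 0, False
        while not done and steps < max_steps:
            if rng.random() < epsilon:
                action = rng.randrange(len(ACTIONS))
            else:
                action = greedy(q[state], rng)
            next_state, reward, done = step(state, action)
            # Off-policy target: bootstrap from the best next action.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state, steps = next_state, steps + 1
        steps_per_episode.append(steps)
    return q, steps_per_episode


if __name__ == "__main__":
    _, curve = q_learning()
    print("first 5 episodes:", curve[:5])
    print("last 5 episodes:", curve[-5:])
```

Plotting `steps_per_episode` against the episode index gives a steps-to-goal curve. In the same loop, Double Q-Learning would keep two value tables and use one to select the next action and the other to evaluate it, while SARSA(λ) would bootstrap from the action actually taken next and spread each update over recently visited state-action pairs through eligibility traces.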