diff --git a/README.md b/README.md
index 8402cf2..903388d 100644
--- a/README.md
+++ b/README.md
@@ -35,6 +35,8 @@ PPO and PPO_CNN agents playing Pong-v0 game:
11. [Introduction to RL Proximal Policy Optimization algorythm (PPO)](https://pylessons.com/PPO-reinforcement-learning/)
12. [Let’s code from scratch a discrete Reinforcement Learning rocket landing agent! (PPO)](https://pylessons.com/LunarLander-v2-PPO/)
+
+13. [Continuous Proximal Policy Optimization Tutorial with OpenAI gym environment! (PPO)](https://pylessons.com/BipedalWalker-v3-PPO/)

PPO Pong-v0 Learning curve: