Skip to content

Contains the implementation of SARSA, Q-learning and E-SARSA on gym environment.

Notifications You must be signed in to change notification settings

NiravRaiyani/Reinforcement_Learning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 

Repository files navigation

Reinforcement-Learning

This repository demonstrates the implementation of SARSA, Q-Learning and Expected SARSA to the GYM - FrozenLake8x8-v0, an OpenAI environmental simulator. The work in this repository is inspired by the article by Vaibhav Kumar.

Frozen Lake Environment

The frozen lake environment is an 8x8 grid world and has total 64 states. In each episode the agent starts at S and the episode terminates when it reaches either hole H or goal G. The reward for reaching the goal is 1 while going to any other state results in 0 reward.

All three algorithms are implemented as given in Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto.

TD Q-Learning

The figure above illustrates the results obtained with Q - Learning.

  • The repository is still being updated for a better navigation through the code.

About

Contains the implementation of SARSA, Q-learning and E-SARSA on gym environment.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages