Reinforcement-Learning

This repository demonstrates the implementation of SARSA, Q-Learning and Expected SARSA to the GYM - FrozenLake8x8-v0, an OpenAI environmental simulator. The work in this repository is inspired by the article by Vaibhav Kumar.

Frozen Lake Environment

The frozen lake environment is an 8x8 grid world and has total 64 states. In each episode the agent starts at S and the episode terminates when it reaches either hole H or goal G. The reward for reaching the goal is 1 while going to any other state results in 0 reward.

All three algorithms are implemented as given in Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto.

TD Q-Learning

The figure above illustrates the results obtained with Q - Learning.

The repository is still being updated for a better navigation through the code.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
assets		assets
README.md		README.md
TD__SARSA_Q_ESARSA.py		TD__SARSA_Q_ESARSA.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement-Learning

Frozen Lake Environment

TD Q-Learning

About

Releases

Packages

Languages

NiravRaiyani/Reinforcement_Learning

Folders and files

Latest commit

History

Repository files navigation

Reinforcement-Learning

Frozen Lake Environment

TD Q-Learning

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages