REINFORCEMENT LEARNING

As an RL enthusiast, I've decided to implement many of the algorithms I found in books, courses, and papers. To me, implementing them is the best way to truly understand them.

GOAL

In this repo, you will find implementations of many of the best-known RL algorithms.

The algorithms are listed in the subdirectories. I've separated them into three classes:

Value-Based Methods: Algorithms that find the optimal policy by estimating the associated optimal value function $V^*(s)$
Policy-Based Methods: Algorithms that search directly for the optimal policy $\pi^*(a|s)$
Actor-Critic Methods: Algorithms that optimize both a value function and a policy function to find the optimal policy
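
To make the distinction between the classes concrete, here is a minimal NumPy sketch. It is not taken from the code in this repo: the 3-state toy MDP (`P`, `R`) and the helpers `value_iteration`, `softmax` and `reinforce_step` are all hypothetical names invented for illustration. The first function estimates $V^*(s)$ and derives a greedy policy from it (value-based); the second updates a tabular softmax policy $\pi(a|s)$ directly from a sampled return (policy-based).

```python
import numpy as np

# Hypothetical toy MDP (an assumption for illustration, not from this repo):
# 3 states in a chain, actions 0 = left, 1 = right, deterministic transitions.
# State 2 is absorbing; stepping from state 1 into state 2 gives reward 1.
P = np.array([[0, 1], [0, 2], [2, 2]])               # P[s, a] = next state
R = np.array([[0.0, 0.0], [0.0, 1.0], [0.0, 0.0]])   # R[s, a] = reward


def value_iteration(P, R, gamma=0.9, n_iters=100):
    """Value-based: estimate V*(s), then act greedily with respect to it."""
    V = np.zeros(R.shape[0])
    for _ in range(n_iters):
        # Bellman optimality backup: V(s) = max_a [R(s, a) + gamma * V(s')]
        V = (R + gamma * V[P]).max(axis=1)
    Q = R + gamma * V[P]
    return V, Q.argmax(axis=1)


def softmax(x):
    z = np.exp(x - x.max())  # subtract the max for numerical stability
    return z / z.sum()


def reinforce_step(theta, state, action, G, lr=0.1):
    """Policy-based: one REINFORCE-style ascent step on a tabular softmax policy."""
    grad_log = -softmax(theta[state])
    grad_log[action] += 1.0  # grad of log pi(a|s) w.r.t. theta[s] = onehot(a) - pi(.|s)
    new_theta = theta.copy()
    new_theta[state] += lr * G * grad_log
    return new_theta


V, policy = value_iteration(P, R)

# Taking "right" in state 0 and then "right" again yields a discounted return of 0.9.
theta = np.zeros((3, 2))
new_theta = reinforce_step(theta, state=0, action=1, G=0.9)
```

An actor-critic method would combine both ideas, for example by replacing the raw return `G` in `reinforce_step` with an advantage such as `G - V[state]` computed from a learned value estimate.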

IMPLEMENTATION

All the code is written in Python 3.

Here are the main libraries I used:

Environment
- Gym (https://gym.openai.com/)
Agent
- Numpy (https://numpy.org/)
- PyTorch (https://pytorch.org/)
Visualization
- Matplotlib (https://matplotlib.org/)

Repository contents
- actor_critic_methods
- policy_based_methods
- ressources
- value_based_methods
- .gitignore
- README.md

BenoitLeguay/Reinforcement_Learning_Basics
