Skip to content

Latest commit

 

History

History
52 lines (41 loc) · 2.15 KB

File metadata and controls

52 lines (41 loc) · 2.15 KB

Reinforcement Learning: An Introduction

This repository contains (some of the) programming exercises from Reinforcement Learning: An Introduction (Second Edition) by Richard S. Sutton and Andrew G. Barto. Each subdirectory in this project contains an overview of a topic covered in the book, the results from the exercises, and Python code for the exercises. There are also reproductions of some of the figures from the book and Python code to go along with them as well.

This is a work in progress.

Topics

  1. Chapter 2 - Bandit Problems
  2. Chapter 3 - The Reinforcement Learning Problem
  3. Chapter 4 - Dynamic Programming
  4. Chapter 5 - Monte Carlo Methods
  5. Chapter 6 - Temporal-Difference Learning

Getting Started

This project uses Python 3.6 and venv (Note: This is distinct from virtualenv. There are some issues using matplotlib on OSX with virtualenv). Ensure that you have both of these installed on your system.

Then, in the project directory, create your virtual environment:

python3.6 -m venv venv

This creates a folder called venv in which we can install Python libraries like numpy and matplotlib.

To tell your system to use this environment instead of the system-wide python environment, run:

source venv/bin/activate

You will need to do this anytime you want to run examples.

Next, to install the required libraries into the virtual environment, run:

pip install -r requirements.txt

All set! Run exercises by calling runner.py followed by the path to the exercise. For example:

python runner.py bandit_problems/exercises/ex_2_2_a.py

For some of the exercises, you can pass arguments to specify certain things about their execution (for example, number of trials in the case of the n-armed-bandit problems). You can see what these parameters are by passing -h like so:

python runner.py bandit_problems/exercises/ex_2_2_a.py -h