The environment is a square world where the agent is trained to collect bananas - the yellow ones while avoiding the blue ones. A reward of +1 is provided for collecting a yellow banana, and a reward of -1 is provided for collecting a blue banana.
The state space has 37 dimensions and contains the agent's velocity, along with ray-based perception of objects around agent's forward direction. Given this information, the agent has to learn how to best select actions. Four discrete actions are available, corresponding to:
0
: move forward.1
: move backward.2
: turn left.3
: turn right.
The task is episodic, and in order to solve the environment, your agent must get an average score of +13 over 100 consecutive episodes.
- Clone the repository:
git clone https://github.com/rehamelkholy/banana-navigation.git
- Follow the instructions here to install the dependencies. By successfuly following the instructions you will install PyTorch, the ML-Agents toolkit, and a few more Python packages required to complete the project.
(For Windows users) The ML-Agents toolkit supports Windows 10. While it might be possible to run the ML-Agents toolkit using other versions of Windows, it has not been tested on other versions. Furthermore, the ML-Agents toolkit has not been tested on a Windows VM such as Bootcamp or Parallels.
-
Download the environment from one of the links below. You need only select the environment that matches your operating system:
- Linux: click here
- Mac OSX: click here
- Windows (32-bit): click here
- Windows (64-bit): click here
(For Windows users) Check out this link if you need help with determining if your computer is running a 32-bit version or 64-bit version of the Windows operating system.
(For AWS) If you'd like to train the agent on AWS (and have not enabled a virtual screen), then please use this link to obtain the environment.
-
Unzip the downloaded UnityEnvironment to the folder you will be working.
-
Makse sure you have downloaded and installed Anaconda. You can download it from https://www.anaconda.com/distribution/
-
Now you can create your environment. Since this environment refers to a Udacity project for the Deep Reinforcement Learning Nanodegree, lets call our environment
DRLND
.Linux or Mac:
conda create --name drlnd python=3.6 source activate drlnd
Windows:
conda create --name drlnd python=3.6 activate drlnd
-
We will be working with pytorch version 0.4.0 (an early version), so make sure that you install this version of pytorch first by typing:
conda install pytorch=0.4.0 -c pytorch
-
Perform a minimal installation of the OpenAI Gym environment (see instructions here: https://github.com/openai/gym)
-
For the rest of the prerequisities please do type:
pip install .
The above line of code assumes that at the folder you are working, you have the
setup.py
which includes the UnityAgents and therequirements.txt
file that contains other useful packages (that exist in that repository). -
Create a Python execution backend for Jupyter for the drlnd environment
python -m ipykernel install --user --name drlnd --display-name "drlnd"
Now you are not only ready to use the UnityAgents evnironment, but the OpenAI Gym as well. You are all set to start playing with reinforcement learning environments! Yay!
Other useful utilities will also be installed if you follow these directions, including Jupyter Notebook, so consider the above installation guide as a complete guide to setup your RL environments!
Follow the instructions in Navigation.ipynb
to get started with training your own agent!
To get a deeper understanding of the working of the code, there is a Report.pdf
file which aids in understanding under-the-hood working of the agent and the underlying model.