An implementation of Dreamer, a model-based reinforcement learning algorithm which uses model-generated (a.k.a 'imagined') experience to learn a policy in a generalized policy iteration scheme.
conda create -n jax-dreamer python=3.7
conda activate jax-dreamer
pip3 install -r requirements.txt
python3 train.py --configs defaults pendulum --log_dir pendulum