Code for the paper Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy.
For set up, make a virtual environment with Python version >=3.11, and
pip install -r requirements.txtTo run the analytical experiments, the entry point
is batch_run_analytical.py.
The RNN experiments have the entry point
batch_run_ppo.py.
Instructions to run hyperparameter sweeps are
in scripts/experiments.md.
We also include the data used to generate our results in
results/analytical_results.csv and
results/rnn_results.csv.zip, which you can unzip for the CSV.