Skip to content

Latest commit

 

History

History
11 lines (10 loc) · 595 Bytes

File metadata and controls

11 lines (10 loc) · 595 Bytes

Pommerman environment used for the paper Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization

@article{barde2020adversarial,
  title={Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization},
  author={Barde, Paul and Roy, Julien and Jeon, Wonseok and Pineau, Joelle and Pal, Christopher and Nowrouzezahrai, Derek},
  journal={arXiv preprint arXiv:2006.13258},
  year={2020}
}

This code is essentially the result of merging https://github.com/MultiAgentLearning/playground and https://github.com/YichenGong/Agent47Agent