In babyai:
- `levels` contains the code for all levels
- `bot.py` is a heuristic stack-based bot that can solve all levels
- `imitation.py` is an imitation learning implementation
- `rl` contains an implementation of the Proximal Policy Optimization (PPO) RL algorithm
- `model.py` contains the neural network code

In scripts:
- use `train_il.py` to train an agent with imitation learning, using demonstrations from the bot, from another agent, or even from a human
- use `train_rl.py` to train an agent with reinforcement learning
- use `make_agent_demos.py` to generate demonstrations with the bot or with another agent
- use `make_human_demos.py` to make and save human demonstrations
- use `train_intelligent_expert.py` to train an agent with an interactive imitation learning algorithm that incrementally grows the training set by adding demonstrations for the missions that the agent currently fails
- use `evaluate.py` to evaluate a trained agent
- use `enjoy.py` to visualize an agent's behavior
- use `gui.py` or `test_mission_gen.py` to see example missions from BabyAI levels
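
The interactive imitation learning idea behind `train_intelligent_expert.py` can be illustrated with a toy sketch. This is not the script's actual code: missions are plain integers, and `expert_action`, `train`, `failed_missions` and `grow_training_set` are hypothetical names made up for this example. The "agent" simply memorises demonstrations, which is enough to show how the training set grows only with missions the agent currently fails.

```python
import random


def expert_action(mission):
    # Hypothetical stand-in for the bot: the "correct" action for a mission.
    return mission % 4


def train(demos):
    # Toy "imitation learning": memorise the expert's action for each mission.
    return dict(demos)


def failed_missions(policy, missions):
    # Missions for which the policy does not reproduce the expert's action.
    return [m for m in missions if policy.get(m) != expert_action(m)]


def grow_training_set(missions, start_size=2, batch=3, seed=0):
    rng = random.Random(seed)
    # Start from a small random set of expert demonstrations.
    demos = {m: expert_action(m) for m in rng.sample(missions, start_size)}
    sizes = [len(demos)]
    while True:
        policy = train(demos)
        failures = failed_missions(policy, missions)
        if not failures:
            return policy, sizes
        # Add demonstrations only for missions the agent currently fails.
        for m in failures[:batch]:
            demos[m] = expert_action(m)
        sizes.append(len(demos))


policy, sizes = grow_training_set(list(range(10)))
```

Here `sizes` records the training set size after each round, so it shows the set growing until the toy agent solves every mission in the pool.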