Project DARWIN -- Diving Into Reinforcement Learning

Visit Individual Folders for Demo Videos!

Project Inspired by OpenAI's "Emergent Tool Use from Multi-Agent Autocurricula" Link Here

RL in a Nutshell

Example Training Loop

env = gym.make(
        "LunarLander-v2",
        continuous = True,
        gravity = -10.0,
        render_mode = None
    )

agent = Agent(alpha=0.000025, beta=0.00025, input_dims=[8], tau=0.001, env=env, batch_size=64, layer1_size=400, layer2_size=300, n_actions=4)

np.random.seed(0)
score_history = []

for i in range(1000):
    done = False
    score = 0
    obs, _ = env.reset()
    while not done:
        print(obs.shape)
        act = agent.choose_action(obs)
        new_state, reward, terminated, truncated, info = env.step(act)
        done = terminated or truncated
        agent.remember(obs, act, reward, new_state, int(done))
        agent.learn()
        score += reward
        obs = new_state

    score_history.append(score)
    print("episode", i, "score %.2f" % score, "100 game average %.2f" % np.mean(score_history[-100:]))
    if i % 25 == 0:
        agent.save_models()

Name		Name	Last commit message	Last commit date
Latest commit History 83 Commits
.vscode		.vscode
Half-Cheetah-DDPG-main		Half-Cheetah-DDPG-main
Humanoid		Humanoid
atari		atari
cartpole		cartpole
chess		chess
cliff-walker		cliff-walker
connect4		connect4
continuous-lunar-lander		continuous-lunar-lander
discrete-lunar-lander		discrete-lunar-lander
frozen-lake		frozen-lake
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Project DARWIN -- Diving Into Reinforcement Learning

RL in a Nutshell

Example Training Loop

The Team

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 6

Uh oh!

Languages

ghubnerr/darwin

Folders and files

Latest commit

History

Repository files navigation

Project DARWIN -- Diving Into Reinforcement Learning

RL in a Nutshell

Example Training Loop

The Team

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 6

Uh oh!

Languages

Packages