Lunar Lander - Deep Q-Learning

The goal of this project is to train an agent to safely land a lunar lander on a landing pad using reinforcement learning. The agent learns to control the lander's engines to adjust its trajectory, balance fuel efficiency, and avoid crashes. The environment is considered solved when the agent achieves an average score of 200 points over the last 100 episodes.

This project implements a Deep Q-Learning agent to solve the Lunar Lander environment from OpenAI Gym.

Features

Deep Q-Network (DQN): A reinforcement learning algorithm that uses a neural network to approximate the Q-function ( Q(s, a) ).
Experience Replay: Improves learning by sampling past experiences.
Target Network: A separate network used to stabilize training by providing consistent targets for Q-value updates.
ε-Greedy Policy: Balances exploration and exploitation during training.

Mathematical Equations

Dependencies

Python Packages:
- gym: Lunar Lander environment.
- numpy: Numerical operations.
- tensorflow: Neural network framework.
- imageio: Video generation.
- pyvirtualdisplay: Headless rendering.
System Dependencies:
- xvfb, python-opengl, ffmpeg: For display and video.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
images		images
videos		videos
Lunar Lander - Deep Q-Learning.ipynb		Lunar Lander - Deep Q-Learning.ipynb
README.md		README.md
lunar_lander_model.h5		lunar_lander_model.h5
maths.jpg		maths.jpg
maths.png		maths.png
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Lunar Lander - Deep Q-Learning

Features

Mathematical Equations

Dependencies

About

Uh oh!

Releases

Packages

Languages

ptl-harsh/Lunar_Lander-DQL

Folders and files

Latest commit

History

Repository files navigation

Lunar Lander - Deep Q-Learning

Features

Mathematical Equations

Dependencies

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages