Skip to content

Vab-jain/rl_llm

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

rl_llm

A collection of tutorials and demos for reinforcement learning (RL) with large language models (LLMs).

Contents

  • Example scripts for training and playing with RL agents and LLMs
  • Tic-Tac-Toe PPO demos
  • Llama model learning scripts

Libraries & Technologies Used

Algorithms

  • Proximal Policy Optimization (PPO)
  • Maskable PPO (for environments with invalid action masking)

Usage

See the tutorials/ directory for example scripts and usage.

About

Playground repo for using Large Language Models for RL

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •  

Languages