Skip to content
@PRIME-RL

PRIME-RL

Researching scalable (RL) methods on language models.

Pinned Loading

  1. P1 P1 Public

    P1: Mastering Physics Olympiads with Reinforcement Learning

    15

  2. SimpleVLA-RL SimpleVLA-RL Public

    SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

    Python 894 40

  3. Entropy-Mechanism-of-RL Entropy-Mechanism-of-RL Public

    The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

    Python 360 12

  4. RL-Compositionality RL-Compositionality Public

    FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones

    Python 30 3

  5. TTRL TTRL Public

    [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

    Python 875 64

  6. PRIME PRIME Public

    Scalable RL solution for advanced reasoning of language models

    Python 1.8k 99

Repositories

Showing 7 of 7 repositories