-
The Chinese University of Hong Kong
- https://jc-chen1.github.io/
- @Jiacheng_c
- in/jiacheng-chen-6746742b6
Highlights
- Pro
Pinned Loading
-
-
PRIME-RL/Entropy-Mechanism-of-RL
PRIME-RL/Entropy-Mechanism-of-RL PublicThe Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
-
-
MetaEvo/MetaBox
MetaEvo/MetaBox PublicMetaBox: Benchmarking Platform for Meta-Black-Box Optimization
-
volcengine/verl
volcengine/verl Publicverl: Volcano Engine Reinforcement Learning for LLMs
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.



