verl-project · yuekaizhang · Jul 18, 2025 · Jul 18, 2025 · Jul 25, 2025 · Jul 25, 2025
diff --git a/README.md b/README.md
@@ -228,6 +228,7 @@ verl is inspired by the design of Nemo-Aligner, Deepspeed-chat and OpenRLHF. The
 - [LLaSA-TTS-GRPO](https://github.com/channel-io/ch-tts-llasa-rl-grpo): TTS fine-tuning with GRPO optimization based on LLASA models ![GitHub Repo stars](https://img.shields.io/github/stars/channel-io/ch-tts-llasa-rl-grpo)
 - [PF-PPO](https://arxiv.org/abs/2409.06957): Policy Filtration for PPO based on the reliability of reward signals for more efficient and robust RLHF.
 - [RACRO](https://github.com/gyhdog99/RACRO2): Build multi-modal reasoning models via decoupling it into query-conditioned captioning and text-only reasoning ![GitHub Repo stars](https://img.shields.io/github/stars/gyhdog99/RACRO2)
+- [CosyVoice-TTS-GRPO](https://github.com/nvidia-china-sae/mair-hub/tree/main/rl-tutorial/cosyvoice_llm): CosyVoice TTS GRPO fine-tuning recipe ![GitHub Repo stars](https://img.shields.io/github/stars/nvidia-china-sae/mair-hub)
 
 and many more awesome work listed in [recipe](recipe/README.md).