Speech RL recipe update by yuekaizhang · Pull Request #24 · nvidia-china-sae/mair-hub

yuekaizhang · 2025-08-06T06:31:23Z

Updated the Docker image: `docker pull soar97/verl:app-verl0.4-vllm0.8.5-mcore0.12.2-te2.2`
Changed the training dataset from aishell3 to emilia_zh
Fixed the WER computation scripts, which now also support computing MER for code-switching cases
Fixed the input text normalization for cosyvoice3 zero_shot_zh
Reran the experiments with temperature=1.0 and top_p=1, and disabled algorithm.norm_adv_by_std_in_grpo to avoid difficulty bias
Updated the reward function and metrics (errors caused by tone are no longer counted)
Added a DAPO training recipe (should work with verl-project/verl@75de3de)
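
To illustrate why disabling `algorithm.norm_adv_by_std_in_grpo` can reduce difficulty bias, here is a minimal sketch of group-relative advantages with and without division by the group standard deviation. It is not verl's implementation: the function name `group_advantages`, the `eps` constant, and the use of the population standard deviation are assumptions made for this example.

```python
from statistics import mean, pstdev


def group_advantages(rewards, norm_by_std=True, eps=1e-6):
    """Group-relative advantages for the responses sampled from one prompt.

    Hypothetical helper for illustration only. `eps` and the population
    standard deviation are assumptions, not values taken from verl.
    """
    mu = mean(rewards)
    centered = [r - mu for r in rewards]
    if not norm_by_std:
        # Only subtract the group mean: the advantage scale follows the reward spread.
        return centered
    sigma = pstdev(rewards)
    # Dividing by the std rescales every group to a similar magnitude,
    # which inflates advantages for prompts whose rewards barely differ.
    return [c / (sigma + eps) for c in centered]


# A prompt where almost every sample scores well, versus one with a wide reward spread.
easy = [0.9, 0.9, 0.9, 1.0]
mixed = [0.0, 0.0, 1.0, 1.0]

for name, rewards in (("easy", easy), ("mixed", mixed)):
    with_std = max(group_advantages(rewards, norm_by_std=True))
    without_std = max(group_advantages(rewards, norm_by_std=False))
    print(f"{name}: max advantage with std norm={with_std:.3f}, without={without_std:.3f}")
```

With std normalization, the small reward gap in the easy group yields a larger maximum advantage than the wide gap in the mixed group. Without it, the ordering is reversed and the advantage stays proportional to the actual reward difference.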

| Model | Seed-TTS `test_zh` CER | CosyVoice3 `zero_shot_zh` CER | Comment |
|---|---|---|---|
| SFT (initialized from Qwen2-0.5B-Instruct) | 1.70% | 4.26% | See PR #1887 |
| GRPO (this work, trained on AIShell-3) | 1.06% | 3.01% | Commit |
| GRPO (this work, trained on emilia_zh subset, using top_p=1, temperature=1.0) | 0.87% | 2.63% (1800 steps) | |
| DAPO (this work, trained on emilia_zh subset, using top_p=1, temperature=1.0) | 0.83% | 2.71% (700 steps) | See `run_dapo.sh` |
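
For readers unfamiliar with MER on code-switched text, the following is a rough sketch of one common way to compute it: Chinese characters are scored individually, Latin-script runs are scored as whole words, and the edit distance is divided by the reference length. This is not the script from this PR. The names `tokenize_mixed`, `edit_distance`, and `mixed_error_rate`, as well as the tokenization rule itself, are assumptions made for illustration.

```python
import re

# Assumption: one token per CJK character, one token per run of Latin letters/digits.
_TOKEN = re.compile(r"[\u4e00-\u9fff]|[A-Za-z0-9']+")


def tokenize_mixed(text):
    """Split code-switched text into Chinese characters and lowercased English words."""
    return _TOKEN.findall(text.lower())


def edit_distance(ref, hyp):
    """Levenshtein distance between two token sequences (row-by-row dynamic programming)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        cur = [i]
        for j, h in enumerate(hyp, 1):
            cur.append(min(
                prev[j] + 1,             # deletion
                cur[j - 1] + 1,          # insertion
                prev[j - 1] + (r != h),  # substitution or match
            ))
        prev = cur
    return prev[-1]


def mixed_error_rate(reference, hypothesis):
    """Edit distance over mixed tokens divided by the number of reference tokens."""
    ref_tokens = tokenize_mixed(reference)
    hyp_tokens = tokenize_mixed(hypothesis)
    if not ref_tokens:
        raise ValueError("reference contains no scorable tokens")
    return edit_distance(ref_tokens, hyp_tokens) / len(ref_tokens)


print(tokenize_mixed("我喜欢 Python 编程"))
print(mixed_error_rate("我喜欢 Python 编程", "我喜欢 Python 变成"))
```

Scoring English at the word level keeps a single misrecognized English word from counting as many character errors, which would otherwise dominate the rate on mostly Chinese utterances.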

sharonyu-115

LGTM. Thank you!

yuekaizhang added 9 commits July 24, 2025 12:01

minor fix

755ae2a

add dockerfile

875ee0d

update decoding results and norm text before decoding

86771b8

update hyper params and dataset

938dfe7

Merge branch 'nvidia-china-sae:main' into speech_rl

ed7e86e

update latest result

17f396a

add dapo recipe

302085b

update results

1b57059

add license

811b768

yuekaizhang assigned sharonyu-115 Aug 11, 2025

yuekaizhang requested a review from sharonyu-115 August 11, 2025 08:43

sharonyu-115 approved these changes Aug 11, 2025

sharonyu-115 merged commit 3f4582c into nvidia-china-sae:main Aug 11, 2025

sharonyu-115 merged 9 commits into nvidia-china-sae:main from yuekaizhang:speech_rl
