Hi,
My Zipformer model is trained on over 200k hours of speech data.
Between epoch 11 and 13, the training curve plateaus with no visible gains.
However, performance suddenly degrades heavily once epoch 14 begins.
This problem has never occurred in my prior training runs with identical configurations.
How should I fix and debug this abrupt performance drop?
thanks.
Hi,
My Zipformer model is trained on over 200k hours of speech data.
Between epoch 11 and 13, the training curve plateaus with no visible gains.
However, performance suddenly degrades heavily once epoch 14 begins.
This problem has never occurred in my prior training runs with identical configurations.
How should I fix and debug this abrupt performance drop?
thanks.