Thanks for your work! While training the LISA model on the LOReL dataset with 50,000 episodes, I found that the action loss does not decline effectively, whereas the commitment loss declines as expected. As a result, the total loss stays at about 0.93 and does not decrease after 30 steps, and the evaluation success rate remains at zero after 2,500 iterations. Can you help me?