Sorry to bother you. I used the provided source code for training. At the 50th round of training, the loss was still large and showed no sign of converging, and the accuracy was also very low. Can you tell me why? Alternatively, could you share your checkpoint?
Thanks!
