I finished my GRPO training. However, on the final epoch it gave me this message:
Train: 100%|█████████████████████████████████████| 811/812 [53:11:32<03:56, 236.12s/it]
[WARNING:swift] No training was carried out, which may be due to the dataset being too small or incorrect usage of resume_from_checkpoint.
[INFO:swift] End time of running main: 2025-04-13 06:05:03.922258
As a result, I did not get any trained checkpoints after such a long training run. I have been able to get checkpoints before; this time I only changed some hyperparameters (the dataset is the same 16k size) and I do not use resume_from_checkpoint, so I am confused about why this happened and how to fix it.
My bash script is this, which I run with the command bash train_GRPO.sh:
My library versions are: