You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In the official examples (both async mode and colocate mode), these parameters are set differently:
--vllm_max_model_len 8192 \
--max_length 4096 \
Questions
Is this inconsistency intentional? If so, what's the rationale behind having different values for these parameters?
What are the potential impacts of having different values for these parameters?
Expected Behavior
I would expect these parameters to be consistent since they both relate to sequence length handling. Looking forward to clarification on these parameters.
The text was updated successfully, but these errors were encountered:
Description
I noticed an inconsistency in the official examples in https://swift.readthedocs.io/en/latest/Instruction/GRPO.html regarding the relationship between
max_length
andvllm_max_model_len
parameters.In the official examples (both async mode and colocate mode), these parameters are set differently:
Questions
Expected Behavior
I would expect these parameters to be consistent since they both relate to sequence length handling. Looking forward to clarification on these parameters.
The text was updated successfully, but these errors were encountered: