Closed
Description
I noticed an inconsistency in the official examples at https://swift.readthedocs.io/en/latest/Instruction/GRPO.html regarding the relationship between the max_length and vllm_max_model_len parameters.
In the official examples (both async mode and colocate mode), these parameters are set differently:
--vllm_max_model_len 8192 \
--max_length 4096 \
Questions
- Is this inconsistency intentional? If so, what's the rationale behind having different values for these parameters?
- What are the potential impacts of having different values for these parameters?
Expected Behavior
I would expect these parameters to be consistent since they both relate to sequence length handling. Looking forward to clarification on these parameters.
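To make the question concrete, here is one possible reading, written as a small sketch. It rests on assumptions the docs do not confirm for me: that max_length caps only the prompt tokens, that a separate completion limit (called max_completion_length here) caps the generated tokens, and that vllm_max_model_len is the vLLM context window covering prompt plus completion. The helper names are hypothetical and not part of ms-swift.

```python
# Sketch only: the semantics below are assumptions, not confirmed ms-swift behavior.
# Assumption: max_length limits the prompt, max_completion_length limits the
# generated tokens, and vllm_max_model_len is the total context window.


def fits_context(max_length: int, max_completion_length: int,
                 vllm_max_model_len: int) -> bool:
    """Return True if the longest prompt plus the longest completion fits the window."""
    return max_length + max_completion_length <= vllm_max_model_len


def completion_headroom(max_length: int, vllm_max_model_len: int) -> int:
    """Tokens left for generation after the longest allowed prompt (never negative)."""
    return max(vllm_max_model_len - max_length, 0)


if __name__ == "__main__":
    # Values taken from the official example quoted above.
    print(completion_headroom(max_length=4096, vllm_max_model_len=8192))
    print(fits_context(4096, 4096, 8192))
```

If this reading is right, the two values in the example would differ on purpose, because setting them equal would leave no room for generation after a full-length prompt. Please correct me if the parameters mean something else.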