yes
probably, but I haven't tried
yes
-
The docs say that "fine-tuning gpt-oss-20b is now supported". Does this mean that RL is also supported, both GRPO and PPO? Also, is it supported with vLLM for gpt-oss-20b?
Does anyone have a script for this that already works? I've been struggling to set this up all day.