Hi @plaguss @lewtun
I would like to use the PRMs from the training example.
However, I can’t find a suitable PRM class for generating completions.
reward_models.py contains only PRM classes for the MathShepherd and RLHFLlow PRM types.
Am I missing something? Is this missing class available somewhere?
Thank you 🙏