Skip to content

Conversation

yfynb1111
Copy link

What this does

Fixes #2051

How it was tested

this code "policy.from_pretrained(env_cfg.pretrained_policy_name_or_path)" is wrong,because from_pretrained is a classmethod method,It will return a model, but the parameters of the instance it is called on will not be updated.So you need to change it to ”policy = policy.from_pretrained(env_cfg.pretrained_policy_name_or_path)“

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

hil-serl performs well during training, but performs poorly when using the eval_policy.py
1 participant