[BUG]: Data processing

### Description of the bug

In process_annotation_direct_attribution.py line73 show that:
sotopia_pi_utterance_reward.append(
            {
                "instruction": d['prompt'],
                "output": d['result'],
                "value": calc_reward(d['attribution']['attribution'], d['goal_score']),
            }
        )

However, For trainning train_rm.py use data.py line 100 use that:
rendered_text = self.template.render(
            messages=[
                {"role": "user", "content": item["input"]},
                {"role": "assistant", "content": item["output"]}
            ]
i dont know whether there are some issue in data processing?

### Steps To Reproduce

1

### Additional Information

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[BUG]: Data processing #189

Description of the bug

Steps To Reproduce

Additional Information

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[BUG]: Data processing #189

Description

Description of the bug

Steps To Reproduce

Additional Information

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions