Discretizing Reward Models

(arxiv.org)

1 points | by gmays 2 hours ago

0 comments