Reinforcement learning towards broadly and persistently beneficial models

(alignment.openai.com)

1 point | by jawiggins 11 hours ago

No comments yet.