ChatGPT: training AND RL from Human Feedback
Common descendants
4 Documents
2023-01-03 About