Favoris ; Instruction tuning ; Machine learning: techniques ; OpenAI AND RL from Human Feedback
Common descendants