Prompting/In-context learning AND RL from Human Feedback
Common descendants