NLP@Stanford ; NLP: pretraining AND RL from Human Feedback
Common descendants