Pre-Trained Language Models ; RL from Human Feedback AND USA
Common descendants