Pre-Trained Language Models ; Transfer learning in NLP AND Weak supervision
Common descendants