Multimodal Models AND Natural Language Supervision
Common descendants