Multimodal Models ; Natural Language Supervision AND Zero shot
Common descendants