Multimodal Models AND Grounded Language Learning
Common descendants