Tweet ; Transformers AND Memory in deep learning
Common descendants