Memory in deep learning ; Transformers ; Language Model AND EMNLP 2022
Common descendants