Memory in deep learning ; Transformers AND EMNLP 2022
Common descendants