Memory in deep learning; Guillaume Lample AND Language Model
Common descendants