Transformers ; Tweet ; Attention mechanism AND LMs: context length
Common descendants