Transformers ; Attention mechanism ; Entities ; Favoris ; Named Entity Recognition ; Unsupervised deep pre-training AND Unsupervised deep pre-training
Common descendants