Favoris ; Attention mechanism ; Transformers ; Language Model ; Hugging Face AND Knowledge distillation
Common descendants