Jacob Devlin talks about BERT at the Stanford NLP seminar
Tags:
Includes new results such as the effect of the masking strategy, using synthetic training data,...
About This Document
File info