Sanjeev Arora sur Twitter : "A priori, fine-tuning a huge LM on a few datapoints could lead to catastrophic overfitting. So why doesn’t it? Our theory + experiments..."
Tags:
About This Document
File info
Documents with similar tags (experimental)