Sanjeev Arora sur Twitter : "A priori, fine-tuning a huge LM on a few datapoints could lead to catastrophic overfitting. So why doesn’t it? Our theory + experiments..."
Tags:
Au sujet de ce document
Infos sur le fichier
Documents with similar tags (experimental)