Omer Levy sur Twitter : "What if I told you that fine-tuning T5-Large (0.8B params) on a couple hundred examples could outperform GPT-3 (175B params) on a bunch of tasks?"
Tags:
Au sujet de ce document
Infos sur le fichier
Documents with similar tags (experimental)
2023-02-25 A propos