Omer Levy on Twitter: "What if I told you that fine-tuning T5-Large (0.8B params) on a couple hundred examples could outperform GPT-3 (175B params) on a bunch of tasks?"