Semanlink - Prithviraj (Raj) Ammanabrolu sur Twitter : "The secret to aligning LMs to human preferences is reinforcement learning. ..."

Impression

Recherche de Mot-clé

Recherche de Doc

Préférences...

Prithviraj (Raj) Ammanabrolu sur Twitter : "The secret to aligning LMs to human preferences is reinforcement learning. ..."

Tags:

Au sujet de ce document

sl:bookmarkOf : https://twitter.com/rajammanabrolu/status/1577690380161585152
sl:creationDate : 2022-10-06
sl:creationTime : 2022-10-06T01:56:53Z

Infos sur le fichier

Bookmark of: https://twitter.com/rajammanabrolu/status/1577690380161585152

Documents with similar tags (experimental)

Peter J. Liu sur Twitter : "RLHF-alternative without RL"

Tags:

2023-05-18 A propos

Aran Komatsuzaki sur Twitter : "Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning"

Tags:

2023-04-27 A propos

David Chalmers sur Twitter : "what are some new and interesting results about the relative capacities of multimodal models and pure language models... (thinking about "do language models need sensory grounding for meaning and understanding?".)"

Tags:

2023-03-15 A propos

elvis sur Twitter : "NEW: Meta AI introduces OPT-IML, a large language model (175B) fine-tuned on 2000 NLP tasks. Uses instruction-tuning to improve zero-shot and few-shot generalization abilities...."

Tags:

2022-12-23 A propos

Ekin Akyürek @ NeurIPS sur Twitter : "How does in-context learning work?..."

Tags:

2022-12-01 A propos

Will Manidis sur Twitter : "Billions of hours of human potential every year are wasted on menial tasks. Data entry, form filling, basic knowledge work kind of stuff..."

Tags:

2022-10-26 A propos

Harrison Chase sur Twitter : "Introducing LangChain: a python package aimed at helping build LLM applications through composability..."

Tags:

2022-10-25 A propos

Yi Tay sur Twitter : "Don't retrieve, recite!..."

Tags:

2022-10-06 A propos

Timo Schick sur Twitter : "PEER, a language model trained to incrementally write texts & collaborate w/ humans ..."

Tags:

2022-08-25 A propos

Andrej Karpathy sur Twitter : "For people wondering why, as a "vision person", I am interested in language models..."

Tags:

2022-07-18 A propos

Papers with Code sur Twitter : "10 Recent Trends in Language Models In this thread..."

Tags:

2022-04-25 A propos

(((ل()(ل() 'yoav))))👾 sur Twitter : "... another step in understanding how transformer-based LMs work..."

Tags:

2022-03-30 A propos

Guillaume Lample sur Twitter : "Last year, we showed that you can outperform a 24-layer transformer in language modeling with just...

Tags:

2020-10-10 A propos