About This Document
- sl:arxiv_author : Omar Khattab, Keshav Santhanam, Xiang Lisa Li, David Hall, Percy Liang, Christopher Potts, Matei Zaharia
- sl:arxiv_firstAuthor : Omar Khattab
- sl:arxiv_num : 2212.14024
- sl:arxiv_published : 2022-12-28T18:52:44Z
- sl:arxiv_summary : Retrieval-augmented in-context learning has emerged as a powerful approach
for addressing knowledge-intensive tasks using frozen language models (LM) and
retrieval models (RM). Existing work has combined these in simple
\"retrieve-then-read\" pipelines in which the RM retrieves passages that are
inserted into the LM prompt. To begin to fully realize the potential of frozen
LMs and RMs, we propose Demonstrate-Search-Predict (DSP), a framework that
relies on passing natural language texts in sophisticated pipelines between an
LM and an RM. DSP can express high-level programs that bootstrap pipeline-aware
demonstrations, search for relevant passages, and generate grounded
predictions, systematically breaking down problems into small transformations
that the LM and RM can handle more reliably. We have written novel DSP programs
for answering questions in open-domain, multi-hop, and conversational settings,
establishing in early evaluations new state-of-the-art in-context learning
results and delivering 37-120%, 8-39%, and 80-290% relative gains against the
vanilla LM (GPT-3.5), a standard retrieve-then-read pipeline, and a
contemporaneous self-ask pipeline, respectively. We release DSP at
https://github.com/stanfordnlp/dsp
- sl:arxiv_title : Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP
- sl:arxiv_updated : 2023-01-23T17:00:01Z
- sl:bookmarkOf : https://arxiv.org/abs/2212.14024
- sl:creationDate : 2023-06-23
- sl:creationTime : 2023-06-23T09:54:22Z
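The abstract describes DSP's three-stage control flow: bootstrap demonstrations, search for passages, then predict a grounded answer. Below is a minimal sketch of that flow in Python; the `lm` and `rm` callables and every helper name are illustrative assumptions, not the actual stanfordnlp/dsp API.

```python
# Illustrative sketch of the Demonstrate-Search-Predict control flow from
# the abstract. The `lm` and `rm` callables and all names below are
# hypothetical stand-ins, not the actual stanfordnlp/dsp library API.
from typing import Callable, List

def dsp_qa(
    question: str,
    train: List[dict],                    # labeled {"question", "answer"} dicts
    lm: Callable[[str], str],             # frozen language model: prompt -> text
    rm: Callable[[str, int], List[str]],  # frozen retrieval model: (query, k) -> passages
) -> str:
    # DEMONSTRATE: build pipeline-aware demonstrations by running the same
    # search step on a few training examples, so the demos show the LM how
    # retrieved context is used.
    demos = []
    for ex in train[:3]:
        passages = rm(ex["question"], 2)
        demos.append(
            f"Context: {' '.join(passages)}\n"
            f"Question: {ex['question']}\nAnswer: {ex['answer']}"
        )

    # SEARCH: retrieve passages relevant to the test question. A multi-hop
    # program would instead ask the LM for follow-up queries and loop.
    passages = rm(question, 3)

    # PREDICT: generate an answer grounded in the retrieved passages.
    prompt = (
        "\n\n".join(demos)
        + f"\n\nContext: {' '.join(passages)}\nQuestion: {question}\nAnswer:"
    )
    return lm(prompt)
```

Each stage is a small transformation the frozen LM or RM can handle reliably; the paper's multi-hop and conversational programs vary the SEARCH stage while keeping this overall structure.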