Semanlink - [2312.10997] Retrieval-Augmented Generation for Large Language Models: A Survey

[2312.10997] Retrieval-Augmented Generation for Large Language Models: A Survey

About This Document

sl:arxiv_author :
- Yi Dai
- Haofen Wang
- Xinyu Gao
- Jinliu Pan
- Yuxi Bi
- Kangxiang Jia
- Yun Xiong
- Yunfan Gao
- Jiawei Sun
sl:arxiv_firstAuthor : Yunfan Gao
sl:arxiv_num : 2312.10997
sl:arxiv_published : 2023-12-18T07:47:33Z
sl:arxiv_summary : Large language models (LLMs) demonstrate powerful capabilities, but they still face challenges in practical applications, such as hallucinations, slow knowledge updates, and lack of transparency in answers. Retrieval-Augmented Generation (RAG) refers to the retrieval of relevant information from external knowledge bases before answering questions with LLMs. RAG has been demonstrated to significantly enhance answer accuracy, reduce model hallucination, particularly for knowledge-intensive tasks. By citing sources, users can verify the accuracy of answers and increase trust in model outputs. It also facilitates knowledge updates and the introduction of domain-specific knowledge. RAG effectively combines the parameterized knowledge of LLMs with non-parameterized external knowledge bases, making it one of the most important methods for implementing large language models. This paper outlines the development paradigms of RAG in the era of LLMs, summarizing three paradigms: Naive RAG, Advanced RAG, and Modular RAG. It then provides a summary and organization of the three main components of RAG: retriever, generator, and augmentation methods, along with key technologies in each component. Furthermore, it discusses how to evaluate the effectiveness of RAG models, introducing two evaluation methods for RAG, emphasizing key metrics and abilities for evaluation, and presenting the latest automatic evaluation framework. Finally, potential future research directions are introduced from three aspects: vertical optimization, horizontal scalability, and the technical stack and ecosystem of RAG.@en
sl:arxiv_title : Retrieval-Augmented Generation for Large Language Models: A Survey@en
sl:arxiv_updated : 2023-12-18T07:47:33Z
sl:bookmarkOf : https://arxiv.org/abs/2312.10997
sl:creationDate : 2023-12-23
sl:creationTime : 2023-12-23T09:09:28Z

Documents with similar tags (experimental)

[2401.09350] Foundations of Vector Retrieval

2024-01-18

[2310.03025] Retrieval meets Long Context Large Language Models

2023-10-07

[2308.00081] Towards Semantically Enriched Embeddings for Knowledge Graph Completion

2023-08-02

[2306.08302] Unifying Large Language Models and Knowledge Graphs: A Roadmap

2023-06-18

[2302.05019] A Comprehensive Survey on Automatic Knowledge Graph Construction

2023-02-15

[2301.07014] Dataset Distillation: A Comprehensive Review

2023-01-23

[2301.08210] Everything is Connected: Graph Neural Networks

2023-01-21

[2011.06225] A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges

2022-09-08

[2010.00711] A Survey of the State of Explainable AI for Natural Language Processing

2022-09-08

[2008.07267] A Survey of Active Learning for Text Classification using Deep Neural Networks

2022-09-06

[2009.00236] A Survey of Deep Active Learning

2022-09-06

[2209.00099] Efficient Methods for Natural Language Processing: A Survey

2022-09-04

[2208.11857] Shortcut Learning of Large Language Models in Natural Language Understanding: A Survey

2022-08-27

[2207.06300] Re2G: Retrieve, Rerank, Generate

2022-07-14

[2006.00632] Neural Unsupervised Domain Adaptation in NLP---A Survey

2022-03-30

[2101.12294] Combining pre-trained language models and structured knowledge

2022-03-25

[2108.13934] Robust Retrieval Augmented Generation for Zero-shot Slot Filling

2022-01-19

[2005.11401] Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

2022-01-19

[2107.12708] QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering and Reading Comprehension

2021-08-06

[2107.00676] A Primer on Pretrained Multilingual Language Models

2021-07-13

[2010.06467] Pretrained Transformers for Text Ranking: BERT and Beyond

2021-07-09

[2010.12309] A Survey on Recent Approaches for Natural Language Processing in Low-Resource Scenarios

2021-07-06

[2006.07264] Low-resource Languages: A Review of Past Work and Future Challenges

2021-07-06

[2011.02260] Graph Neural Networks in Recommender Systems: A Survey

2020-11-11

[2010.05234] A Practical Guide to Graph Neural Networks

2020-10-15

[2004.03705] Deep Learning Based Text Classification: A Comprehensive Review

2020-10-11

[2005.03675] Machine Learning on Graphs: A Model and Comprehensive Taxonomy

2020-10-03

[1911.02685] A Comprehensive Survey on Transfer Learning

2020-09-24

[2006.13365] Bringing Light Into the Dark: A Large-scale Evaluation of Knowledge Graph Embedding Models Under a Unified Framework

2020-06-26

[1910.12507] A Survey on Knowledge Graph Embeddings with Literals: Which model links better Literal-ly?

2020-05-04

[2004.14843] Knowledge Graph Embeddings and Explainable AI

2020-05-04

[2004.14545] Explainable Deep Learning: A Field Guide for the Uninitiated

2020-05-01

[2004.10151] Experience Grounds Language

2020-04-22

[2003.08271] Pre-trained Models for Natural Language Processing: A Survey

2020-03-19

[2003.00330] Graph Neural Networks Meet Neural-Symbolic Computing: A Survey and Perspective

2020-03-15

[2003.02320] Knowledge Graphs

2020-03-07

[2002.12327] A Primer in BERTology: What we know about how BERT works

2020-02-28

[1802.07569] Continual Lifelong Learning with Neural Networks: A Review

2020-01-01

[1808.02590] A Tutorial on Network Embeddings

2019-08-25

[1901.00596] A Comprehensive Survey on Graph Neural Networks

2019-07-15

[1812.05944] A Tutorial on Distance Metric Learning: Mathematical Foundations, Algorithms and Experiments

2019-06-18

[1709.07604] A Comprehensive Survey of Graph Embedding: Problems, Techniques and Applications

2019-05-29

[1812.09449] A Survey on Deep Learning for Named Entity Recognition

2019-04-24

[1807.07984] Attention Models in Graphs: A Survey

2018-11-14

[1601.00670] Variational Inference: A Review for Statisticians

2018-08-07

[1805.04032] From Word to Sense Embeddings: A Survey on Vector Representations of Meaning

2018-05-30

[1706.04902] A Survey Of Cross-lingual Word Embedding Models

2018-05-20