A curated list of resources dedicated to retrieval-augmented generation (RAG).

The retrieval-augmented generation (RAG) is to combine the merits of retrieval system and llm to generation high-quality answers for users.

RAG Framework

The Framework for RAG System

Typically, the rag system consists of a set of modules, where each task are described as follows:

  1. Interpreter: This component focuses on refining and enriching the user’s initial query or question to improve the subsequent retrieval process. By generating more detailed or expanded search queries, it helps the retrieval component to more effectively recall relevant documents.
  2. Retriever: This component is responsible for finding and fetching relevant documents or passages from a large corpus based on the refined user query. It acts as the primary information access layer, providing the foundational knowledge for the generation phase.
  3. Compressor: This component processes the retrieved documents and user questions to create an optimized context for LLM. It aims to refine, condense, and organize the retrieved information, ensuring that the most pertinent and concise context is passed on for accurate generation.
  4. Generator: This component leverages a LLM to synthesize a coherent, informative, and contextually relevant answer based on the user’s question and the provided refined contexts. It transforms raw information into a human-readable response.
  5. Validator: This component aims to improve the trustworthiness and quality of the generated answer by validating its accuracy and adherence to factual information within the provided contexts. It ensures the output is reliable and grounded.
  6. Evaluator: This component measures the overall performance and quality of the RAG system, assessing various aspects such as answer accuracy, retrieval effectiveness, and generation faithfulness. It provides metrics to understand and improve the system’s capabilities.

Surveys

  • The Organization column only record the organization of the first author.
Date Title Organization Code
2024/09/16 Trustworthiness in Retrieval-Augmented Generation Systems: A Survey Tsinghua University Code
2024/09/10 Graph Retrieval-Augmented Generation: A Survey Peking University Code
2024/07/13 Large Language Models and Future of Information Retrieval: Opportunities and Challenges ChengXiang Zhai, UIUC -
2024/02/29 Retrieval-Augmented Generation for AI-Generated Content: A Survey Peking University Code
2024/01/03 Retrieval-Augmented Generation for Large Language Models: A Survey Tongji University Code
2024/01/03 A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models Islamic University of Technology No
2023/12/07 Trends in Integration of Knowledge and Large Language Models: A Survey and Taxonomy of Methods, Benchmarks, and Applications Harbin Institute of Technology No
2023/09/19 The Rise and Potential of Large Language Model Based Agents: A Survey Fudan NLP Group Code
2023/08/14 Large Language Models for Information Retrieval: A Survey Renmin University Code
2022/02/02 A Survey on Retrieval-Augmented Text Generation Nara Institute of Science and Techonology No

Systems

  • The Organization column only record the organization of the first author.
Date Title Organization Code
2024/11/07 LightRAG: Simple and Fast Retrieval-Augmented Generation BUPT Code
2024/10/25 StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization ISCAS Code
2024/08/21 RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation Nanjing University Code
2024/07/11 Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting University of California, San Diego No
2024/06/19 InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales University of Virginia Code
2024/05/22 FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research Renmin University of China Code
2024/04/24 From Local to Global: A Graph RAG Approach to Query-Focused Summarization Microsoft Code
2023/11/22 FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation Google Code
2023/11/08 PDFTriage: Question Answering over Long, Structured Documents Stanford Code
2023/10/27 WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia Stanford Code
2023/10/27 LeanDojo: Theorem Proving with Retrieval-Augmented Language Models Caltech Code
2023/06/13 WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences Tsinghua University Code
2023/05/23 WebCPM: Interactive Web Search for Chinese Long-form Question Answering Tsinghua University Code
2022/06/01 WebGPT: Browser-assisted question-answering with human feedback Open AI No