The overview of RAG

A curated list of resources dedicated to retrieval-augmented generation (RAG).

The retrieval-augmented generation (RAG) is to combine the merits of retrieval system and llm to generation high-quality answers for users.

The Framework for RAG System

Typically, the rag system consists of a set of modules, where each task are described as follows:

Interpreter: This component focuses on refining and enriching the user’s initial query or question to improve the subsequent retrieval process. By generating more detailed or expanded search queries, it helps the retrieval component to more effectively recall relevant documents.
Retriever: This component is responsible for finding and fetching relevant documents or passages from a large corpus based on the refined user query. It acts as the primary information access layer, providing the foundational knowledge for the generation phase.
Compressor: This component processes the retrieved documents and user questions to create an optimized context for LLM. It aims to refine, condense, and organize the retrieved information, ensuring that the most pertinent and concise context is passed on for accurate generation.
Generator: This component leverages a LLM to synthesize a coherent, informative, and contextually relevant answer based on the user’s question and the provided refined contexts. It transforms raw information into a human-readable response.
Validator: This component aims to improve the trustworthiness and quality of the generated answer by validating its accuracy and adherence to factual information within the provided contexts. It ensures the output is reliable and grounded.
Evaluator: This component measures the overall performance and quality of the RAG system, assessing various aspects such as answer accuracy, retrieval effectiveness, and generation faithfulness. It provides metrics to understand and improve the system’s capabilities.

Surveys

The Organization column only record the organization of the first author.

Date	Title	Organization	Code
2024/09/16	Trustworthiness in Retrieval-Augmented Generation Systems: A Survey	Tsinghua University	Code
2024/09/10	Graph Retrieval-Augmented Generation: A Survey	Peking University	Code
2024/07/13	Large Language Models and Future of Information Retrieval: Opportunities and Challenges	ChengXiang Zhai, UIUC	-
2024/02/29	Retrieval-Augmented Generation for AI-Generated Content: A Survey	Peking University	Code
2024/01/03	Retrieval-Augmented Generation for Large Language Models: A Survey	Tongji University	Code
2024/01/03	A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models	Islamic University of Technology	No
2023/12/07	Trends in Integration of Knowledge and Large Language Models: A Survey and Taxonomy of Methods, Benchmarks, and Applications	Harbin Institute of Technology	No
2023/09/19	The Rise and Potential of Large Language Model Based Agents: A Survey	Fudan NLP Group	Code
2023/08/14	Large Language Models for Information Retrieval: A Survey	Renmin University	Code
2022/02/02	A Survey on Retrieval-Augmented Text Generation	Nara Institute of Science and Techonology	No

Systems

The Organization column only record the organization of the first author.

Date	Title	Organization	Code
2024/11/07	LightRAG: Simple and Fast Retrieval-Augmented Generation	BUPT	Code
2024/10/25	StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization	ISCAS	Code
2024/08/21	RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation	Nanjing University	Code
2024/07/11	Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting	University of California, San Diego	No
2024/06/19	InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales	University of Virginia	Code
2024/05/22	FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research	Renmin University of China	Code
2024/04/24	From Local to Global: A Graph RAG Approach to Query-Focused Summarization	Microsoft	Code
2023/11/22	FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation	Google	Code
2023/11/08	PDFTriage: Question Answering over Long, Structured Documents	Stanford	Code
2023/10/27	WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia	Stanford	Code
2023/10/27	LeanDojo: Theorem Proving with Retrieval-Augmented Language Models	Caltech	Code
2023/06/13	WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences	Tsinghua University	Code
2023/05/23	WebCPM: Interactive Web Search for Chinese Long-form Question Answering	Tsinghua University	Code
2022/06/01	WebGPT: Browser-assisted question-answering with human feedback	Open AI	No

GoMate

Surveys

Systems