Definition

Pasted image 20250801150937.png

AI architecture that enhances large language models. The goal is to provide more accurate, up-to-date and contextually relevant responses while reducing hallucinations.

Stages:

  1. Retrieval Stage:
    User submits a query, RAG searches a knowledge base for relevant information.
  2. Augmentation Stage:
    The retrieved information is added to the users original query as context
  3. Generation Stage: The LLM generated a response grounded in the retrieved context, not just its pre-trained knowledge.

Retrieval:

Augmentation: