Generative AI 101: What Is Retrieval-Augmented Generation?

Company

DataStax

Date Published

Aug. 14, 2024

Author

Alex Leventer

Word count

441

Language

English

Hacker News points

None

URL

www.datastax.com/blog/what-is-rag-retrieval-augmented-generation

Summary

Retrieval-augmented generation (RAG) is a technique that helps enhance large language models (LLMs) by providing context from external sources, such as databases or APIs. This improves the accuracy and relevance of LLM responses in generative AI applications. RAG combines three elements: generation, augmentation, and retrieval. Generation refers to working with an LLM without tailoring or prompt engineering. Augmentation adds detailed instructions for the LLM, while retrieval fetches information from external sources. By combining these elements, developers can create more accurate results from their prompts. RAG is particularly useful when building apps that require access to proprietary or real-time data not available in standard LLM training.