RAG (retrieval-augmented generation): definition

RAG (retrieval-augmented generation)

RAG is a technique where an AI system retrieves relevant documents at query time and feeds them to a language model, so the answer is grounded in real sources rather than memory alone.

Updated May 2026Definition

MentionRadar TeamWe build tools that track where brands get cited across AI search engines.

What does RAG mean?

RAG combines two steps: retrieval (find documents relevant to the question) and generation (use those documents to write an answer). The retrieval step pulls in fresh, specific content; the generation step synthesizes it into prose, often with citations back to the retrieved sources.

How is RAG different from a plain language model?

A plain model answers only from what it learned in training, which can be stale or generic. A RAG system adds a live lookup, so it can quote current, specific pages — and attribute them. For the deeper mechanics, read RAG explained: how retrieval chooses sources.

Example

A user asks an assistant, “What is the latest pricing for tool X?” A RAG pipeline retrieves tool X’s current pricing page, passes it to the model, and the model answers with the up-to-date figure and a link to the page. Without retrieval, the model might guess from outdated training data.

Frequently asked questions

Why does RAG matter for getting cited?

Because RAG retrieves live pages and links them as sources. If your page is retrievable and clearly answers the query, RAG can surface and cite it — even content newer than the model’s training cutoff.

Is every AI answer powered by RAG?

No. Some answers come purely from the model’s training memory. Answers with clickable source links almost always involve retrieval.