AI & Web Glossary
Retrieval-augmented generation (RAG)
Retrieval-augmented generation (RAG) is a technique where an AI system looks up relevant passages from your own documents first, then writes its answer from what it found, citing the source.
Retrieval-augmented generation is the standard way to make an AI answer from your information instead of its general training. When someone asks a question, the system first searches your documents for the most relevant passages, hands those passages to the language model, and tells it to answer using only what it was given. The answer can then cite which document it came from.
The reason RAG matters: it's the difference between an AI that vaguely remembers how warranties usually work and one that quotes your warranty policy. It also means updating the AI is as simple as updating the document. No retraining, no waiting.
Practical example: a property management company with hundreds of lease agreements can ask 'which tenants have early-termination clauses?' and get an answer with the specific leases cited. That's RAG doing the unglamorous work: search first, then write. Nearly every document AI and knowledge assistant we build uses it.
Where this shows up in practice
- Our Knowledge & Document AI service puts this to work for small and mid-sized businesses.
- From the blog: The unglamorous work of document AI
- From the blog: Building a custom AI assistant your staff will actually use