RAG Evaluation: metrics and testing
Evaluation measures RAG system quality across retrieval accuracy, answer relevance, and end-to-end performance. This page is intentionally scaffolded so the structure is in place before the detailed guidance is added.
On this page
Key concepts
- Retrieval metrics:
- Generation metrics:
- Context relevance:
- Answer faithfulness: