RAG: Retrieval-Augmented Generation as an engineering system
We put everything together: ingestion, chunking, embedding, indexing, retrieval, reranking, prompting, and evaluation. The focus is not on “one trick” but on building a reliable system: latency, quality, observability, and iteration loops.