← Back to blog
2026.2.20|Talk Recording

RAG at Scale — Community Talk Recording

RAGProductionpgvectorEvaluation
RAG at Scale — Community Talk Recording

In this talk from our February 2026 meetup, we explore how to scale RAG pipelines beyond the prototype stage to production workloads handling millions of documents.

The bottleneck is never where you think it is. Profile first, optimise second.

Topics Covered

  • Chunking strategies and their impact on retrieval quality
  • Choosing between pgvector, Qdrant, and Weaviate
  • Hybrid search: combining BM25 and dense vectors
  • Re-ranking with cross-encoders
  • Evaluation: RAGAS metrics in CI/CD

Watch the full recording below. Slides are available on our GitHub.