CO/AI Subscribe
Friday · June 19, 2026 · Issue No. 900
Video

Layering every technique in RAG, one query at a time – David Karam, Pi Labs (fmr. Google Search)

Watch on YouTube

Advanced RAG techniques make AI retrievals smarter

In the rapidly evolving landscape of AI retrieval systems, getting machines to understand human queries remains a formidable challenge. A recent talk by David Karam of Pi Labs, who brings expertise from his time at Google Search, dives deep into how Retrieval Augmented Generation (RAG) can be dramatically improved through layered techniques. While most organizations implement basic RAG systems, Karam argues that stacking multiple enhancement strategies delivers exponentially better results for business applications.

RAG has become a cornerstone technology for enterprises looking to ground their AI models in accurate, up-to-date information. But as Karam demonstrates, simple implementations barely scratch the surface of what's possible. By methodically applying a series of techniques that refine how systems interpret queries, retrieve information, and generate responses, organizations can transform mediocre results into remarkably precise answers that truly understand user intent.

Key Points

  • Basic RAG implementations frequently fail because they rely on simplistic keyword matching that misses the nuance and context of human queries
  • Advanced query transformation techniques like expansion, contextual enrichment, and decomposition can dramatically improve relevance by interpreting user intent more accurately
  • Combining multiple RAG techniques in sequence creates compound improvements that far exceed what any single method can achieve
  • Retrieval quality metrics like nDCG and faithfulness are essential for measuring improvement, but the ultimate test remains human evaluation

Why Layering Matters: The Compound Effect

The most compelling insight from Karam's presentation is how dramatically different techniques can work together to overcome limitations inherent in basic implementations. While a single enhancement might incrementally improve results, the real magic happens when multiple techniques compound.

This matters tremendously in the business context because enterprises are increasingly deploying RAG systems as customer-facing solutions. The difference between a system that occasionally misses the mark and one that consistently delivers accurate, contextually appropriate responses can determine whether customers embrace or abandon an AI solution. As competition in AI-powered tools intensifies, the quality gap between basic and advanced implementations will likely become a critical competitive differentiator.

Beyond the Presentation: Real-World Applications

What Karam's talk doesn't fully explore is how these techniques translate across different business domains. In healthcare, for example, query understanding takes on additional complexity because medical terminology

Share: X LinkedIn Email
Video Feed

More videos

All videos →
Claude Fable 5: When Capability Meets Economics
Video

Claude Fable 5: When Capability Meets Economics

Anthropic released Cloud Fable 5 with a paradox built in: safeguards sophisticated enough to let a mythosclass model...

Run Agentic AI Entirely on Your Mac—No Cloud, No Latency, No Privacy Tradeoffs
Video

Run Agentic AI Entirely on Your Mac—No Cloud, No Latency, No Privacy Tradeoffs

Apple’s MLX framework is mature enough now that you can run serious agentic AI workflows locally on Silicon...

Hermes Agent Master Class
Video

Hermes Agent Master Class

Welcome to the Hermes Agent Master Class — an 11-episode series taking you from zero to fully leveraging...

CONSULTING

Outsider
Labs.

A management consulting team focused on AI transformations for executives and business owners.

Work with us →