100% Accuracy with RAG

RAG vs RAG + agentic planning

We take 10 hard questions from the FRAMES benchmark. These questions test an AI system’s ability to handle complex reasoning and multi-hop retrieval, where answering requires combining facts from several documents. The knowledge base consists of Wikipedia articles. We see that a simple RAG pipeline with Claude 3.5 Sonnet