Skip to main content
Enterprise AI Analysis: SoK: Agentic Retrieval-Augmented Generation (RAG): Taxonomy, Architectures, Evaluation, and Research Directions

Systematization of Knowledge

SoK: Agentic Retrieval-Augmented Generation (RAG): Taxonomy, Architectures, Evaluation, and Research Directions

This Systematization of Knowledge (SoK) paper introduces a unified framework for Agentic Retrieval-Augmented Generation (RAG) systems. It formalizes agentic retrieval-generation loops as finite-horizon partially observable Markov decision processes, develops a comprehensive taxonomy and modular architectural decomposition, analyzes limitations of traditional evaluation and identifies severe systemic risks, and outlines key doctoral-scale research directions for building reliable, controllable, and scalable agentic retrieval systems.

Executive Impact: Key Advantages of Agentic RAG

Agentic RAG represents a fundamental paradigm shift, offering significant improvements over traditional RAG pipelines.

0% Reduced Context Overload
0x Improved Reasoning Quality
0% Lower Hallucination Rate

Deep Analysis & Enterprise Applications

Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.

Formalization
Taxonomy
Evaluation
Safety

Formalizing Agentic RAG

Agentic RAG is framed as a finite-horizon Partially Observable Markov Decision Process (POMDP), where LLMs autonomously coordinate multi-step reasoning, dynamic memory, and iterative retrieval strategies. This formalization highlights the shift from static pipelines to policy-driven control.

Comprehensive Taxonomy

A multi-dimensional taxonomy organizes Agentic RAG systems by planning mechanisms, retrieval orchestration, memory paradigms, and tool-invocation behaviors. This provides a structured understanding of the design space.

Advanced Evaluation Frameworks

Traditional static evaluation practices are insufficient for autonomous loops. A layered evaluation perspective is proposed, moving from static answer metrics toward trajectory-level assessment of reasoning and retrieval behavior.

Addressing Systemic Risks

Critical limitations of autonomous loops, including compounding hallucination propagation, memory poisoning, and cascading tool-execution vulnerabilities, are analyzed. Key research directions aim to build reliable and controllable agentic systems.

Agentic RAG Architectural Evolution

Traditional RAG (Static)
Iterative/Active RAG (Limited Loop)
Planning-Driven Retrieval (Tool Call)
Agentic RAG / POMDP Framing (Cyclic Control)

Active RAG vs. Agentic RAG Distinctions

Feature Active RAG Agentic RAG
Trigger Log-probability thresholds or token heuristics Policy-driven reasoning and explicit tool-calling
Control Flow Single-pass, forward-generating Iterative, multi-step planning loops
Context Management Append-only (accumulates fetched text) Read/Write/Prune capabilities over working memory

Case Study: Industrial Adoption - SWE-agent Framework

The SWE-agent framework operationalizes agentic RAG for software engineering, providing an Agent-Computer Interface (ACI) to isolate and execute codebase operations safely. Instead of full-file overwrites, it uses targeted diff patching and dynamic exploration, coupling dynamic code retrieval with iterative execution feedback for self-improvement. This demonstrates real-world application of agentic principles in complex embodied environments.

33% Observed Hallucination Rate in Legal RAG Tools (despite retrieval)

Source: Journal of Empirical Legal Studies, 2025 [108]

Closed-Loop Verification (PPAR Cycle)

Reasoning Engine Proposes Output
Verification Module (Pass/Fail)
Structured Failure Feedback
HITL Escalation (if cannot converge)

Quantify Your AI Transformation ROI

Estimate the potential cost savings and efficiency gains for your organization with Agentic RAG.

Estimated Annual Savings $0
Hours Reclaimed Annually 0

Your Agentic RAG Implementation Roadmap

A phased approach to integrate autonomous RAG into your enterprise workflows for optimal results.

Discovery & Planning

Conduct initial feasibility study, define scope, and develop a high-level architectural plan.

Duration: 2-4 Weeks

Proof of Concept (PoC)

Implement core RAG components and test basic retrieval/generation loops with sample data.

Duration: 4-8 Weeks

Iterative Agent Development

Integrate agentic control, dynamic memory, and tool invocation; establish initial evaluation metrics.

Duration: 8-16 Weeks

Advanced Evaluation & Refinement

Conduct trajectory-level assessment, address reliability risks, and optimize cost-efficiency.

Duration: 6-12 Weeks

Deployment & Monitoring

Deploy to production, set up continuous monitoring, and establish human-in-the-loop oversight.

Duration: Ongoing

Ready to Transform Your Knowledge Work?

Unlock the full potential of Agentic RAG with a tailored strategy session. Our experts will guide you through the process, from initial assessment to full-scale deployment.

Ready to Get Started?

Book Your Free Consultation.

Let's Discuss Your AI Strategy!

Lets Discuss Your Needs


AI Consultation Booking