Skip to main content
Enterprise AI Analysis: Mozi: Governed Autonomy for Drug Discovery LLM Agents

Breakthrough in AI-Driven Drug Discovery

Mozi: Governed Autonomy for Drug Discovery LLM Agents

Abstract: Tool-augmented large language model (LLM) agents promise to unify scientific reasoning with computation, yet their deployment in high-stakes domains like drug discovery is bottlenecked by two critical barriers: unconstrained tool-use governance and poor long-horizon reliability. In dependency-heavy pharmaceutical pipelines, autonomous agents often drift into irreproducible trajectories, where early-stage hallucinations multiplicatively compound into downstream failures. To overcome this, we present Mozi, a dual-layer architecture that bridges the flexibility of generative AI with the deterministic rigor of computational biology. Layer A (Control Plane) establishes a governed supervisor-worker hierarchy that enforces role-based tool isolation, limits execution to constrained action spaces, and drives reflection-based replanning. Layer B (Workflow Plane) operationalizes canonical drug discovery stages—from Target Identification to Lead Optimization—as stateful, composable skill graphs. This layer integrates strict data contracts and strategic human-in-the-loop (HITL) checkpoints to safeguard scientific validity at high-uncertainty decision boundaries. Operating on the design principle of "free-form reasoning for safe tasks, structured execution for long-horizon pipelines," Mozi provides built-in robustness mechanisms and trace-level audibility to completely mitigate error accumulation. We evaluate Mozi on PharmaBench, a curated benchmark for biomedical agents, demonstrating superior orchestration accuracy over existing baselines. Furthermore, through end-to-end therapeutic case studies, we demonstrate Mozi's ability to navigate massive chemical spaces, enforce stringent toxicity filters, and generate highly competitive in silico candidates, effectively transforming the LLM from a fragile conversationalist into a reliable, governed co-scientist.

Authors: He Cao, Siyu Liu, Fan Zhang, Zijing Liu, Hao Li, Bin Feng, Shengyuan Bai, Leqing Chen, Kai Xie, Yu Li (International Digital Economy Academy (IDEA))

Publication Date: March 4, 2026 (arXiv:2603.03655v1)

Executive Impact & Key Benefits

Mozi's novel dual-layer architecture delivers significant advancements in drug discovery, offering unprecedented reliability and efficiency for enterprise R&D.

0 Compounds Screened in 35 Mins
0 Higher HLE Drug Discovery Accuracy
0 Error Accumulation
0 Pristine Leads from Case Studies

Deep Analysis & Enterprise Applications

Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.

Mozi's Dual-Layer Architecture Flow

User Intent & Intent Analysis
Control Plane (Layer A)
Workflow Plane (Layer B)
Dynamic Iteration & Replanning
Structured Report & Output
Dual-Layer Architecture for Reliable Drug Discovery

Mozi bridges generative AI flexibility with computational biology rigor through its Control Plane (governance) and Workflow Plane (skill graphs), ensuring strict control and scientific validity in drug discovery.

Feature Mozi's Governed Autonomy Generic LLM Agents
Tool Use
  • Role-based Isolation, Hard-Coded Filters
  • Unconstrained & Prone to Hallucination
Reproducibility
  • Trace-level Audibility & Provenance
  • Low Reproducibility
Error Handling
  • Graceful Containment via Skill Graphs
  • Multiplicative Error Propagation
Decision Boundaries
  • HITL Checkpoints for High-Uncertainty
  • Unmanaged
Critical Validation Via Human-in-the-Loop Safeguards

Mozi integrates HITL gates at key decision boundaries, allowing expert scientists to approve/reject results, correct parameters, and prevent early-stage hallucinations, ensuring clinical tractability.

End-to-End Drug Discovery Workflow

Target Identification (TI)
Hit Identification (HI)
Hit-to-Lead (H2L)
Lead Optimization (LO)
Empirical Validation

Crohn's Disease: Targeted Pathway & Pristine Leads

Mozi identified ITGA4 as a key target and designed 49 novel compounds. It successfully generated 5 pristine lead candidates with a binding score of -9.0 kcal/mol and zero penalty scores, showcasing a near-perfect synthesizability profile. The system gracefully handled docking failures during the process without workflow interruption.

Parkinson's Disease: Multi-Objective Optimization for Safety

For LRRK2, Mozi screened 377,760 compounds in 35 minutes, identifying top hits. It then navigated severe hERG channel blockade and hepatotoxicity liabilities through multi-parameter optimization, discovering a novel scaffold with -8.924 kcal/mol binding affinity and excellent safety profiles, benchmarking against clinical compound DNL-201.

Sepsis: Robust Error Containment & Optimal Leads

Mozi prioritized ADRB2, found 19 binding pockets, and generated 48 de novo molecules. Crucially, the system gracefully handled 7 AutoDock Vina docking failures without crashing the workflow. It yielded 5 optimal lead candidates with excellent physicochemical properties, a strong binding affinity of -8.4 kcal/mol, QED of 0.944, and perfect synthetic accessibility.

Metric Mozi (Deepseek-V3.2/Qwen3-235B) Baseline (Biomni Qwen3-235B)
HLE Exact Match Accuracy
  • 21.42%
  • 17.86%
MCQ Accuracy
  • 6/26
  • 4/26
Classification Accuracy
  • 33/54
  • 20/54
Regression SMAPE
  • 1.169
  • 1.599
21.42% Higher HLE Drug Discovery Accuracy

Mozi (Deepseek-V3.2) achieved the highest exact-match accuracy on the challenging HLE Drug Discovery subset, demonstrating superior scientific reasoning and tool orchestration over general-purpose LLM agents.

Calculate Your Potential AI Impact

Estimate the efficiency gains and cost savings Mozi's governed autonomy could bring to your R&D pipeline.

Annual Cost Savings $0
Annual Hours Reclaimed 0

Your Journey to Governed AI Autonomy

Our structured roadmap ensures a seamless integration of Mozi into your existing drug discovery workflows, maximizing impact while minimizing disruption.

Phase 1: Discovery & Strategy Alignment

Comprehensive assessment of current R&D pipelines, identification of high-impact use cases for Mozi, and definition of success metrics. Collaborative workshop to tailor agent roles and tool access.

Phase 2: Platform Integration & Skill Graph Development

Deployment of Mozi's MCP Platform, integration with existing biomedical databases and computational tools. Development of custom skill graphs aligned with your specific therapeutic programs.

Phase 3: Pilot Program & Iterative Refinement

Execute pilot drug discovery projects with Mozi, gather feedback from domain experts, and refine governance rules and HITL checkpoints. Optimize agent performance and reliability.

Phase 4: Full-Scale Deployment & Continuous Optimization

Roll out Mozi across relevant R&D teams, provide ongoing support and training. Monitor performance, update skill graphs, and incorporate new scientific advancements to ensure sustained value.

Ready to Transform Your Drug Discovery?

Connect with our AI specialists to explore how Mozi can accelerate your R&D, enhance data integrity, and bring new therapeutics to market faster.

Ready to Get Started?

Book Your Free Consultation.

Let's Discuss Your AI Strategy!

Lets Discuss Your Needs


AI Consultation Booking