Enterprise AI Analysis
A Novel Multi-Agent Architecture to Reduce Hallucinations of Large Language Models in Multi-Step Structural Modeling
Large language models (LLMs) such as GPT and Gemini have demonstrated remarkable capabilities in contextual understanding and reasoning. The strong performance of LLMs has sparked growing interest in leveraging them to automate tasks traditionally dependent on human expertise. Recently, LLMs have been integrated into intelligent agents capable of operating structural analysis software (e.g., OpenSees) to construct structural models and perform analyses. However, existing LLMs are limited in handling multi-step structural modeling due to frequent hallucinations and error accumulation during long-sequence operations. To this end, this study presents a novel multi-agent architecture to automate the structural modeling and analysis using OpenSeesPy. First, problem analysis and construction planning agents extract key parameters from user descriptions and formulate a stepwise modeling plan. Node and element agents then operate in parallel to assemble the frame geometry, followed by a load assignment agent. The resulting geometric and load information is translated into executable OpenSeesPy scripts by code translation agents. The proposed architecture is evaluated on a benchmark of 20 frame problems over ten repeated trials, achieving 100% accuracy in 18 cases and 90% in the remaining two. The architecture also significantly improves computational efficiency and demonstrates scalability to larger structural systems.
Key Enterprise Impact
The integration of multi-agent LLMs significantly enhances efficiency and accuracy in structural modeling, reducing manual labor and the risk of errors in complex engineering tasks.
Deep Analysis & Enterprise Applications
Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.
Structural analysis is fundamental in civil engineering, but traditional finite element modeling is multi-step, manual, and labor-intensive. Large language models (LLMs) like GPT and Gemini show promise in automating these tasks. However, existing LLM agents struggle with multi-step structural modeling due to frequent hallucinations and error accumulation, especially in complex scenarios. This study addresses these limitations by proposing a novel multi-agent architecture for automated structural modeling and analysis using OpenSeesPy.
The novel multi-agent architecture decomposes the structural modeling workflow into four coordinated modules: analysis and planning, geometry assembly, load integration, and code translation. It features problem analysis and construction planning agents, parallel node and element agents, a load assignment agent, and code translation agents. Checkpoints are embedded for consistency checks and regeneration, mitigating error propagation. The architecture uses two lightweight LLM backbones: GPT-OSS 120B for complex reasoning and Llama-3.3 70B Instruct Turbo for information translation, leveraging their complementary strengths.
The proposed multi-agent architecture was evaluated on a benchmark of 20 frame problems, demonstrating strong reliability and robustness with 99% average accuracy (100% in 18 cases, 90% in two). It significantly outperforms sequential multi-agent architectures (91% accuracy) and general-purpose LLMs (e.g., Gemini 2.5 Pro at 37%, GPT 40 at 0%). The architecture also reduces inference runtime from 269.2-949.0 seconds to 75.4-194.2 seconds, an 85% reduction for complex cases, and shows excellent scalability to larger structural systems (7-bay, 10-bay frames) with high accuracy rates (90%-100%).
Despite its robust performance, the architecture has limitations. Prompt templates, while effective for selected LLMs, show sensitivity across different models, highlighting the need for automated prompt optimization. The current system is restricted to rectangular frames and lacks support for components like diagonal bracing or cantilevers. Additionally, the analysis scope is limited to linear elastic behavior under static loads, omitting dynamic effects (wind, seismic) or nonlinear analysis. Future work will focus on enhancing generalizability to complex typologies, supporting advanced analysis capabilities, and improving prompt robustness.
Enterprise Process Flow
| Feature | Proposed Multi-Agent Architecture | Sequential Multi-Agent Architecture |
|---|---|---|
| Accuracy (Avg) | 99% (Robust) | 91% (Degrades with complexity) |
| Efficiency (Runtime) | 75.4-194.2s (Significantly faster) | 269.2-949.0s |
| Scalability | High (Handles 7 & 10 bays) | Limited (API timeouts for large structures) |
| Error Handling | Checkpoints & Regeneration | Unidirectional Pipeline (Error accumulation) |
Robustness to Diverse Linguistic Styles
A pilot test with three architecture students confirmed the architecture's strong adaptation to diverse linguistic styles. Regardless of how the problem was described, the system consistently produced correct structural analysis results across ten repeated trials. This highlights the problem analysis agent's effectiveness in semantic reasoning and extracting key parameters into a standardized JSON format, lowering the entry barrier for non-expert users. This demonstrates the architecture's huge potential for broader adoption beyond engineering professionals.
| LLM | Avg. Accuracy |
|---|---|
| GPT-OSS 120B (Complex Reasoning) | N/A (Specialized use) |
| Llama-3.3 70B Instruct Turbo (Translation) | N/A (Specialized use) |
| GPT-powered (single LLM) | 90% (Unstable, 40% for complex cases) |
| Llama-powered (single LLM) | 79% (Fails on specific geometries) |
| Qwen-powered (single LLM) | 69% (Degrades below 50% for 5-bay frames) |
Calculate Your Potential ROI
Estimate the efficiency gains and cost savings your enterprise could achieve by integrating advanced AI solutions.
Your AI Implementation Roadmap
A structured approach to integrating AI, from initial assessment to ongoing optimization.
Phase 1: Discovery & Strategy
Comprehensive analysis of existing workflows, identification of AI opportunities, and development of a tailored implementation strategy.
Phase 2: Pilot & Development
Development of a proof-of-concept, iterative refinement based on feedback, and integration with existing systems.
Phase 3: Deployment & Training
Full-scale deployment of the AI solution, comprehensive training for your team, and establishment of monitoring protocols.
Phase 4: Optimization & Scaling
Continuous performance monitoring, iterative improvements, and strategic scaling across additional business units.
Ready to Transform Your Enterprise?
Schedule a personalized strategy session with our AI experts to explore how these advancements can specifically benefit your organization.