Skip to main content
Enterprise AI Analysis: A Novel Multi-Agent Architecture to Reduce Hallucinations of Large Language Models in Multi-Step Structural Modeling

Enterprise AI Analysis

A Novel Multi-Agent Architecture to Reduce Hallucinations of Large Language Models in Multi-Step Structural Modeling

Large language models (LLMs) such as GPT and Gemini have demonstrated remarkable capabilities in contextual understanding and reasoning. The strong performance of LLMs has sparked growing interest in leveraging them to automate tasks traditionally dependent on human expertise. Recently, LLMs have been integrated into intelligent agents capable of operating structural analysis software (e.g., OpenSees) to construct structural models and perform analyses. However, existing LLMs are limited in handling multi-step structural modeling due to frequent hallucinations and error accumulation during long-sequence operations. To this end, this study presents a novel multi-agent architecture to automate the structural modeling and analysis using OpenSeesPy. First, problem analysis and construction planning agents extract key parameters from user descriptions and formulate a stepwise modeling plan. Node and element agents then operate in parallel to assemble the frame geometry, followed by a load assignment agent. The resulting geometric and load information is translated into executable OpenSeesPy scripts by code translation agents. The proposed architecture is evaluated on a benchmark of 20 frame problems over ten repeated trials, achieving 100% accuracy in 18 cases and 90% in the remaining two. The architecture also significantly improves computational efficiency and demonstrates scalability to larger structural systems.

Key Enterprise Impact

The integration of multi-agent LLMs significantly enhances efficiency and accuracy in structural modeling, reducing manual labor and the risk of errors in complex engineering tasks.

0% Average Accuracy Rate
0% Runtime Reduction for Complex Frames
0 Avg. Cost per Analysis

Deep Analysis & Enterprise Applications

Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.

Structural analysis is fundamental in civil engineering, but traditional finite element modeling is multi-step, manual, and labor-intensive. Large language models (LLMs) like GPT and Gemini show promise in automating these tasks. However, existing LLM agents struggle with multi-step structural modeling due to frequent hallucinations and error accumulation, especially in complex scenarios. This study addresses these limitations by proposing a novel multi-agent architecture for automated structural modeling and analysis using OpenSeesPy.

The novel multi-agent architecture decomposes the structural modeling workflow into four coordinated modules: analysis and planning, geometry assembly, load integration, and code translation. It features problem analysis and construction planning agents, parallel node and element agents, a load assignment agent, and code translation agents. Checkpoints are embedded for consistency checks and regeneration, mitigating error propagation. The architecture uses two lightweight LLM backbones: GPT-OSS 120B for complex reasoning and Llama-3.3 70B Instruct Turbo for information translation, leveraging their complementary strengths.

The proposed multi-agent architecture was evaluated on a benchmark of 20 frame problems, demonstrating strong reliability and robustness with 99% average accuracy (100% in 18 cases, 90% in two). It significantly outperforms sequential multi-agent architectures (91% accuracy) and general-purpose LLMs (e.g., Gemini 2.5 Pro at 37%, GPT 40 at 0%). The architecture also reduces inference runtime from 269.2-949.0 seconds to 75.4-194.2 seconds, an 85% reduction for complex cases, and shows excellent scalability to larger structural systems (7-bay, 10-bay frames) with high accuracy rates (90%-100%).

Despite its robust performance, the architecture has limitations. Prompt templates, while effective for selected LLMs, show sensitivity across different models, highlighting the need for automated prompt optimization. The current system is restricted to rectangular frames and lacks support for components like diagonal bracing or cantilevers. Additionally, the analysis scope is limited to linear elastic behavior under static loads, omitting dynamic effects (wind, seismic) or nonlinear analysis. Future work will focus on enhancing generalizability to complex typologies, supporting advanced analysis capabilities, and improving prompt robustness.

99% Average Accuracy across 20 Frame Problems

Enterprise Process Flow

Problem Analysis
Construction Planning
Geometry Assembly (Parallel Nodes & Elements)
Load Integration
Code Translation (Geometry & Complete Code)

Architecture Comparison: Proposed vs. Sequential

Feature Proposed Multi-Agent Architecture Sequential Multi-Agent Architecture
Accuracy (Avg) 99% (Robust) 91% (Degrades with complexity)
Efficiency (Runtime) 75.4-194.2s (Significantly faster) 269.2-949.0s
Scalability High (Handles 7 & 10 bays) Limited (API timeouts for large structures)
Error Handling Checkpoints & Regeneration Unidirectional Pipeline (Error accumulation)
85% Reduction in Inference Time for Complex Frames

Robustness to Diverse Linguistic Styles

A pilot test with three architecture students confirmed the architecture's strong adaptation to diverse linguistic styles. Regardless of how the problem was described, the system consistently produced correct structural analysis results across ten repeated trials. This highlights the problem analysis agent's effectiveness in semantic reasoning and extracting key parameters into a standardized JSON format, lowering the entry barrier for non-expert users. This demonstrates the architecture's huge potential for broader adoption beyond engineering professionals.

Backbone LLM Performance Comparison (Avg Accuracy)

LLM Avg. Accuracy
GPT-OSS 120B (Complex Reasoning) N/A (Specialized use)
Llama-3.3 70B Instruct Turbo (Translation) N/A (Specialized use)
GPT-powered (single LLM) 90% (Unstable, 40% for complex cases)
Llama-powered (single LLM) 79% (Fails on specific geometries)
Qwen-powered (single LLM) 69% (Degrades below 50% for 5-bay frames)

Calculate Your Potential ROI

Estimate the efficiency gains and cost savings your enterprise could achieve by integrating advanced AI solutions.

Annual Savings $0
Hours Reclaimed Annually 0

Your AI Implementation Roadmap

A structured approach to integrating AI, from initial assessment to ongoing optimization.

Phase 1: Discovery & Strategy

Comprehensive analysis of existing workflows, identification of AI opportunities, and development of a tailored implementation strategy.

Phase 2: Pilot & Development

Development of a proof-of-concept, iterative refinement based on feedback, and integration with existing systems.

Phase 3: Deployment & Training

Full-scale deployment of the AI solution, comprehensive training for your team, and establishment of monitoring protocols.

Phase 4: Optimization & Scaling

Continuous performance monitoring, iterative improvements, and strategic scaling across additional business units.

Ready to Transform Your Enterprise?

Schedule a personalized strategy session with our AI experts to explore how these advancements can specifically benefit your organization.

Ready to Get Started?

Book Your Free Consultation.

Let's Discuss Your AI Strategy!

Lets Discuss Your Needs


AI Consultation Booking