Enterprise AI Analysis
Explainable Hybrid AI CAD Framework for Advanced Prediction of Steel Surface Defects
This study introduces a novel explainable hybrid AI CAD framework designed to overcome the limitations of traditional single-stage defect detectors in steel surface inspection. By decoupling localization and classification into two distinct stages, the framework improves both detection and classification accuracy. The detection stage uses Fusion YOLO, which integrates a custom DCBS-YOLO with state-of-the-art YOLO models and is enhanced with attention mechanisms and background suppression for precise localization. The classification stage employs a hybrid ensemble of CNNs and Vision Transformers to capture both local and global features, significantly reducing misclassification. The entire pipeline is optimized via MLOps-based auto hyperparameter tuning and provides explainability through Grad-CAM. The framework achieves 83.8% AP for detection and a 99.7% F1-score for classification on NEU-DET, and generalizes well to GC10-DET (71.5% mAP, 94.8% F1-score), confirming its robustness for reliable industrial inspection.
Executive Impact at a Glance
Our explainable hybrid AI CAD framework delivers unparalleled precision and reliability in steel surface defect detection, setting new benchmarks for industrial quality control.
Deep Analysis & Enterprise Applications
Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.
Novel Explainable Hybrid AI CAD Framework
This framework innovates by decoupling the complex tasks of defect localization and classification into a two-stage process. The first stage, powered by Fusion YOLO, focuses solely on precise defect region detection, employing a binary detection approach to maximize recall. The subsequent classification stage utilizes a hybrid ensemble of CNNs and Vision Transformers (ViT) to accurately identify defect types. This separation minimizes the trade-off inherent in single-stage detectors, leading to superior overall performance. Critically, Grad-CAM is integrated for explainability, providing visual insights into the model's decision-making process.
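The two-stage flow can be sketched as follows. This is a minimal illustration of the decoupling idea, not the authors' implementation: `detect` stands in for Fusion YOLO's class-agnostic localizer and `classify` for the CNN+ViT ensemble, and both function names are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

@dataclass
class Detection:
    box: Tuple[int, int, int, int]   # (x1, y1, x2, y2)
    score: float
    label: str = "defect"            # stage 1 is class-agnostic ("binary")

def two_stage_cad(image,
                  detect: Callable[[object], List[Detection]],
                  classify: Callable[[object], str]) -> List[Detection]:
    """Stage 1 localizes candidate defect regions; stage 2 assigns a type
    to each cropped region. Decoupling lets each stage be tuned separately:
    the detector can maximize recall while the classifier maximizes precision."""
    results = []
    for det in detect(image):
        x1, y1, x2, y2 = det.box
        crop = [row[x1:x2] for row in image[y1:y2]]   # region of interest
        results.append(Detection(det.box, det.score, classify(crop)))
    return results
```

Because the detector never has to discriminate between defect types, its recall is not penalized by class confusion; the classifier then operates on tight crops where local texture dominates.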
Fusion YOLO: Optimized Defect Localization
Our detection stage introduces Fusion YOLO, which combines several top-performing YOLO models (DCBS-YOLO, YOLOv9c, YOLOv8s) to enhance robustness and reduce missed detections. DCBS-YOLO is a custom-designed model integrating DCNv3 for adaptive receptive fields, SimAM for parameter-free attention, and a Background Suppression Module (BSM) to handle low-contrast defects. By performing class-agnostic binary detection and employing Weighted Boxes Fusion (WBF), Fusion YOLO achieves more precise and reliable bounding box predictions than single models, reaching 83.8% AP on NEU-DET.
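Of the components listed above, SimAM is simple enough to illustrate directly. The sketch below is a plain-NumPy rendering of SimAM's parameter-free energy function (activations that deviate from their channel mean receive higher attention); the DCBS-YOLO integration itself is not shown, and the regularizer value is an illustrative default.

```python
import numpy as np

def simam(x: np.ndarray, lam: float = 1e-4) -> np.ndarray:
    """Parameter-free SimAM attention over a (C, H, W) feature map.
    Each activation t gets weight sigmoid(1/e_t), where the energy e_t is
    low (so 1/e_t is high) when t stands out from its channel's mean."""
    c, h, w = x.shape
    n = h * w - 1
    mu = x.mean(axis=(1, 2), keepdims=True)
    d = (x - mu) ** 2                                 # squared deviation
    var = d.sum(axis=(1, 2), keepdims=True) / n       # channel variance
    e_inv = d / (4 * (var + lam)) + 0.5               # inverse energy
    return x * (1.0 / (1.0 + np.exp(-e_inv)))         # x * sigmoid(1/e)
```

Because the weighting is derived analytically from channel statistics, SimAM adds attention without any learnable parameters, which keeps the detector's size and latency unchanged.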
Robust Defect Classification with CNN+ViT
The classification stage employs a hybrid ensemble model combining CNNs (ResNet101, EfficientNetB5, ResNeXt101_32×8d) and Vision Transformers (ViT). This architecture is designed to leverage the strengths of both: CNNs for capturing local texture features and ViT for modeling global dependencies through self-attention. This fusion strategy is particularly effective in distinguishing between visually similar defect types, which is a common challenge in steel surface inspection. The result is a highly accurate classifier, achieving an outstanding 99.7% F1-score on NEU-DET.
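A common way to realize such an ensemble is soft voting over each backbone's softmax outputs. The sketch below shows this fusion step under that assumption; the paper's exact fusion rule and any per-model weights are not specified here, so the uniform default is illustrative.

```python
import numpy as np

def ensemble_predict(prob_list, weights=None):
    """Soft-voting fusion: average (optionally weighted) per-model class
    probabilities, then take the argmax. prob_list holds one
    (num_samples, num_classes) softmax array per backbone
    (e.g. ResNet101, EfficientNetB5, ResNeXt, ViT)."""
    probs = np.stack(prob_list)                    # (models, samples, classes)
    if weights is None:
        weights = np.ones(len(prob_list))
    weights = np.asarray(weights, dtype=float)
    weights /= weights.sum()                       # normalize to a convex mix
    fused = np.tensordot(weights, probs, axes=1)   # weighted mean over models
    return fused.argmax(axis=1), fused
```

Averaging probabilities (rather than hard votes) lets a confident ViT prediction on a globally structured defect outvote an uncertain CNN, and vice versa for fine local textures.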
MLOps-Driven Auto Hyperparameter Tuning
To ensure optimal performance and robustness, our entire framework, including both detection and classification models, is fine-tuned using an MLOps-based auto hyperparameter tuning framework (Weights & Biases). This systematic approach explores various combinations of parameters (batch size, epochs, learning rate, optimizers, weight decay) using Bayesian optimization. This automation identifies the best-performing models and configurations (e.g., YOLOv9c for detection, EfficientNetB5, ResNet101, ResNeXt101_32×8d for classification), significantly reducing manual effort and achieving superior and consistent results across different datasets.
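A Weights & Biases sweep of this kind is declared as a configuration dict. The sketch below mirrors the search space named above; the metric name, project name, and value ranges are illustrative assumptions, not the paper's actual settings.

```python
# Hypothetical W&B sweep configuration; metric name and ranges are
# assumptions chosen to mirror the search space described in the text.
sweep_config = {
    "method": "bayes",                     # Bayesian optimization
    "metric": {"name": "val_f1", "goal": "maximize"},
    "parameters": {
        "batch_size":    {"values": [16, 32, 64]},
        "epochs":        {"values": [50, 100, 150]},
        "learning_rate": {"distribution": "log_uniform_values",
                          "min": 1e-5, "max": 1e-2},
        "optimizer":     {"values": ["adam", "sgd", "adamw"]},
        "weight_decay":  {"distribution": "uniform", "min": 0.0, "max": 1e-3},
    },
}
# Launched with, e.g.:
#   sweep_id = wandb.sweep(sweep_config, project="steel-cad")
#   wandb.agent(sweep_id, function=train)
```

Bayesian search concentrates trials near promising regions of this space, which is why it typically finds strong configurations in far fewer runs than grid search.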
Enhanced Defect Visibility through Preprocessing
A dedicated preprocessing module is integrated at the initial stage to enhance defect visibility and contrast while preserving overall image quality. This module transforms RGB images to LAB color space, applies Gaussian and Median filters for denoising, and utilizes CLAHE (Contrast Limited Adaptive Histogram Equalization) and Gamma Correction for contrast and brightness enhancement. A sharpening filter and morphological operations further accentuate defect boundaries. This step is crucial for improving the detectability of subtle and low-contrast defects, leading to consistent performance gains in both the detection and classification stages, as evidenced by improved SSIM, PSNR, and SNR metrics.
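The full module relies on OpenCV for the LAB conversion and CLAHE; as a self-contained illustration, the sketch below implements just two of the listed steps, gamma correction and unsharp-mask sharpening, in plain NumPy with illustrative default parameters.

```python
import numpy as np

def gamma_correct(img: np.ndarray, gamma: float = 0.8) -> np.ndarray:
    """Brighten (gamma < 1) or darken (gamma > 1) an 8-bit grayscale image
    via a lookup table, as in the framework's brightness-enhancement step."""
    lut = ((np.arange(256) / 255.0) ** gamma * 255.0).astype(np.uint8)
    return lut[img]

def unsharp_mask(img: np.ndarray, amount: float = 1.0) -> np.ndarray:
    """Sharpen by adding back the difference between the image and a
    3x3 box blur, accentuating defect boundaries."""
    f = img.astype(np.float32)
    pad = np.pad(f, 1, mode="edge")
    blur = sum(pad[dy:dy + f.shape[0], dx:dx + f.shape[1]]
               for dy in range(3) for dx in range(3)) / 9.0
    return np.clip(f + amount * (f - blur), 0, 255).astype(np.uint8)
```

In the actual pipeline these operations sit alongside Gaussian/Median denoising and CLAHE, and their quality impact is what the SSIM, PSNR, and SNR metrics quantify.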
XAI: Understanding Model Decisions with Grad-CAM
To foster trust and provide actionable insights, our framework integrates Explainable AI (XAI) techniques, specifically Grad-CAM (Gradient-weighted Class Activation Mapping). Grad-CAM generates heatmaps that highlight the regions in the input image most influential in the model's predictions. This allows users to visually verify the model's focus, understand why certain defects might be missed (e.g., Crazing with low detection rates), and identify areas of improvement. It serves as a critical tool for both validating successful detections and analyzing undetected defect areas, making the CAD system transparent and interpretable.
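The Grad-CAM computation itself reduces to a weighted sum of convolutional feature maps. In practice the activations and gradients come from framework hooks (e.g., PyTorch backward hooks); the NumPy sketch below shows only the math, assuming those tensors have already been captured.

```python
import numpy as np

def grad_cam(activations: np.ndarray, gradients: np.ndarray) -> np.ndarray:
    """Grad-CAM heatmap from a conv layer's forward activations (C, H, W)
    and the gradients of the target class score w.r.t. those activations.
    alpha_k = spatial mean of channel k's gradients;
    heatmap  = ReLU(sum_k alpha_k * A_k), normalized to [0, 1]."""
    alphas = gradients.mean(axis=(1, 2))                      # (C,)
    cam = np.maximum(np.tensordot(alphas, activations, axes=1), 0.0)
    return cam / cam.max() if cam.max() > 0 else cam
```

The resulting low-resolution map is upsampled to the input size and overlaid on the image, which is how the framework visualizes both successful detections and missed defects such as Crazing.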
Impact of Preprocessing on Key Metrics
| Method | Metric | Without Preprocessing | With Preprocessing |
|---|---|---|---|
| CAD Framework (Detection & Classification) | mAP@0.5 | 80.8% | 82.3% |
| Fusion YOLO (Binary Detection) | AP@0.5 | 82.1% | 83.8% |
| Ensemble CNN + ViT (Classification) | F1-Score | 99.3% | 99.7% |
Optimizing Detection for Challenging Defects with Fusion YOLO
The paper highlights the inherent difficulty in detecting defects like Crazing and Rolled-in Scale due to their irregular shapes, low contrast, and similarity to background noise. Fusion YOLO addresses this by combining multiple optimized binary detection models, specifically DCBS-YOLO, YOLOv9c, and YOLOv8s, using Weighted Boxes Fusion (WBF). This strategy ensures that even subtle defects missed by a single model are captured. For instance, in the Crazing class, Fusion YOLO successfully combines partial detections from individual models, leading to a more complete and accurate bounding box than any single model could provide. This ensemble approach also effectively handles false positives by down-weighting low-scoring detections, ensuring a robust and reliable defect localization across diverse and challenging scenarios.
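The WBF step described above can be sketched in a few lines. This is a simplified greedy variant for illustration, not the authors' or the `ensemble_boxes` library's exact algorithm: boxes that overlap above an IoU threshold are merged into a score-weighted average instead of being suppressed, so partial detections from different models reinforce one another.

```python
import numpy as np

def iou(a, b):
    """Intersection-over-union of two (x1, y1, x2, y2) boxes."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area = lambda r: (r[2] - r[0]) * (r[3] - r[1])
    return inter / (area(a) + area(b) - inter + 1e-9)

def weighted_boxes_fusion(boxes, scores, iou_thr=0.55):
    """Greedy WBF sketch: group boxes overlapping above iou_thr, then
    replace each group with the score-weighted average of its coordinates.
    Unlike NMS, overlapping boxes contribute rather than being discarded,
    and averaging scores down-weights boxes few models agreed on."""
    order = np.argsort(scores)[::-1]          # highest-confidence first
    clusters = []
    for i in order:
        for cl in clusters:
            if iou(boxes[cl[0]], boxes[i]) >= iou_thr:
                cl.append(i)
                break
        else:
            clusters.append([i])
    fused_boxes, fused_scores = [], []
    for cl in clusters:
        w = np.array([scores[i] for i in cl])
        b = np.array([boxes[i] for i in cl])
        fused_boxes.append((w[:, None] * b).sum(0) / w.sum())
        fused_scores.append(w.mean())
    return np.array(fused_boxes), np.array(fused_scores)
```

For a fragmented defect like Crazing, two models' partial boxes that overlap sufficiently are fused into one box biased toward the higher-confidence prediction, which is how the ensemble recovers a more complete localization than any single detector.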
Calculate Your Potential ROI
Estimate the efficiency gains and cost savings your enterprise could achieve by integrating our AI solutions for defect detection.
Your AI Implementation Roadmap
A typical deployment journey with OwnYourAI, from initial strategy to full-scale operationalization and continuous improvement.
Phase 1: Discovery & Strategy
In-depth analysis of your current defect detection processes, data infrastructure, and business objectives to tailor a bespoke AI strategy.
Phase 2: Pilot & Customization
Deployment of a pilot AI CAD system on a representative dataset, including custom model training and integration with existing quality control systems.
Phase 3: Full-Scale Integration
Seamless integration of the AI CAD framework into your full production environment, comprehensive team training, and establishing monitoring protocols.
Phase 4: Optimization & Scaling
Continuous performance monitoring, iterative model optimization, and scaling the solution across different production lines or defect types.
Ready to Transform Your Quality Control?
Schedule a personalized consultation with our AI experts to explore how an explainable hybrid AI CAD framework can revolutionize your industrial inspection processes.