High-performance scene classification in remote sensing imagery using a custom deep CNN architecture
AI-Powered Remote Sensing for Enhanced Scene Classification
Executive Impact
This research introduces Pyramidal Net, a novel custom deep Convolutional Neural Network (CNN) architecture designed for multi-class image categorization in remote sensing data. Evaluated against popular pre-trained CNNs on the NWPU-RESISC45 and UC Merced Land Use datasets, the model achieves high accuracy (0.9428 on NWPU-RESISC45, 0.93 on UC Merced) with competitive recall, precision, IoU, and F1-scores, demonstrating robustness across diverse datasets. A key innovation is the integration of Shapley Additive Explanations (SHAP) and Class Activation Mapping (CAM) for enhanced interpretability and explainability, making predictions more transparent and trustworthy. Pyramidal Net's lightweight, efficient design balances high performance with lower computational demands, suiting real-time and resource-constrained remote sensing applications, and the study highlights its strong generalization across varied data scenarios.
Deep Analysis & Enterprise Applications
The research introduces 'Pyramidal Net', a novel deep CNN architecture designed for multi-class image categorization in remote sensing data. It features multiple convolutional layers interleaved with max-pooling, batch normalization, and fully connected layers. The design emphasizes modularity and efficiency, enabling deep hierarchical feature extraction while maintaining computational feasibility.
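As a rough illustration of this layer pattern, the minimal Keras sketch below stacks convolutional blocks with progressively wider filters, each interleaved with batch normalization and max-pooling, ahead of a fully connected classification head. The filter counts, kernel sizes, input resolution, and class count (45, matching NWPU-RESISC45) are illustrative assumptions, not the paper's exact configuration.

```python
# Minimal sketch of a pyramid-style CNN in Keras; layer widths, kernel sizes,
# and depth are illustrative assumptions, not the published configuration.
from tensorflow.keras import layers, models

def build_pyramidal_net(input_shape=(256, 256, 3), num_classes=45):
    model = models.Sequential([layers.Input(shape=input_shape)])
    # Convolutional blocks with progressively wider filters, each interleaved
    # with batch normalization and max-pooling for hierarchical features.
    for filters in (32, 64, 128, 256):
        model.add(layers.Conv2D(filters, 3, padding="same", activation="relu"))
        model.add(layers.BatchNormalization())
        model.add(layers.MaxPooling2D(pool_size=2))
    # Fully connected head for multi-class scene classification.
    model.add(layers.Flatten())
    model.add(layers.Dense(256, activation="relu"))
    model.add(layers.Dropout(0.5))
    model.add(layers.Dense(num_classes, activation="softmax"))
    return model

model = build_pyramidal_net()
model.summary()
```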
The study positions the proposed Pyramidal Net against standard CNNs (e.g., VGG, ResNet) along five dimensions: computational efficiency, interpretability, generalization, input size adaptability, and feature extraction.
The model was rigorously evaluated on the NWPU-RESISC45 and UC Merced datasets and compared against five popular pre-trained CNN models (VGG16, VGG19, MobileNet, ResNet50, Xception), demonstrating competitive overall performance. Accuracies of 0.9428 on NWPU-RESISC45 and 0.93 on UC Merced, together with strong recall, precision, IoU, and F1-scores, confirm its robustness, while its efficient training time and GPU memory usage underline its practical applicability. The NWPU-RESISC45 results are summarized in the table below, followed by a short sketch of how these metrics are computed.
| Model | Test Accuracy | Precision | Recall | F1-score | IoU | Training Time (s) |
|---|---|---|---|---|---|---|
| Proposed Model | 0.9428 | 0.95 | 0.94 | 0.94 | 0.89 | 3692 |
| Xception | 0.948 | 0.95 | 0.95 | 0.95 | 0.900 | 4749.9 |
| VGG16 | 0.949 | 0.95 | 0.95 | 0.95 | 0.902 | 2303.5 |
| ResNet50 | 0.865 | 0.87 | 0.86 | 0.87 | 0.762 | 3454.5 |
| VGG19 | 0.858 | 0.86 | 0.86 | 0.86 | 0.751 | 2360.1 |
| MobileNet | 0.810 | 0.82 | 0.81 | 0.81 | 0.680 | 1995.6 |
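For reference, the metrics reported above can be reproduced from per-image predictions with standard tooling. The sketch below uses scikit-learn with macro averaging and placeholder label arrays; the exact averaging scheme used in the study is an assumption here, and the Jaccard score is the per-class IoU.

```python
# Minimal sketch of the reported metrics with scikit-learn; y_true / y_pred
# are placeholder arrays and macro averaging is an assumed choice.
import numpy as np
from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, jaccard_score)

y_true = np.array([0, 1, 2, 2, 1, 0])   # ground-truth class indices (placeholder)
y_pred = np.array([0, 1, 2, 1, 1, 0])   # model predictions (placeholder)

print("Accuracy :", accuracy_score(y_true, y_pred))
print("Precision:", precision_score(y_true, y_pred, average="macro"))
print("Recall   :", recall_score(y_true, y_pred, average="macro"))
print("F1-score :", f1_score(y_true, y_pred, average="macro"))
print("IoU      :", jaccard_score(y_true, y_pred, average="macro"))  # Jaccard == IoU
```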
A core novelty is the integration of Shapley Additive Explanations (SHAP) for global feature attribution and Class Activation Mapping (CAM) for local visual explanations. This combined framework enhances model transparency and trustworthiness. A pixel-wise overlap of 85% between SHAP and Grad-CAM outputs confirms the consistency of the interpretability framework.
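A figure like the reported 85% pixel-wise overlap can be obtained by thresholding the two attribution maps and measuring their agreement. The sketch below is one plausible recipe, assuming the SHAP and Grad-CAM outputs are available as arrays of the same spatial size; the top-20% threshold and the IoU-style agreement measure are illustrative choices rather than the paper's exact procedure.

```python
# Sketch of a pixel-wise agreement check between SHAP and Grad-CAM maps.
# The top-20% threshold and IoU-style overlap are assumed choices.
import numpy as np

def pixelwise_overlap(shap_map, cam_map, top_fraction=0.2):
    """Overlap of the most strongly attributed pixels in the two maps."""
    def top_mask(attr):
        attr = np.abs(attr)
        cutoff = np.quantile(attr, 1.0 - top_fraction)
        return attr >= cutoff
    shap_mask, cam_mask = top_mask(shap_map), top_mask(cam_map)
    intersection = np.logical_and(shap_mask, cam_mask).sum()
    union = np.logical_or(shap_mask, cam_mask).sum()
    return intersection / union if union else 0.0

# Example with random placeholder maps of the same spatial size.
rng = np.random.default_rng(0)
print(f"{pixelwise_overlap(rng.random((256, 256)), rng.random((256, 256))):.2%}")
```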
SHAP/CAM in Action: Airplane Class
For the Airplane class, SHAP values prioritized runway length and straight edges, aligning with human-interpretable features. Grad-CAM maps spatially highlighted these long, linear regions, confirming the model's reliance on meaningful structural cues for accurate classification.
Impact: This integration ensures predictions are grounded in understandable visual evidence, critical for high-stakes remote sensing applications like urban development and disaster response.
Projected Efficiency Gains for Your Enterprise
Estimate the potential operational efficiency and cost savings by integrating advanced AI for remote sensing image analysis into your business. Adjust parameters to reflect your team's size, typical manual analysis hours, and average hourly rates.
Automated image analysis significantly reduces manual review time across all industries, with specific efficiency gains varying by sector due to different image complexities and data volumes.
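The calculation behind such an estimate reduces to simple arithmetic over headcount, hours, and rates. The sketch below is a back-of-the-envelope illustration; the efficiency_gain factor and the example inputs are placeholder assumptions, not figures from the study.

```python
# Back-of-the-envelope savings estimator mirroring the calculator described
# above; efficiency_gain is a placeholder assumption, not a study result.
def estimate_annual_savings(analysts, manual_hours_per_week, hourly_rate,
                            efficiency_gain=0.5, weeks_per_year=48):
    manual_cost = analysts * manual_hours_per_week * hourly_rate * weeks_per_year
    return manual_cost * efficiency_gain

# Example: 5 analysts, 20 manual review hours per week, $60/hour.
print(estimate_annual_savings(analysts=5, manual_hours_per_week=20, hourly_rate=60.0))
```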
Your AI Implementation Roadmap
A strategic phased approach to integrate high-performance remote sensing AI into your operations for maximum impact and minimal disruption.
Phase 1: Initial Assessment & Data Preparation
Conduct a detailed analysis of current remote sensing data workflows, identify key datasets, and establish data preparation pipelines (resizing, augmentation, normalization). Define classification objectives specific to your enterprise needs.
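A preparation pipeline along these lines could look like the sketch below, built with tf.keras utilities; the directory layout, target image size, and augmentation settings are illustrative assumptions rather than the study's exact preprocessing.

```python
# Sketch of a Phase 1 preparation pipeline; paths, image size, and
# augmentation settings are illustrative assumptions.
import tensorflow as tf

def make_dataset(image_dir, image_size=(256, 256), batch_size=32, training=True):
    ds = tf.keras.utils.image_dataset_from_directory(
        image_dir, image_size=image_size, batch_size=batch_size,
        label_mode="categorical", shuffle=training)
    normalize = tf.keras.layers.Rescaling(1.0 / 255)   # pixel normalization
    augment = tf.keras.Sequential([                    # light augmentation
        tf.keras.layers.RandomFlip("horizontal"),
        tf.keras.layers.RandomRotation(0.1),
    ])
    ds = ds.map(lambda x, y: (normalize(x), y))
    if training:
        ds = ds.map(lambda x, y: (augment(x, training=True), y))
    return ds.prefetch(tf.data.AUTOTUNE)
```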
Phase 2: Model Customization & Training
Customize the Pyramidal Net architecture for your specific dataset characteristics. Train the model using optimized hyperparameters and evaluate initial performance. Integrate SHAP/CAM for early interpretability checks.
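Building on the architecture and data-pipeline sketches above, a Phase 2 training step might look like the following; the learning rate, epoch count, and callbacks are placeholder hyperparameters rather than the study's tuned values, and the data directories are hypothetical.

```python
# Phase 2 training sketch reusing build_pyramidal_net() and make_dataset()
# from the earlier sketches; all hyperparameters here are placeholders.
from tensorflow.keras import callbacks, optimizers

model = build_pyramidal_net(num_classes=45)
model.compile(optimizer=optimizers.Adam(learning_rate=1e-3),
              loss="categorical_crossentropy",
              metrics=["accuracy"])

train_ds = make_dataset("data/train")              # placeholder directory
val_ds = make_dataset("data/val", training=False)  # placeholder directory

model.fit(train_ds, validation_data=val_ds, epochs=50,
          callbacks=[
              callbacks.EarlyStopping(patience=5, restore_best_weights=True),
              callbacks.ModelCheckpoint("pyramidal_net.keras", save_best_only=True),
          ])
```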
Phase 3: Integration & Validation
Deploy the trained model into a testing environment. Validate performance against real-world data, focusing on accuracy, efficiency, and generalization. Fine-tune parameters based on feedback and interpretability insights. Ensure seamless integration with existing systems.
Phase 4: Scaling & Continuous Improvement
Scale the solution for production use across relevant departments. Establish monitoring protocols for model performance and data drift. Implement a feedback loop for continuous improvement, retraining the model with new data as needed to adapt to evolving environmental conditions or business requirements.
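One lightweight way to monitor for data drift in this phase is to compare the distribution of predicted classes in a recent production window against a reference window. The chi-square check sketched below is an assumed monitoring recipe, not a procedure from the original study.

```python
# Assumed drift check: compare recent prediction class counts against a
# reference window with a chi-square test; not part of the original study.
import numpy as np
from scipy.stats import chisquare

def class_distribution_drift(reference_preds, recent_preds, num_classes=45):
    ref = np.bincount(reference_preds, minlength=num_classes).astype(float)
    cur = np.bincount(recent_preds, minlength=num_classes).astype(float)
    # Laplace smoothing avoids zero expected counts, then rescale so the
    # expected and observed totals match.
    expected = (ref + 1.0) / (ref + 1.0).sum() * cur.sum()
    _, p_value = chisquare(f_obs=cur, f_exp=expected)
    return p_value  # a small p-value suggests the prediction mix has shifted

rng = np.random.default_rng(1)
p = class_distribution_drift(rng.integers(0, 45, 5000), rng.integers(0, 45, 1000))
print(f"Drift p-value: {p:.3f}")
```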
Ready to Transform Your Operations?
Connect with our AI specialists to discuss how custom deep learning solutions can revolutionize your remote sensing image analysis.