Enterprise AI Analysis: Geoscience & Energy Resources

Machine learning prediction of compressional slowness in fractured carbonates: balancing data volume and incremental log features

Slowness is the reciprocal of velocity and known as a standard parameter recorded in sonic logs. Knowledge of compressional (P-wave) slowness (DTC) is vital for structural, geomechanical, and petrophysical analyses of subsurface formations. This study evaluates five machine learning (ML) models, Linear Regression (LR), Decision Tree (DecTr), Random Forest (RF), K-Nearest Neighbors (KNN), and Support Vector Machine (SVM), to predict DTC in two fractured carbonate reservoirs with differing fracture intensities, referred to as Formation-A (more fractured) and Formation-B (less fractured). A dataset encompassing four wells in each formation was divided into ten incremental sets of petrophysical logs (e.g., GR, RHOB, MSFL, LLS, LLD, RT, PEF, CALD, and CALM). Model accuracy was measured using R2 and RMSE across both training and test phases under 1-, 2-, 3-, and 4-Well(s) scenarios. In the training phase, RF consistently attained the highest R2 values, up to 0.94 (RMSE≈ 0.26) in Formation-A and 0.92 (RMSE≈0.27) in Formation-B, followed closely by KNN and SVM. In contrast, LR and DecTr showed poor performance in both test and training phases for both formations. Formation-B showed a more stable test performance, often yielding R2 of 0.60-0.75 for advanced models. Although data augmentation through more wells and logs typically improved training scores, the best test metrics did not always coincide with the largest feature set (Set #10). Intermediate sets (e.g., Set #5 or #8) sometimes produced stronger generalization, underscoring a trade-off between model complexity and overfitting risk. RHOB, GR and MSFL emerged as consistently pivotal logs, although deeper resistivity and caliper data also enhanced accuracy under certain conditions. Future research should consider hybrid or ensemble ML methods, data augmentation for underrepresented depth intervals, and the incorporation of seismic attributes to refine sonic predictions in fractured carbonates.

By Hamed Aghaei

Schedule Your Strategy Session

Executive Impact: At a Glance

Key metrics highlighting the groundbreaking contributions and enterprise relevance of this research.

0.75R2 Average Test R2 for Advanced Models

+0.30R2 Increase From Linear to Ensemble Models

25,000 Data Points Max Training Set Size

3 Logs Consistently Pivotal Logs Identified

Discuss Your Implementation

Deep Analysis & Enterprise Applications

Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.

Machine Learning Models

This study evaluated five ML models: Linear Regression (LR), Decision Tree (DecTr), Random Forest (RF), K-Nearest Neighbors (KNN), and Support Vector Machine (SVM). RF consistently achieved the highest R2 values in the training phase (up to 0.94 in Formation-A and 0.92 in Formation-B), followed by KNN and SVM. LR and DecTr showed poor performance in both test and training phases.

Data Volume and Features

The dataset included four wells per formation, divided into ten incremental sets of petrophysical logs. Increasing data volume and features generally improved training scores. However, the best test metrics did not always align with the largest feature set, highlighting a trade-off between model complexity and overfitting risk. Intermediate sets (e.g., Set #5 or #8) sometimes yielded stronger generalization.

Fracture Intensity Impact

The study compared two fractured carbonate reservoirs: Formation-A (more fractured) and Formation-B (less fractured). Formation-B generally exhibited more stable test performance, often yielding R2 of 0.60-0.75 for advanced models, suggesting that less fractured environments allow for more consistent predictions.

Key Log Features

RHOB (Bulk Density), GR (Gamma Ray), and MSFL (Micro Spherically Focused Log) emerged as consistently pivotal logs across both formations, significantly enhancing model accuracy. Deeper resistivity (LLS, LLD) and caliper data (CALD, CALM) also provided incremental gains under specific conditions.

RF Model Dominance in Training

0.94R2 Highest R2 achieved by Random Forest in Formation-A during training, showcasing its robustness.

Optimized Feature Progression for DTC Prediction

Start with NPHI (Set #1)

→

Add RHOB (Set #2) for significant boost

→

Introduce GR, RT (Set #4) for further refinement

→

Incorporate MSFL (Set #5) for micro-resistivity insights

→

Include LLD, LLS (Set #7) for deeper resistivity

→

Finalize with PEF, CALM, CALD (Set #10) for comprehensive dataset

Performance Comparison: Fractured Carbonates

Comparison Point	Feature	Current Study Findings	Previous Studies/Literature
Formation-A (More Fractured)	Best Test R2	0.17 (RF/SVM, Set #2/3/9) Highest R2 is lower and more variable due to heterogeneity	Typically higher R2 (0.85-0.95) in less fractured zones Limited direct comparisons in highly fractured settings
Formation-B (Less Fractured)	Best Test R2	0.75 (SVM/LR, Set #7/8) More stable and consistently higher R2 than Formation-A	Similar R2 (0.60-0.75) for advanced models in homogeneous carbonates Benefits from multi-well data more consistently
Data Volume	Max Training Data	~25,000 data points (4 wells per formation) Largest dataset for this type of comparative study	Typically 2,000-8,000 data points Often single-well or single-formation analyses
Key Log Influence	Pivotal Logs	RHOB, GR, MSFL consistently pivotal Deeper resistivity (LLS, LLD) and caliper data (CALD, CALM) provide incremental gains	GR, RHOB, NPHI, Vp commonly used Less emphasis on incremental feature contribution and caliper data for DTC

This table compares the predictive performance and key findings from the current study on fractured carbonates against typical outcomes reported in prior literature, highlighting the unique challenges and contributions.

Addressing Overfitting in Fractured Reservoirs

One of the critical challenges observed was the trade-off between model complexity and overfitting risk, especially pronounced in Formation-A (more fractured). While advanced models like RF achieved high training R2, their test performance sometimes declined with the largest feature sets.

Intermediate feature sets (e.g., Set #5 or #8) sometimes produced stronger generalization than the full Set #10.
Simpler models (LR, DecTr) were particularly prone to overfitting, showing negative R2 values in complex scenarios.
Formation-B (less fractured) exhibited more stable test performance, indicating that reduced heterogeneity mitigates overfitting risks.
This underscores the importance of careful feature selection and validation in heterogeneous environments, rather than simply adding more data.

Quantify Your AI Impact

Use our interactive calculator to estimate the potential cost savings and efficiency gains your enterprise could achieve with AI integration, tailored to your industry and scale.

Your Industry

Number of Employees Affected by AI

Average Weekly Hours Saved per Employee

Average Hourly Cost per Employee ($)

Annual Cost Savings $0

Annual Hours Reclaimed 0

Optimize Your Operations

Your AI Implementation Roadmap

A typical journey to integrate AI within an enterprise, from initial assessment to ongoing optimization.

Phase 1: Discovery & Strategy

Comprehensive assessment of current workflows, identification of AI opportunities, and development of a tailored AI strategy with clear objectives.

Phase 2: Pilot Program & MVP

Development and deployment of a Minimum Viable Product (MVP) on a small scale, gathering feedback and demonstrating initial ROI.

Phase 3: Scaled Deployment

Full-scale integration of AI solutions across relevant departments, ensuring seamless adoption and robust performance.

Phase 4: Optimization & Future-Proofing

Continuous monitoring, performance tuning, and exploration of advanced AI capabilities to maintain competitive advantage.

Plan Your AI Journey

Ready to Transform Your Enterprise with AI?

Schedule a personalized consultation with our AI strategists to discuss how these insights can be applied to your unique business challenges and drive tangible results.

Book Your Free Consultation

Enterprise AI Analysis: Geoscience & Energy Resources

Machine learning prediction of compressional slowness in fractured carbonates: balancing data volume and incremental log features

Executive Impact: At a Glance

Deep Analysis & Enterprise Applications

Machine Learning Models

Data Volume and Features

Fracture Intensity Impact

Key Log Features

RF Model Dominance in Training

Optimized Feature Progression for DTC Prediction

Performance Comparison: Fractured Carbonates

Addressing Overfitting in Fractured Reservoirs

Quantify Your AI Impact

Your AI Implementation Roadmap

Phase 1: Discovery & Strategy

Phase 2: Pilot Program & MVP

Phase 3: Scaled Deployment

Phase 4: Optimization & Future-Proofing

Ready to Transform Your Enterprise with AI?

Ready to Get Started?

Book Your Free Consultation.

Let's Discuss Your AI Strategy!

Lets Discuss Your Needs

Select Time Zone

Big Competitive Advantage With Ai

Learn More

Our Demos

Research Center

Contact Us

1 888 985 3025

Solutions@OwnYourAi.com

Get Your Ai