Skip to main content
Enterprise AI Analysis: Database and deep-learning scalability of anharmonic phonon properties by automated brute-force first-principles calculations

Enterprise AI Analysis

Database and deep-learning scalability of anharmonic phonon properties by automated brute-force first-principles calculations

This study introduces Phonix, a comprehensive first-principles database of anharmonic phonon properties for over 6500 inorganic compounds. It details an automated workflow for high-throughput calculations and demonstrates a deep learning scaling law, where prediction accuracy for thermal conductivity improves with increasing data size. The database facilitates the identification of materials with extreme thermal conductivities, accelerating the design of advanced functional materials.

Key Impact Metrics

0 Materials Processed
0 Avg Klat (300K) Wm⁻¹K⁻¹
0 MAE Scaling Factor

Deep Analysis & Enterprise Applications

Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.

Anharmonic Phonons
Deep Learning Scalability
High-Throughput Screening

This section details the critical insights into how anharmonic phonon properties are calculated and their importance in understanding thermal transport.

Automated Anharmonic Phonon Calculation Workflow

Crystal Structure Identification
Symmetry Analysis
Strict Structure Optimization (EOS)
Born Effective Charge Calculation
Harmonic FC2 Calculation
Harmonic Phonon Properties Analysis
Cubic FC3 Calculation (LASSO/FD)
Anharmonic Phonon Properties Analysis
~0.17 MAE Scaling Factor for Kp

The prediction accuracy for Peierls thermal conductivity (Kp) shows a scaling factor of ~0.17, indicating significant improvement with larger training datasets.

Challenges in High-Throughput Calculations

The study highlights significant computational challenges in performing high-throughput anharmonic phonon calculations, including rigorous structural optimization, handling of imaginary frequencies, and the computational cost associated with larger supercells for force constant calculations. The 'auto-kappa' workflow was developed to overcome these hurdles, enabling the construction of the Phonix database. Accurately estimating high-κ values demands rigorous treatment of anharmonic phonon interactions and highly converged computational parameters, such as dense q-point meshes, since even small errors in force constants can significantly impact the results.

Explore how the prediction accuracy of deep learning models scales with the size of the Phonix database.

6500+ Materials in Phonix Database

The Phonix database encompasses over 6500 inorganic compounds, providing a rich dataset for training and validating deep learning models.

Deep Learning Scaling Laws

A clear scaling law was observed for predicting anharmonic phonon properties. Mean absolute error (MAE) decreased with increasing training data size, demonstrating that prediction accuracy improves with more data. For instance, the MAE for log10κp is expected to decrease to 0.15 as the training dataset size approaches 2.5 × 10⁵. This validates the strategy of building large-scale first-principles databases to enhance machine learning model performance for complex material properties.

Feature Phonix (This Study) Other Major Databases (e.g., MP, OQMD, AFLOW) Microsoft Database (2025)
  • Property Type
  • Anharmonic phonon properties (Kp, Kcumul, spectra)
  • Mainly harmonic, band structures, band gaps
  • Anharmonic phonon properties (Kt)
  • Number of Materials
  • ~6500+
  • ~170,000 to ~3.5 Million (excluding duplicates)
  • ~246,000
  • Calculation Method
  • Automated brute-force first-principles
  • DFT-derived, high-throughput
  • Machine Learning Potentials (MLP) trained on FP
  • Material Complexity
  • Wide range, up to 160 atoms/cell
  • Diverse, but mainly simple for deep properties
  • Binary/ternary, up to 7 atoms/cell for MLPs
  • Data Accessibility
  • Publicly available (ARIM-mdx)
  • Publicly available (APIs)
  • Limited public access to specific materials

Discover how the Phonix database and deep learning enable efficient screening for materials with extreme thermal conductivities.

Identification of Extreme Thermal Conductivities

High-throughput screening identified materials exhibiting both extremely high and low thermal conductivities. For high-κ, materials like triclinic Hg(BiS2)2 (Kp = 292 Wm⁻¹K⁻¹) and hexagonal NpPH (Kp = 172 Wm⁻¹K⁻¹) were found, some with high anisotropy. For low-κ, several compounds with Kp < 0.1 Wm⁻¹K⁻¹ were identified. These discoveries highlight the database's potential to accelerate the search for novel functional materials.

Case Study: High Thermal Conductivity Material - NpPH

Hexagonal NpPH (space group 194) exhibited very high Peierls thermal conductivity (Kp,zz = 172 Wm⁻¹K⁻¹ for 3-phonon, reduced to 53 Wm⁻¹K⁻¹ with 4-phonon scattering). Its unique structure, characterized by heavy Np atoms surrounded by light H and P atoms, leads to a clear separation of phonon modes by frequency range. This suppresses anharmonic interactions, resulting in long phonon lifetimes for acoustic modes primarily composed of heavy atoms, which dominate heat transport. This provides concrete insight for synthesizing similar high-κ materials with more experimentally amenable transition metals.

Case Study: Low Thermal Conductivity Material - Cs6Rb2SnPbI12

Trigonal Cs6Rb2SnPbI12 (space group 148) exhibited extremely low thermal conductivity (Kp,xx/yy = 0.032 Wm⁻¹K⁻¹). This material's complex crystal structure leads to phonon modes (formed by a mixture of atomic species) distributed across a wide frequency range, in contrast to the localized mode behavior seen in high-κ materials. The presence of heavy atoms and complex framework contribute to strong phonon scattering, leading to a low thermal conductivity, offering insights for the design of advanced thermoelectric materials.

Calculate Your Potential ROI

See how leveraging advanced material informatics could translate into tangible benefits for your organization.

Annual Savings $0
Hours Reclaimed Annually 0

Your AI Implementation Roadmap

A structured approach to integrating material informatics into your R&D pipeline.

Phase 1: Data Acquisition & Pre-processing (Weeks 1-4)

Establish automated pipelines for extracting and cleaning data from the Phonix database. This involves API integration, data validation, and initial feature engineering to prepare the dataset for model training.

Phase 2: Model Training & Validation (Weeks 5-8)

Train and validate Graph Neural Network (GNN) and Euclidean Neural Network (e3nn) models using the pre-processed Phonix data. Focus on hyperparameter tuning and cross-validation to optimize prediction accuracy for anharmonic phonon properties.

Phase 3: High-Throughput Screening & Discovery (Weeks 9-12)

Deploy trained models for high-throughput screening of large material databases (e.g., GNOME) to identify candidates with extreme thermal conductivities. Prioritize materials based on predicted properties and experimental feasibility for further investigation.

Phase 4: Experimental Validation & Integration (Ongoing)

Collaborate with experimental teams to validate predictions and integrate successful discoveries into material design and development pipelines. Continuously refine models with new experimental data and feedback.

Ready to Transform Your Materials Discovery?

Leverage the power of AI and comprehensive first-principles data to accelerate your R&D. Book a free consultation to discuss your specific needs.

Ready to Get Started?

Book Your Free Consultation.

Let's Discuss Your AI Strategy!

Lets Discuss Your Needs


AI Consultation Booking