Skip to main content
Enterprise AI Analysis: Predictive models for the occurrence and lymph node metastasis of papillary thyroid carcinoma with regional risk heterogeneity

Enterprise AI Analysis

Predictive Models for Thyroid Carcinoma: Regional Risk Heterogeneity & Interpretability

Authors: Zhigang Zhang, Hongyu Liu, Zheng Zhao, Guoyu Tan, Yang Zhao & Xun Liu

Papillary thyroid carcinoma (PTC) is the most common thyroid malignancy, and cervical lymph node metastasis significantly impacts patient prognosis. This study aimed to develop interpretable artificial intelligence models based on transcriptomics to predict PTC occurrence and cervical lymph node metastasis, while exploring the heterogeneity of risk factors across different regions. We obtained 419 samples from the GEO database, originating from Asia, Europe, and America, comprising 158 normal samples, 203 PTC samples, and 58 metastatic samples. After comparing multiple machine learning algorithms, deep neural networks (DNN) demonstrated superior performance and were used to construct the PTC diagnostic and metastasis predictive models. The optimized PTC diagnostic model achieved an AUC of 0.987 with an accuracy of 0.945, while the PTC metastasis predictive model reached an AUC of 0.998 with an accuracy of 0.987. Model interpretation using SHapley Additive exPlanations (SHAP) and Kolmogorov-Arnold Networks (KAN) methods identified SYT1, REN, CNTN5, and ADAM12 as critical features for PTC diagnosis, whereas COL9A1, CYP4F3, and GAD1 were key predictors for PTC metastasis. Stratification analysis revealed regional differences in risk factors for PTC occurrence, while factors promoting PTC metastasis exhibited commonalities across different regions. Pathway enrichment analysis indicated that regulation of hormone levels and cell population proliferation were common pathways involved in both PTC occurrence and metastasis. Finally, we developed online predictive platforms based on the Streamlit framework to facilitate transparent model exploration and risk estimation. These tools are publicly accessible research-use prototypes intended for model demonstration and interpretability visualization. Because they require standardized gene-expression inputs and have not undergone prospective clinical validation, they should not be used as standalone tools for clinical diagnosis, risk stratification, or treatment decision-making. Overall, our findings identify candidate transcriptomic markers associated with PTC occurrence and lymph node metastasis and provide a basis for future translational evaluation.

Executive Impact: Key Performance Indicators

Leveraging advanced transcriptomic AI models for enhanced diagnostic precision and metastatic risk prediction in Papillary Thyroid Carcinoma.

0.987 PTC Diagnostic Model AUC
0.998 Metastasis Predictive Model AUC
0.945 PTC Diagnostic Accuracy
0.987 Metastasis Predictive Accuracy

Deep Analysis & Enterprise Applications

Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.

99.8% AUC for Metastasis Prediction (DNN Model)

The Deep Neural Network (DNN) model achieved an outstanding AUC of 0.998 for predicting PTC metastasis, demonstrating robust predictive capability.

Model Type PTC Diagnosis PTC Metastasis
DNN (20-feature) 0.987 0.998
DNN (Full-feature) 0.921 0.973
Gradient Boosting 0.974 N/A
Bagging 0.963 0.996
AdaBoost 0.953 0.964
Logistic Regression 0.933 N/A
KNN N/A 0.918
Decision Tree N/A 0.833
KAN (Diagnostic) 0.94 N/A
KAN (Metastasis) N/A 0.90
SYT1, REN Top Markers for PTC Diagnosis

SHAP and KAN analyses consistently identified SYT1 and REN among the most influential transcriptomic features for Papillary Thyroid Carcinoma diagnosis.

COL9A1, CYP4F3 Top Markers for Metastasis Prediction

COL9A1 and CYP4F3 were consistently identified as key predictors for PTC lymph node metastasis across multiple analytical methods.

Region Top Diagnostic Factors (SHAP) Top Metastasis Factors (SHAP)
America SYT1, REN, CNTN5, GABRA6 COL9A1, GAD1, CYP4F3
Asia SYT1, CNTN5, ADAM12, REN COL9A1, CYP4F3, GAD1
Europe SYT1, CNTN5, ADAM12, REN COL9A1, CYP4F3, GAD1
SYT1 Consistently Influential Diagnostic Marker Across Continents

SYT1 was identified as the most influential feature for PTC diagnosis across America, Asia, and Europe, highlighting its broad importance.

Commonality in Metastasis Factors Consistent Metastasis Predictors Across Regions

Factors promoting PTC metastasis (COL9A1, CYP4F3, GAD1) exhibited greater consistency across regions, suggesting more conserved molecular mechanisms.

Enterprise Process Flow

Cell Growth
Cell-Substrate Junction Organization
DNA-Binding Transcription
Protein Serine/Threonine/Tyrosine Kinase Activity
PI3K/Akt & MAPK Signaling
Hormone Regulation & Cell Proliferation Common Pathways in PTC Occurrence & Metastasis

Pathway enrichment analysis revealed that regulation of hormone levels and cell population proliferation are common biological processes involved in both PTC occurrence and metastasis, aligning with thyroid cancer biology.

Accessible AI for Research

Our study successfully developed web-based calculators for PTC diagnosis and metastasis prediction using the Streamlit framework. These platforms provide an accessible interface for model inference and individualized feature attribution, intended as research-use prototypes for model demonstration and interpretability visualization. They are not yet prospectively validated for clinical decision-making and require standardized gene-expression inputs.

Advanced ROI Calculator: Optimize Your Diagnostic Workflow

Quantify the potential impact of AI-driven transcriptomic analysis on early detection and intervention for thyroid carcinoma within your enterprise.

Annual Cost Savings $0
Researcher Hours Reclaimed 0 hours

Our AI Implementation Roadmap

Our structured roadmap ensures a seamless integration of advanced AI predictive models into your diagnostic and research workflows, driving precision and efficiency.

Phase 1: Data Integration & Model Adaptation

Securely integrate existing transcriptomic datasets (e.g., RNA-seq, microarray) and adapt models to your specific institutional data standards and regional patient cohorts.

Phase 2: Validation & Clinical Pilot

Conduct rigorous internal and prospective validation studies with local patient samples, followed by a controlled pilot in a clinical or research setting to assess real-world performance.

Phase 3: Workflow Integration & Training

Integrate the validated predictive platforms into existing laboratory and clinical information systems (LIS/EHR). Provide comprehensive training for pathologists, oncologists, and researchers on model use and interpretation.

Phase 4: Continuous Monitoring & Iteration

Establish a framework for continuous monitoring of model performance, data drift, and clinical outcomes. Implement a feedback loop for model updates and further refinement based on new data and insights.

Ready to Get Started?

Book Your Free Consultation.

Let's Discuss Your AI Strategy!

Lets Discuss Your Needs


AI Consultation Booking