GENERATIVE AI IN HEALTHCARE
A robust vision language model for molecular status prediction and radiology report generation in adult-type diffuse gliomas
We aimed to establish a robust vision-language model (“Glio-LLaMA-Vision”) for molecular status prediction and radiology report generation (RRG) in adult-type diffuse gliomas. Multiparametric MRI data and paired radiology reports from 1,001 patients with adult-type diffuse gliomas were included in the institutional training set. A vision-language model, Glio-LLaMA-Vision, was developed from LLaMA 3.1 pre-trained on 2.79 million biomedical image-text pairs from PubMed Central and further fine-tuned from the institutional training set. The performance was validated in 100 patients and 75 patients with paired MRI-radiology reports from an institutional validation set and another tertiary institution (AMC), and in 170 and 477 patients with MRI from TCGA and UCSF datasets, respectively. In terms of IDH mutation status prediction, Glio-LLaMA-Vision showed AUCs ranging from 0.85-0.95 in the internal validation and external datasets. In terms of RRG, the BLEU-1 and ROUGE-L scores were 0.50 and 0.49 in the internal validation, respectfully, and 0.32 and 0.36 on the AMC dataset, respectively. Overall, 37.8% of generated reports were considered superior or equal to the original reports, while overall 91.0% of generated reports were considered clinically acceptable by neuroradiologists. Glio-LLaMA-Vision demonstrates promising performance in molecular status prediction and RRG in adult-type diffuse gliomas, showing potential for clinical assistance.
Authors: Yae Won Park, Myeongkyun Kang, Huiseung Ryu, Kyunghwa Han, Yongsik Sim, Ji Eun Park, Jong Hee Chang, Se Hoon Kim, Seung-Koo Lee, Sang Hyun Park & Sung Soo Ahn
Executive Impact
We aimed to establish a robust vision-language model (“Glio-LLaMA-Vision”) for molecular status prediction and radiology report generation (RRG) in adult-type diffuse gliomas. Multiparametric MRI data and paired radiology reports from 1,001 patients with adult-type diffuse gliomas were included in the institutional training set. A vision-language model, Glio-LLaMA-Vision, was developed from LLaMA 3.1 pre-trained on 2.79 million biomedical image-text pairs from PubMed Central and further fine-tuned from the institutional training set. The performance was validated in 100 patients and 75 patients with paired MRI-radiology reports from an institutional validation set and another tertiary institution (AMC), and in 170 and 477 patients with MRI from TCGA and UCSF datasets, respectively.
Deep Analysis & Enterprise Applications
Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.
Molecular Status Prediction
The Glio-LLaMA-Vision model demonstrates robust performance in predicting IDH mutation status across various datasets, critical for glioma diagnosis and treatment planning. The AUCs consistently ranged from 0.85 to 0.95, indicating high accuracy and reliability.
Radiology Report Generation (RRG)
The model's ability to generate radiology reports was evaluated using BLEU-1 and ROUGE-L scores, which indicate moderate lexical quality and content coverage. Qualitative evaluations by neuroradiologists showed high clinical acceptability, making it a valuable tool for radiologists.
Model Architecture
Glio-LLaMA-Vision is a vision-language model developed from LLaMA 3.1, pre-trained on millions of biomedical image-text pairs and fine-tuned on MRI-report datasets. It leverages a 3D representation by averaging features across multiple axial slices for effective MRI analysis.
Enterprise Process Flow
| Metric | Internal Validation | AMC Dataset |
|---|---|---|
| BLEU-1 Score |
|
|
| ROUGE-L Score |
|
|
Case Study: Clinically Acceptable Report Generation
A 65-year-old female patient with oligodendroglioma was accurately identified with IDH-mutant status by Glio-LLaMA-Vision. The generated radiology report described a non-enhancing tumor in the insula, consistent with the original report. All three neuroradiologists rated the generated report as "clinically acceptable" and "very satisfied" or "satisfied" in terms of readability, highlighting the model's practical utility.
Key Takeaway: Despite some quantitative differences, the model's qualitative performance meets clinical standards, indicating its potential for real-world application in aiding radiologists.
Advanced ROI Calculator
Estimate the potential return on investment for implementing AI solutions in your enterprise based on key operational metrics.
Your AI Implementation Roadmap
A phased approach to integrate advanced AI solutions into your enterprise, ensuring a smooth and successful transition.
Phase 1: Discovery & Strategy
In-depth analysis of current workflows, data infrastructure, and identification of key AI opportunities. Development of a tailored AI strategy aligned with business objectives.
Phase 2: Pilot & Proof-of-Concept
Implementation of a small-scale pilot project to validate the AI model's performance on real-world data and gather initial feedback. Iterative refinement based on results.
Phase 3: Integration & Scaling
Seamless integration of the AI solution into existing enterprise systems. Scalable deployment across relevant departments, ensuring robust performance and user adoption.
Phase 4: Monitoring & Optimization
Continuous monitoring of AI system performance, regular updates, and ongoing optimization to ensure sustained efficiency and evolving business needs are met.
Ready to Transform Your Enterprise with AI?
Schedule a personalized consultation with our AI strategists to map out your success.