Research Insights

Enterprise AI Analysis: Safety and efficacy of privacy-preserving models to create Lay summaries of brain MRI reports

Patient access to radiology reports has highlighted the need for patient-friendly communication. Automated generation of patient-centered summaries using large language models (LLMs) is a promising solution. This study evaluated the safety and effectiveness of on-premise, privacy-preserving LLMs for generating lay summaries of real French brain MRI reports for emergency presentations of headache. The study sampled 105 brain MRI reports for radiologist evaluation and a subset of 30 for non-physician evaluation. Three open-weights models (Llama 3.3 70B, Athene V2, Mistral Small) generated French lay summaries. Radiologists' mean ratings across models were high for exactness (4.10), exhaustiveness (4.34), didacticness (3.83), and readiness for clinical use (3.84). Non-physicians reported higher perceived understanding with summaries (from 2.85 to 4.27, p < 0.001) and improved correct identification rate for reports (from 75.2% to 83.6%, p < 0.001). The ability to identify causal findings also improved (from 80.6% to 84.8%, p < 0.001). The overall error rate in LLM-generated lay summaries was 19.7% (62/315), warranting expert oversight. The study concludes that privacy-preserving LLMs can improve perceived and objective understanding of brain MRI reports for highly educated non-physicians, but medical errors and linguistic issues necessitate a 'human-in-the-loop' framework for clinical safety.

Schedule Your Strategy Session

Executive Impact

Quantifiable advantages of integrating advanced AI into your operations, directly linked to this research.

0% Increased Perceived Understanding

0% Improved Report Identification

0% Error Rate

Discuss Your Implementation

Deep Analysis & Enterprise Applications

Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.

Radiologist Evaluation

Radiologists rated LLM-generated summaries on exactness, exhaustiveness, didacticness, and readiness for clinical use. Mean ratings were high across models: exactness (4.10), exhaustiveness (4.34), didacticness (3.83), and clinical readiness (3.84). This indicates a strong potential for these tools to assist in clinical workflows, albeit with the noted error rates requiring oversight.

4.34 Mean Exhaustiveness Rating

LLM Performance Across Metrics

Metric	Llama 3.3 70B	Athene V2	Mistral Small
Exactness	High (4.13)	High (4.44)	High (4.24)
Exhaustiveness	Excellent (4.38)	Excellent (4.43)	Good (4.20)
Didacticness	Good (3.90)	Good (3.95)	Moderate (3.66)
Clinical Use Readiness	Good (3.84)	Good (3.89)	Moderate (3.78)

Non-Physician Understanding

Non-physicians showed significantly improved perceived understanding (from 2.85 to 4.27, p < 0.001) and objective understanding. The correct identification rate for abnormal reports increased from 64.0% to 76.9% (p < 0.001), and for causal findings from 71.9% to 78.6% (p < 0.001). This highlights the LLMs' ability to bridge the communication gap between complex medical reports and the general public, facilitating better patient engagement.

76.9 Abnormal Report ID Rate (%)

Patient Communication Workflow

Original MRI Report

→

LLM Generates Lay Summary

→

Radiologist Review (Human-in-the-Loop)

→

Patient Receives Lay Summary

→

Improved Patient Understanding

4.27 Mean Perceived Understanding

Error Analysis & Limitations

The overall error rate was 19.7% (62/315 summaries), with medical errors at 9.2% and linguistic errors at 8.3%. These errors included incorrect acronym explanations, anatomical descriptions, and foreign-language insertions. The findings underscore the necessity of expert oversight ('human-in-the-loop') and further model refinement, especially for non-English languages and domain-specific terminology.

19.7 Overall Error Rate (%)

Addressing Linguistic Challenges

Challenge: LLMs struggled with medical acronyms and produced linguistic errors, including foreign-language insertions, particularly in French summaries.

Solution: Implementing a 'human-in-the-loop' framework for rapid editing by radiologists, alongside integrating domain-specific acronym dictionaries, is crucial to enhance accuracy and clinical safety.

Advanced ROI Calculator

Estimate your potential return on investment by integrating AI-powered solutions into your enterprise.

Your Industry

Number of Employees Impacted

Average Hours Spent on Manual Tasks per Week (per employee)

Average Hourly Cost (including benefits)

Estimated Annual Savings $0

Annual Hours Reclaimed 0

Your AI Implementation Roadmap

A structured approach to integrating AI into your operations, designed for maximum impact and minimal disruption.

Discovery & Strategy

Initial assessment of your current workflows, identification of AI opportunities, and development of a tailored strategy.

Pilot Program & Validation

Deployment of a small-scale pilot, performance monitoring, and validation of ROI against predefined metrics.

Full-Scale Integration

Phased rollout of the AI solution across relevant departments, ensuring seamless adoption and continuous optimization.

Ongoing Optimization & Support

Continuous monitoring, performance tuning, and dedicated support to ensure long-term success and adaptability.

Ready to Transform Your Enterprise?

Let's discuss how our AI solutions can specifically address your business challenges and drive measurable results.

Schedule Your Consultation

Research Insights

Enterprise AI Analysis: Safety and efficacy of privacy-preserving models to create Lay summaries of brain MRI reports

Executive Impact

Deep Analysis & Enterprise Applications

Radiologist Evaluation

LLM Performance Across Metrics

Non-Physician Understanding

Patient Communication Workflow

Error Analysis & Limitations

Addressing Linguistic Challenges

Advanced ROI Calculator

Your AI Implementation Roadmap

Discovery & Strategy

Pilot Program & Validation

Full-Scale Integration

Ongoing Optimization & Support

Ready to Transform Your Enterprise?

Ready to Get Started?

Book Your Free Consultation.

Let's Discuss Your AI Strategy!

Lets Discuss Your Needs

Select Time Zone

Big Competitive Advantage With Ai

Learn More

Our Demos

Research Center

Contact Us

1 888 985 3025

Solutions@OwnYourAi.com

Get Your Ai