Skip to main content
Enterprise AI Analysis: AI versus human-generated multiple-choice questions for medical education: a cohort study in a high-stakes examination

Enterprise AI Analysis

AI versus human-generated multiple-choice questions for medical education: a cohort study in a high-stakes examination

This study evaluates the quality of ChatGPT-40-generated MCQs compared to human-created MCQs in a high-stakes medical licensing exam. While AI-generated MCQs were easier and faster to create, human MCQs better assessed higher-order cognitive skills and had fewer factual inaccuracies and irrelevance issues. The study suggests a hybrid AI-human approach for optimal question generation.

Executive Impact: Key Findings at a Glance

Key findings from the analysis, translated into actionable enterprise insights.

AI MCQs Easier than Human MCQs (Difficulty Index)
AI MCQs Faster to Create (Time Efficiency)
Human MCQs Better for Higher-Order Skills

Deep Analysis & Enterprise Applications

Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.

Psychometric Analysis

This section details the comparative psychometric properties of AI-generated versus human-generated MCQs, including difficulty, discrimination, and reliability. It highlights the statistical differences and similarities that impact assessment quality.

Expert Review Findings

This section covers the qualitative assessment of MCQs by medical experts, focusing on factual correctness, relevance, difficulty appropriateness, and item writing flaws, revealing specific areas where AI-generated questions fell short.

Time Efficiency & Cognitive Levels

This section contrasts the time expenditure for generating MCQs by AI versus humans and analyzes the cognitive skills (Bloom's Taxonomy) predominantly tested by each method, underscoring AI's efficiency but limitations in assessing higher-order thinking.

Mean Difficulty Index for AI-generated MCQs
Mean Difficulty Index for Human-generated MCQs

Enterprise Process Flow

AI MCQs Generated by ChatGPT-40
Human MCQs Generated by Experts
Expert Review (6 Specialists)
AI & Human MCQs Refined
Final Review for PEEM Suitability
Candidate Mock & Actual Exams
Data Analysis & Psychometrics
Feature AI-Generated MCQs Human-Generated MCQs
Factual Accuracy
  • Higher instances of factual incorrectness (6% errors)
  • Fewer factual inaccuracies (4% errors)
Relevance & Difficulty
  • Higher irrelevance (6%) and inappropriate difficulty (14%)
  • No irrelevance (0%) and appropriate difficulty (1%)
Cognitive Level (Bloom's Taxonomy)
  • Predominantly 'Remember' and 'Understand' levels
  • Higher proportion of 'Apply' and 'Analyse' levels
Time Efficiency
  • Significantly less time required (24.5 person-hours)
  • Resource-intensive and time-consuming (96 person-hours)

Optimizing MCQ Generation: A Hybrid Approach

The study concludes that a hybrid AI-human framework is ideal for high-stakes medical exams. AI can handle initial question generation efficiently, reducing the burden on human experts. However, human oversight is critical for ensuring quality, contextual relevance, and alignment with higher-order cognitive skills.

Key Takeaways:

AI for initial draft generation
Human experts for review and refinement
Focus on higher-order cognitive skills
Regular feedback loops and prompt engineering

Calculate Your Potential AI ROI

Estimate the time and cost savings AI can bring to your operations, tailored to your enterprise.

Estimated Annual Savings $0
Hours Reclaimed Annually 0

Your AI Implementation Roadmap

A typical phased approach to integrate AI capabilities into your enterprise operations successfully.

Phase 01: Discovery & Strategy

Comprehensive analysis of current workflows, identification of AI opportunities, and development of a tailored AI strategy and roadmap.

Phase 02: Pilot & Proof-of-Concept

Deployment of AI solutions in a controlled environment to validate effectiveness, measure ROI, and gather initial user feedback.

Phase 03: Scaled Integration

Full-scale deployment of validated AI solutions across relevant departments, including data migration and system integrations.

Phase 04: Training & Optimization

Comprehensive training for your teams, continuous monitoring of AI performance, and iterative optimization for maximum efficiency.

Ready to Transform Your Enterprise with AI?

Book a free 30-minute consultation with our AI specialists to explore how these insights can drive your business forward.

Ready to Get Started?

Book Your Free Consultation.

Let's Discuss Your AI Strategy!

Lets Discuss Your Needs


AI Consultation Booking