Skip to main content
Enterprise AI Analysis: A multi-centre, multi-device benchmark dataset for landmark-based comprehensive fetal biometry

Enterprise AI Analysis

A multi-centre, multi-device benchmark dataset for landmark-based comprehensive fetal biometry

This paper introduces the first open-access, multi-centre, multi-device benchmark dataset for fetal ultrasound images with expert anatomical landmark annotations. It addresses the critical need for diverse, multi-source annotated data to develop robust AI-assisted fetal growth assessment methods, overcoming limitations of single-site datasets and domain shift. The dataset comprises 4,513 de-identified US images from 1,904 subjects across three clinical sites and seven different devices, covering all primary fetal biometry measures (head, abdomen, femur). Technical validation demonstrates that multi-centre training significantly improves generalisation to unseen acquisition conditions, establishing a robust benchmark for future AI development in fetal biometry.

Key Executive Impact

Understanding the scale of data and its impact is crucial for successful AI deployment. This research unlocks significant potential for generalizable medical AI.

0 Ultrasound Images
0 Unique Subjects
0 Clinical Sites
0 US Devices

Deep Analysis & Enterprise Applications

Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.

Highlights the crucial need for varied datasets from multiple sources and devices to overcome domain shift in AI models for medical imaging.

Details the methodology and components involved in creating a comprehensive, open-access dataset with expert annotations for fetal biometry.

Examines how training on multi-centre data improves the performance and robustness of AI models when deployed in diverse clinical settings.

Domain Shift Impact

200% Cross-domain NME for head and abdomen biometry doubled

Cross-domain NME (Normalized Mean Error) for head and abdomen biometry roughly doubled compared to within-domain performance, highlighting substantial domain shift. This translates to models trained on single-site data failing significantly when deployed elsewhere.

Enterprise Process Flow

Multi-centre Data Collection
De-identification & Standardisation
Expert Landmark Annotation
Pre-processing & Quality Control
Train/Test Split Creation
Public Release

Single-Site vs Multi-Centre Training Performance (NME)

Measure Single-Site (e.g., FP→UCL) Multi-Centre (M-C→UCL)
Head Bi-parietal Diameter (BPD) 0.38±0.26 0.02±0.02
Head Occipito-frontal Diameter (OFD) 0.22±0.22 0.03±0.11
Abdomen Transverse Diameter (TAD) 0.45±0.28 0.05±0.12
Femur Length (FL) 0.90±0.54 0.03±0.09
The multi-centre model consistently achieved significantly lower NME values across all measurements when tested on unseen UCL data, often outperforming even UCL's own trained model. This demonstrates the superior generalisation capabilities of diverse training data.

Addressing Landmark Endpoint Variability

Background: Different datasets used opposing endpoint conventions for measurements like Femur Length, which could artificially inflate cross-dataset errors.

Challenge: Manual point-swapping for consistency is time-consuming and error-prone.

Solution: Leveraged BiometryNet's Dynamic Orientation Determination (DOD) mechanism. This allowed the model to learn and remember the appropriate orientation vector for each measurement during training, automatically correcting predicted landmarks at inference.

Impact: Ensured fair cross-dataset comparison and robust generalisation without manual intervention, overcoming a key obstacle in multi-centre AI deployment.

Calculate Your Potential ROI

Estimate the efficiency gains and cost savings your enterprise could achieve by integrating AI-driven biometry. Adjust the parameters below to see a customized projection.

Annual Cost Savings $0
Annual Hours Reclaimed 0

Your AI Implementation Roadmap

A phased approach ensures smooth integration and maximum impact. Here’s a typical timeline for deploying AI solutions based on this research.

Data Ingestion & Standardisation

Collect and de-identify raw ultrasound images, standardise annotation formats, and pre-process for consistency across devices.

Duration: 4-6 Weeks

Model Adaptation & Training

Fine-tune landmark detection models (e.g., BiometryNet) on the multi-centre dataset, leveraging domain adaptation techniques.

Duration: 6-8 Weeks

Cross-Site Validation & Benchmarking

Rigorously test model performance on unseen data from diverse clinical sites to quantify generalisation and robustness.

Duration: 3-4 Weeks

Clinical Integration & Iteration

Deploy validated models into clinical workflows, gather feedback, and iterate on model improvements based on real-world performance.

Duration: Ongoing

Ready to Transform Your Operations?

This research provides a robust foundation. Let's discuss how these advancements in multi-centre fetal biometry can be tailored to your enterprise's unique needs.

Ready to Get Started?

Book Your Free Consultation.

Let's Discuss Your AI Strategy!

Lets Discuss Your Needs


AI Consultation Booking