
AI-Generated Disinformation: A Human-Grounded Risk Evaluation

Beyond Surface Judgments: Understanding the Real Impact of LLMs

Our in-depth study reveals critical misalignments between LLM judges and human readers in evaluating deceptive content, highlighting the need for human-centric validation.

Executive Summary: The Discrepancy Revealed

LLM judges often misrepresent human perception of disinformation, leading to skewed risk assessments. Internal judge consistency does not equate to human alignment.

Headline metrics from the audit:
  • Avg. human-judge rank alignment (credibility)
  • Avg. human-judge rank alignment (shareability)
  • Avg. judge-judge rank alignment (credibility)

Deep Analysis & Enterprise Applications

Explore the specific findings from the research below, presented as enterprise-focused modules.

Overview
Methodology
Key Findings

This research casts LLM evaluation of disinformation as a proxy-validity problem, auditing LLM judges against human reader responses. We found persistent gaps in score levels, in item ordering, and in the textual signals each side relies on.

Key takeaway: Internal agreement among LLM judges is not evidence of validity as a proxy for human reader response.
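
To make the audit concrete, here is a minimal sketch of two of those gap measures, assuming hypothetical per-item scores on a shared 1-7 scale (the arrays, the scale, and the variable names are illustrative, not the study's data): a mean-score gap capturing judge harshness, and Kendall's tau capturing rank alignment.

```python
import numpy as np
from scipy.stats import kendalltau

# Hypothetical per-item credibility scores on a shared 1-7 scale.
human_scores = np.array([5.2, 3.8, 4.5, 2.9, 6.1, 4.0])
judge_scores = np.array([3.1, 2.0, 3.5, 1.8, 4.2, 2.4])

# Harshness: how much lower the judge scores the same items on average.
score_gap = judge_scores.mean() - human_scores.mean()

# Rank alignment: does the judge order the items the way humans do?
tau, p_value = kendalltau(judge_scores, human_scores)

print(f"Mean score gap (judge - human): {score_gap:+.2f}")
print(f"Kendall's tau rank alignment:   {tau:.2f} (p = {p_value:.3f})")
```

A strongly negative gap paired with a weak tau is exactly the pattern the study reports: judges score harshly and only weakly recover human orderings.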

Our Rigorous Audit Process

We designed a human-grounded evaluation setting to compare LLM judges against human perceptions of credibility and willingness to share.

Enterprise Process Flow

1. Disinformation Generation (LLMs)
2. Human Reference Collection (Readers)
3. LLM Judge Protocol (8 Frontier Models)
4. Textual Signal Annotation (3 LLM Annotators)
5. Validity Audit & Comparison

This multi-faceted approach allowed us to identify distinct evaluative patterns between AI and humans.
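
As an illustration of step 3, the judge protocol, the sketch below elicits a structured credibility rating from a single judge model. The prompt wording, the 1-7 scale, and the model name are assumptions made for illustration; the study's exact protocol may differ.

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

JUDGE_PROMPT = (
    "Rate the credibility of the following text on a scale from 1 (not at all "
    "credible) to 7 (highly credible). Reply with a single integer.\n\n{text}"
)

def judge_credibility(text: str, model: str = "gpt-4o") -> int:
    """Ask one LLM judge for a credibility rating of `text`."""
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": JUDGE_PROMPT.format(text=text)}],
        temperature=0,  # deterministic judging aids reproducibility
    )
    return int(response.choices[0].message.content.strip())
```

In the study's setting, eight such judges would each score every generated item, and the per-judge scores feed the validity audit in step 5.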

Persistent LLM-Human Misalignment

LLM judges are typically harsher than humans, recover human rankings only weakly, and rely on different textual signals.

LLM Judges Overweight Logical Rigour vs. Humans

LLM judges place more weight on logical rigour and penalize emotional intensity more strongly than human readers do. The result is a coherent evaluative bloc among judges, but one that is misaligned with real reader responses. The table and the sketch that follow make the contrast concrete.

Evaluative Paradigms

Logical Rigour
  • LLM judges: highly prioritized; reward coherent, structured texts
  • Human readers: less emphasis; more susceptible to emotional appeals

Emotional Intensity
  • LLM judges: strongly penalized, viewed as detracting from credibility
  • Human readers: penalized less aggressively; can be perceived as persuasive
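
One way to surface this difference in signal dependence is to regress each rater group's scores on the annotated textual signals and compare the standardized weights. Everything below is synthetic: the signal annotations and score-generating coefficients are placeholders chosen to mimic the reported pattern, not the study's data.

```python
import numpy as np

def signal_weights(signals: np.ndarray, scores: np.ndarray) -> np.ndarray:
    """Standardized least-squares weight of each textual signal on scores."""
    z = (signals - signals.mean(axis=0)) / signals.std(axis=0)
    y = (scores - scores.mean()) / scores.std()
    weights, *_ = np.linalg.lstsq(z, y, rcond=None)
    return weights

rng = np.random.default_rng(0)
# Columns: [logical_rigour, emotional_intensity] annotations per item.
signals = rng.uniform(1, 7, size=(50, 2))

# Synthetic scores mimicking the reported pattern: judges reward rigour and
# penalize emotion; humans weight rigour less and are not repelled by emotion.
judge_scores = 0.8 * signals[:, 0] - 0.6 * signals[:, 1] + rng.normal(0, 0.5, 50)
human_scores = 0.3 * signals[:, 0] + 0.2 * signals[:, 1] + rng.normal(0, 0.5, 50)

for name, scores in [("judges", judge_scores), ("humans", human_scores)]:
    w_rigour, w_emotion = signal_weights(signals, scores)
    print(f"{name}: rigour weight {w_rigour:+.2f}, emotion weight {w_emotion:+.2f}")
```

Opposite signs or very different magnitudes on the emotion weight are the kind of signal-dependence gap the audit is designed to flag.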

The implications are significant for benchmark development and safety evaluation.

Calculate Your AI Evaluation ROI

Estimate the potential operational savings and efficiency gains by aligning your AI evaluation strategies with human perception.

The calculator reports:
  • Potential Annual Savings
  • Hours Reclaimed Annually
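
As a back-of-the-envelope sketch of the arithmetic behind these two outputs (every input below is a hypothetical planning figure, not a study result):

```python
def evaluation_roi(review_hours_per_week: float,
                   hourly_cost: float,
                   automation_fraction: float) -> tuple[float, float]:
    """Estimate annual hours reclaimed and savings from better-aligned evaluation."""
    hours_reclaimed = review_hours_per_week * 52 * automation_fraction
    savings = hours_reclaimed * hourly_cost
    return hours_reclaimed, savings

# Example: 40 review hours/week at $85/hour, 30% automatable with aligned evaluation.
hours, savings = evaluation_roi(40, 85.0, 0.3)
print(f"Hours Reclaimed Annually: {hours:,.0f}")
print(f"Potential Annual Savings: ${savings:,.0f}")
```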

Our Human-Grounded AI Evaluation Roadmap

Our phased approach ensures seamless integration of human-grounded AI evaluation into your enterprise.

Discovery & Audit

Comprehensive analysis of existing LLM evaluation practices and human-perception benchmarks.

Custom Alignment Framework

Development of tailored metrics and prompts for human-grounded evaluation.

Integration & Validation

Integration of the new frameworks into your pipelines, with iterative validation against real-world human data.

Continuous Optimization

Ongoing monitoring and refinement of AI evaluation pipelines for peak accuracy and human alignment.

Ready to Bridge the Gap?

Don't let internal AI agreement mask real-world risks. Ensure your LLM evaluations truly reflect human perception and protect your enterprise.

Ready to Get Started?

Book your free consultation and let's discuss your AI strategy.
