Enterprise AI Analysis
Infusion: Shaping Model Behavior by Editing Training Data via Influence Functions
This research introduces INFUSION, a framework for targeted data poisoning that modifies existing training documents via influence functions to steer model behavior. Unlike traditional methods that inject explicit attack examples, INFUSION applies subtle, gradient-based perturbations to data already in the training set. Experiments on CIFAR-10 show that editing just 0.2% of the training data achieves targeted misclassification at rates competitive with direct data injection, and that the poisoned data transfers across architectures. Preliminary language model experiments on Caesar ciphers and TinyStories indicate success in amplifying existing behaviors, though outright prediction flips remain rare at scale. The work highlights training data as a critical attack surface and underscores the need for robust data provenance and influence-based monitoring to defend against such stealthy attacks.
Key Findings & Enterprise Implications
INFUSION demonstrates a new frontier in influencing AI behavior through subtle data manipulation, revealing critical vulnerabilities and underscoring the need for advanced data security protocols.
Deep Analysis & Enterprise Applications
Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.
Machine Learning Security: Data Poisoning
Data poisoning involves injecting malicious data into a training set to compromise the integrity or performance of a machine learning model. INFUSION advances this by using subtle, non-explicit perturbations.
Machine Learning Security: Influence Functions
Influence functions quantify the impact of individual training data points on a model's predictions. INFUSION repurposes these to *design* training data perturbations that induce specific model behaviors.
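For small convex models, the influence of a single training point on a test prediction can be computed in closed form. The sketch below does this for an L2-regularized logistic regression in NumPy; the data, model, and sign conventions are illustrative assumptions, not details from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy 2-class data: two Gaussian blobs (hypothetical, not the paper's setup).
X = np.vstack([rng.normal(-1, 0.5, (50, 2)), rng.normal(1, 0.5, (50, 2))])
y = np.r_[np.zeros(50), np.ones(50)]
n, lam = len(y), 0.1

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Fit w by full-batch gradient descent on the L2-regularized log loss.
w = np.zeros(2)
for _ in range(2000):
    p = sigmoid(X @ w)
    w -= 0.5 * (X.T @ (p - y) / n + lam * w)

# Hessian of the total loss at w: X^T diag(p(1-p)) X / n + lam * I.
p = sigmoid(X @ w)
H = X.T @ (X * (p * (1 - p))[:, None]) / n + lam * np.eye(2)

# Influence of upweighting training point z_i on the loss at test point z_t:
#   I(z_i, z_t) = -grad L(z_t)^T  H^{-1}  grad L(z_i)
# Negative influence => upweighting z_i would *reduce* the test loss (helpful);
# large positive influence marks points that hurt the test prediction.
x_t, y_t = np.array([0.9, 1.1]), 1.0
g_t = x_t * (sigmoid(x_t @ w) - y_t)       # gradient at the test point
g_i = X * (p - y)[:, None]                 # per-point gradients, shape (n, 2)
influence = -(g_i @ np.linalg.solve(H, g_t))
```

Scores like these are what INFUSION repurposes: instead of only diagnosing which points matter, the attack inverts the computation to construct perturbations with a desired effect on model behavior.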
Machine Learning Security: Model Robustness
The ability of models to withstand adversarial attacks, including data poisoning, is key to model robustness. INFUSION's cross-architecture transfer demonstrates a new challenge for defensive strategies.
Attack Comparison: INFUSION vs. Direct Injection
| Feature | INFUSION (Perturbations) | Direct Injection (Explicit Samples) |
|---|---|---|
| Attack Visibility | Subtle, non-explicit edits | Explicit target behavior examples |
| Data Budget (CIFAR-10) | 0.2% (100 images) | 0.2% (100 samples) |
| Transferability (Architectures) | Yes (ResNet ↔ CNN) | Yes (common for backdoors) |
| Detection Difficulty | High (subtle changes) | Lower (explicit patterns) |
| Persistence | Designed for persistence | Can be filtered/aligned out |
Case Study: Cross-Architecture Attack Transfer on CIFAR-10
INFUSION demonstrates that a dataset poisoned using one model architecture (e.g., ResNet) can induce targeted misclassifications when a *different* architecture (e.g., CNN) is trained on it. This suggests that adversaries could compute perturbations on publicly accessible proxy models and use them to attack proprietary systems trained on similar, shared data. The transfer is weak but non-zero, especially in the CNN → ResNet direction, indicating that the perturbations exploit features which generalize across model families.
Takeaway: Open-weight models pose a significant risk, enabling 'proxy' attacks on closed-source systems if training data overlaps.
Calculate Your Potential AI ROI
Estimate the efficiency gains and cost savings your enterprise could achieve by strategically implementing robust AI security measures and influence-aware training pipelines.
Your AI Security & Influence Roadmap
A phased approach to integrate influence-aware AI practices into your enterprise, ensuring robust and predictable model behavior.
Phase 1: Vulnerability Assessment & Influence Mapping
Identify critical AI systems and training data. Conduct an influence function analysis to map data point impact on model behavior, revealing potential manipulation vectors.
Phase 2: Data Perturbation & Validation Framework Development
Establish secure environments for simulating data infusion attacks. Develop and test gradient-based perturbation techniques, validating their efficacy and stealth.
Phase 3: Robustness Integration & Monitoring
Integrate influence-aware defenses into training pipelines. Implement continuous monitoring for subtle data anomalies and unexpected model behavior shifts. Train teams on threat detection.
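One lightweight way to operationalize such monitoring is to score each training point's influence on a held-out objective and flag statistical outliers. This is a minimal sketch assuming per-example influence scores are already available (the values shown are hypothetical); it applies a simple median-absolute-deviation (MAD) test.

```python
import numpy as np

# Hypothetical per-example influence scores on a held-out validation objective.
# Most points have small influence; one point is a suspiciously strong outlier.
scores = np.array([0.1, -0.2, 0.05, 0.0, 0.15, -0.1, 8.0, 0.12, -0.05, 0.08])

# Flag anything more than 5 MADs from the median influence for human review.
med = np.median(scores)
mad = np.median(np.abs(scores - med))
flags = np.abs(scores - med) > 5 * mad

print(np.flatnonzero(flags))   # indices of training points to inspect
```

A robust statistic such as MAD is a deliberate choice here: poisoned points are rare by design (0.2% of the data in the CIFAR-10 experiments), so the median-based threshold is barely moved by the outliers it is meant to catch.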
Phase 4: Ongoing Optimization & Threat Intelligence
Refine defense strategies based on new research and emerging threats. Regularly update influence models and perturbation-detection algorithms to maintain a proactive security posture.
Ready to Secure Your AI Future?
Don't let subtle data manipulations compromise your AI initiatives. Partner with us to build resilient, predictable, and secure machine learning systems.