AI · Data Security · May 25, 2026

97% of Companies Breached Through AI Had Zero Access Controls.

The IBM Cost of a Data Breach Report 2025 found that 13% of all organizations studied had experienced a breach of an AI model or AI application. Of those, 97% reported having no proper AI access controls in place. A $10 billion AI startup named Mercor was compromised in March 2026 through a poisoned open-source library used by millions of developers. Microsoft’s 365 Copilot was shown to exfiltrate corporate data with zero user interaction required.

Enterprises are deploying large language models into the heart of their most sensitive operations — email, documents, code, HR data — while the regulatory and security infrastructure to govern those deployments remains largely unbuilt. The attack surface is new. The defenses are not there yet.

On May 22, 2025, the NSA, CISA, FBI, and allied agencies from Australia, the UK, and New Zealand issued a joint warning: organizations are deploying AI faster than they can secure it. On May 1, 2026, a Six Eyes coalition expanded that warning to autonomous AI agents. This is the threat landscape as it stands.

97%of AI-breach victimshad no proper AI access controls — IBM / Ponemon 2025
$10.22MUS avg breach cost 2025all-time national high — IBM Cost of a Data Breach Report
4 TBstolen from MercorAI training data supplier to OpenAI, Anthropic, Meta — March 2026

§ 01 / The IBM Finding — 97% Had No Controls

The IBM Cost of a Data Breach Report 2025 — conducted annually by the Ponemon Institute across 600 organizations globally, covering breaches that occurred between March 2024 and February 2025 — documented an AI governance crisis hiding inside the broader data security numbers. The headline figure: the global average breach cost fell to $4.44 million, down 9% from the prior year’s $4.88 million. The first decline in five years, driven by AI-powered defensive tools that let organizations identify and contain breaches 80 days faster.

A corporate AI model wired into email, documents, code, and HR data sits behind a wide-open gate with no lock, while a clipboard tallies that 97 of every 100 breached organizations had no access controls — illustrating the IBM Cost of a Data Breach finding. — IBM's 2025 report found 97 percent of organizations breached through an AI model had no proper AI access controls, and 63 percent had no AI governance policy at all. — Civic Intelligence illustration

But the same report surfaced a structural vulnerability in how companies are deploying AI. Among the organizations that experienced a breach of an AI model or AI application — 13% of the full study sample — 97% lacked proper AI access controls. And of the 600 organizations studied, 63% had no AI governance policies at all to manage AI deployments or prevent workers from using shadow AI tools.

“Shadow AI” — the use of unapproved AI tools by employees, downloading or using consumer-grade AI applications on corporate networks — added an average of $670,000 to the cost of a breach for organizations with high levels of it. One in five organizations reported a breach that was directly attributable to shadow AI. Only 37% had any policies in place to detect or manage it.

Chart · Data Breach Cost — AI Exposure vs. AI Defense

Source: IBM Cost of a Data Breach Report 2025 (Ponemon Institute) · 600 organizations studied · March 2024 – Feb 2025

US average breach cost (2025)

$10.22M

IBM / Ponemon 2025 — all-time national high

Global average breach cost (2025)

$4.44M

IBM / Ponemon 2025 — first decline in 5 years

Shadow AI premium (per breach)

+$670K

IBM 2025 — unauthorized AI tools added to breach cost

AI-defended org savings (per breach)

-$1.9M

IBM 2025 — extensive AI security tools vs. none

The AI Security Paradox — IBM 2025

AI is simultaneously the most powerful defensive tool in enterprise security and the most under-governed new attack surface. Organizations deploying AI security tools extensively saved $1.9 million per breach and contained incidents 80 days faster than those without. The US average breach cost hit an all-time national high of $10.22 million — more than double the global average of $4.44 million. The gap between organizations with mature AI security programs and those without has never been larger.

§ 02 / EchoLeak — The First Zero-Click AI Exfiltration

In June 2025, researchers at Aim Security disclosed EchoLeak, tracked as CVE-2025-32711 with a CVSS score of 9.3 (critical). The vulnerability affected Microsoft 365 Copilot and allowed a remote attacker to steal confidential corporate data by sending a single email — with no click, no phishing link, no malware download required from the target. It is the first documented case of a prompt injection attack being weaponized to cause concrete data exfiltration in a production AI system.

The mechanism exploits what Aim Security termed an “LLM scope violation.” Copilot’s deep integration with Microsoft 365 — its ability to read email, OneDrive files, SharePoint content, Teams messages, and preloaded organizational data — is the product’s core value proposition. EchoLeak showed it is also a structural attack surface: an attacker who could manipulate the AI’s context window could cause it to retrieve and exfiltrate any data within Copilot’s access scope. The exploit required no credentials, no interaction from the victim, and left no conventional forensic trace that existing SIEM or DLP platforms were designed to detect.

“EchoLeak reveals a structural attack surface that applies to any LLM-based assistant with access to multiple internal data sources — not just Copilot.”
Aim Security researchers · EchoLeak disclosure · arXiv 2509.10540 · June 2025

Microsoft confirmed the vulnerability and issued emergency patches, noting in its advisory that “no further action is required from customers” following remediation. The U.S. House of Representatives had separately banned congressional staff from using Microsoft Copilot in March 2025, citing concerns about data leakage to unauthorized cloud services. A separate bug disclosed in January 2026 showed that a flaw in Microsoft 365 Copilot had granted the AI assistant access to confidential emails in violation of data loss prevention policies.

Hackers Are Now Attacking AI (LLM Attacks Explained)

§ 03 / The Mercor Breach — AI’s Supply Chain Problem

On March 27, 2026, a threat actor group known as TeamPCP published two malicious versions of LiteLLM— packages 1.82.7 and 1.82.8 — directly to PyPI, the Python package repository. LiteLLM is an open-source Python library used to connect applications to AI services. It has 97 million monthly downloads and a presence in an estimated 36% of cloud environments. The attackers had compromised LiteLLM’s CI/CD pipeline to obtain the credentials needed to publish directly to the repository.

A single poisoned open-source software package labeled LiteLLM passes through a supply-chain conveyor into a $10 billion AI startup, where attackers haul out terabytes of training data, candidate profiles, and API keys — depicting the Mercor breach. — Two malicious LiteLLM packages pushed to PyPI became the entry point for the Mercor breach — roughly 4 terabytes of frontier-lab training data, contractor PII, and API keys stolen through one compromised dependency. — Civic Intelligence illustration

Mercor — a startup that provides AI training data to OpenAI, Anthropic, Meta, and other frontier labs, valued at $10 billion and serving as a critical link in the AI training supply chain — ran LiteLLM as a dependency. On March 31, 2026, Mercor confirmed it had been breached through the poisoned LiteLLM packages. The hacker group Lapsus$ later claimed responsibility. The stolen data: an estimated 4 terabytes, including candidate profiles and personally identifiable information on more than 40,000 contractors, proprietary source code, video interviews, employer data, and API keys that potentially exposed the AI training methodologies of multiple frontier labs.

What the Mercor Breach Exposed

For Mercor: At least seven class-action lawsuits filed. Meta paused its Mercor contracts indefinitely. OpenAI began investigating its exposure. Five contractors filed individual suits over personal data exposure.

For the AI industry: The breach demonstrated that the AI supply chain is as vulnerable as any software supply chain — and that a single compromised open-source dependency used by millions of developers can become an entry point into the proprietary training data of the most valuable AI companies on earth.

For security teams: Traditional software composition analysis (SCA) tools check dependencies for known CVEs. They do not detect the window between a malicious package’s PyPI publication and its CVE assignment — which can be hours or days. Mercor was breached within that window.

Massive AI Breach: LiteLLM Leak Puts 97 Million in Danger

§ 04 / The Samsung Incident — The Insider Problem

The most visible early case of corporate AI data leakage did not involve any external attacker. In March 2023, Samsung Electronics authorized employee use of ChatGPT in its semiconductor business unit. Within three weeks, three separate incidents of sensitive data leakage were documented.

In the first, an employee pasted faulty source code into ChatGPT to debug it. In the second, an employee entered program code for identifying defective equipment to receive optimization suggestions. In the third, an employee used ChatGPT to transcribe a recorded internal meeting and generate minutes — uploading the audio directly. All three incidents sent Samsung’s proprietary data to OpenAI’s servers, where, under the terms of service at the time, it could be used to train future models.

Samsung’s response — capping ChatGPT inputs at 1,024 bytes and accelerating development of an in-house AI — became the template that dozens of enterprises followed. JPMorgan Chase and Goldman Sachs restricted ChatGPT access entirely after discovering employees had shared sensitive financial data through it. The Samsung case established what security researchers now call the “shadow AI” pattern: employees using consumer AI tools for legitimate work tasks without understanding that the data they enter may leave the organization permanently.

2025 Cost of a Data Breach: AI Risks, Shadow AI, and Solutions — IBM

§ 05 / The Attack Surface — Prompt Injection and the OWASP Top 10

The Open Worldwide Application Security Project (OWASP) published its Top 10 for LLM Applications in 2025, documenting the ten most critical vulnerabilities in enterprise AI deployments. Prompt injection — the manipulation of natural-language inputs to override the AI’s intended instructions — ranks first. OWASP researchers found it present in over 73% of production AI deployments assessed during security audits. Only 34.7% of organizations had deployed dedicated defenses against it.

Unlike traditional application vulnerabilities, prompt injection operates at the semantic layer, not the network or application layer. Existing firewalls, SIEM platforms, SOAR systems, and data loss prevention tools were not designed to detect it. Research published in 2025 demonstrated that five carefully crafted documents can manipulate AI responses 90% of the time. The attack success rate for prompt injections in auto-execution mode — where AI agents act on retrieved content without human review — ranged from 66.9% to 84.1%.

OWASP Top 10 for LLM Applications · 2025 Edition

Source: OWASP GenAI Security Project · owasp.org/www-project-top-10-for-large-language-model-applications

RankVulnerabilitySeverity

LLM01

Prompt Injection

#1 exploit in 73% of production AI deployments

Critical

LLM02

Sensitive Info Disclosure

Training data leakage, system prompt exposure, PII output

Critical

LLM03

Supply Chain Attacks

Compromised dependencies (e.g. LiteLLM → Mercor, March 2026)

High

LLM04

Data & Model Poisoning

Split-view poisoning; adversarial training data injection

High

LLM05

Improper Output Handling

Unvalidated LLM output fed into downstream systems

High

LLM06

Excessive Agency

AI agents taking unauthorized actions with real consequences

High

LLM07

System Prompt Leakage

Confidential instructions exposed via adversarial queries

Medium

LLM08

Vector & Embedding Weaknesses

New 2025 entry — RAG systems and vector database attacks

Medium

LLM09

Misinformation / Overreliance

Organizations trusting unverified AI output as fact

Medium

LLM10

Unbounded Consumption

Runaway inference costs; denial-of-service via token exhaustion

Medium

“Most modern attacks against AI systems are integrity attacks — prompt injection is the primary vector. The fundamental architectural weakness is that LLMs cannot reliably distinguish between trusted instructions and untrusted data.”
Bruce Schneier, cryptographer and fellow at Harvard Kennedy School · AI and Trust · June 2025

§ 06 / Government Response — NSA, CISA, and the Five Eyes

On May 22, 2025, the National Security Agency Artificial Intelligence Security Center (AISC), CISA, FBI, and cyber agencies from Australia, the United Kingdom, and New Zealand jointly released a Cybersecurity Information Sheet on AI data security. The document identified three primary risk categories for AI systems: data supply chain vulnerabilities, maliciously modified data (including “split-view poisoning” via expired domain purchases), and data drift. It outlined ten cybersecurity best practices specific to AI systems and recommended controls across four phases of the AI lifecycle.

On May 1, 2026, six national cybersecurity agencies — CISA, NSA, and the cyber arms of Australia, Canada, New Zealand, and the United Kingdom — jointly published “Careful Adoption of Agentic AI Services,” defining five categories of agentic AI risk: privilege escalation, design and configuration failures, behavioral misalignment, structural brittleness, and accountability gaps. The document marked the first time Western allied agencies had specifically addressed autonomous AI agents as a distinct threat surface.

On January 8, 2026, the Department of Homeland Security published a Federal Register request for information on security considerations for AI agents — the first formal government inquiry into the governance gap that EchoLeak and the Mercor breach had exposed in production environments. NIST’s Center for AI Standards and Innovation (CAISI) announced the AI Agent Standards Initiative on February 17, 2026, framing it as both a domestic governance response and a geopolitical competition with China over international AI standards-setting.

§ 07 / Expert Reactions — The Structural Problem

The security research community has been largely consistent in its diagnosis: the AI security problem is not primarily a patching problem. It is a structural problem. Traditional enterprise security was built for deterministic systems — software that does what it was programmed to do. LLMs are probabilistic systems whose outputs depend on context, training data, and input in ways that are not fully predictable in advance. The security tooling built for the former does not transfer to the latter.

“AI adoption is greatly outpacing AI security and governance. The paradox is that the same technology helping organizations defend better is also creating new risks faster than organizations can govern them.”
IBM Security report authors · Cost of a Data Breach Report 2025 · July 30, 2025

In April 2025, Google DeepMind introduced the CaMel framework as one proposed architectural response — treating LLMs as untrusted elements within secure infrastructure through a dual-LLM approach with explicit separation between a Privileged LLM and a Quarantined LLM. The framework has not been widely adopted. By 2026, IBM X-Force researchers found that 97 million ChatGPT credentials appeared for sale on the dark web, suggesting that credential compromise — not prompt injection — remains the leading vector for unauthorized AI access in practice.

By 2027, Gartner analysts project that 40% of all AI data breaches will result from cross-border misuse of generative AI — data flowing through AI tools into jurisdictions where it is not legally permitted to be processed, creating compliance exposure alongside the security exposure that has dominated the conversation so far.

§ 08 / Social Reactions

NSA / National Security Agency

@NSAgov · May 22, 2025

NSA's AI Security Center, alongside CISA, FBI, and international partners, released guidance on protecting AI data from supply chain attacks, data poisoning, and unauthorized modification. Securing the data used to train and operate AI systems is foundational — organizations must address AI risks across the full AI lifecycle, not just at deployment.

CISA — Cybersecurity and Infrastructure Security Agency

@CISAgov · May 1, 2026

Six allied cybersecurity agencies — CISA, NSA, ASD, CCCS, GCSB, and NCSC — jointly published 'Careful Adoption of Agentic AI Services.' Agentic AI creates new attack surfaces: privilege escalation, configuration failures, behavioral misalignment, structural brittleness, and accountability gaps. Organizations deploying AI agents must address these before deployment, not after.

IBM Security

@IBMSecurity · July 30, 2025

New IBM Cost of a Data Breach Report 2025: 97% of organizations that experienced an AI-related security incident lacked proper AI access controls. 63% have no AI governance policies at all. Shadow AI adds $670K to the average breach cost. The AI oversight gap is real — and it's measurable.

Tech Executive (paraphrased)

April 2026 · Truth Social

The Mercor breach should be a wake-up call for every company feeding proprietary data to AI training pipelines. You are only as secure as the weakest dependency in the stack — and most teams do not know what that dependency is.

Paraphrase · original post on Truth Social

Enterprise CISO (paraphrased)

March 2026 · Truth Social

We approved five AI tools for enterprise use last quarter. By the time procurement finished, employees were already using twelve others. Shadow AI is not a policy problem anymore. It is a detection problem.

Paraphrase · original post on Truth Social

Bottom Line

The AI security crisis is not a future risk. Mercor lost 4 terabytes of frontier-lab training data through a poisoned Python library. Microsoft’s flagship enterprise AI product was demonstrated to exfiltrate corporate documents with zero user interaction. Samsung leaked semiconductor source code to a consumer chatbot. And 97% of the companies that experienced an AI model breach had no access controls in place when it happened. The attack surface is new. The attackers are not waiting for the governance to catch up.

Sources & Methodology · 18 Sources

IBM Security Newsroom — 13% of Orgs Reported Breaches of AI Models, 97% Lacked AI Access Controls

IBM Think — Cost of a Data Breach Report 2025: Navigating the AI Rush Without Sidelining Security

NSA AISC — Joint Guidance on AI Data Security (May 22, 2025) — with CISA, FBI, ASD, NCSC, GCSB

DOD / NSA — Joint Cybersecurity Information Sheet: AI Data Security (PDF, May 2025)

CISA — Five Eyes Agentic AI Security Guidance: 'Careful Adoption of Agentic AI Services' (May 1, 2026)

Federal Register — CISA/DHS RFI: Security Considerations for Artificial Intelligence Agents (Jan 8, 2026)

arXiv 2509.10540 — EchoLeak: The First Real-World Zero-Click Prompt Injection Exploit in a Production LLM System (Aim Security, Sep 2025)

OWASP GenAI Security Project — LLM01:2025 Prompt Injection (Top 10 for LLM Applications)

TechCrunch — Mercor says it was hit by cyberattack tied to LiteLLM open-source project compromise (March 31, 2026)

Fortune — Mercor, a $10 billion AI startup, confirms it was caught up in a major security incident (April 2, 2026)

TechCrunch — After data breach, $10B-valued startup Mercor is having a month (April 9, 2026)

SOC Prime — CVE-2025-32711: EchoLeak Flaw in Microsoft 365 Copilot Could Enable Zero-Click AI Attack

The Hacker News — Zero-Click AI Vulnerability Exposes Microsoft 365 Copilot Data Without User Interaction (June 2025)

Cybersecurity Dive — Shadow AI increases cost of data breaches, IBM report finds (2025)

CIO Dive — Samsung employees leaked corporate data in ChatGPT: report (2023)

Cybersecurity Dive — Samsung Electronics ChatGPT data privacy incident (2023)

Kiteworks — How Shadow AI Costs Companies $670K Extra: IBM's 2025 Breach Report Analysis

NIST / Cloud Security Alliance — CAISI AI Agent Standards Initiative (Feb 17, 2026)

All breach statistics from IBM Cost of a Data Breach Report 2025 (Ponemon Institute, 600 organizations, March 2024 – Feb 2025). EchoLeak details from arXiv paper 2509.10540 and CVE-2025-32711 advisories. Mercor breach details from TechCrunch (March 31 and April 9, 2026) and Fortune (April 2, 2026). Samsung ChatGPT incident from CIO Dive and Cybersecurity Dive (2023). Government guidance from NSA/CISA/FBI joint release May 22, 2025 and CISA Five Eyes agentic AI guidance May 1, 2026. OWASP Top 10 for LLM Applications 2025 edition.