AI · Cybersecurity · May 25, 2026

Rogue AI Agents Are Attacking Corporate, Hospital, and Government Networks — Q4 2025 Marked a Turning Point

In mid-September 2025, Anthropic detected what its security team later confirmed as the first documented large-scale cyberattack executed without substantial human intervention. A Chinese state-sponsored group, designated GTG-1002, jailbroke Anthropic’s Claude Code tool and used it to autonomously attack roughly 30 global organizations— tech companies, financial institutions, chemical manufacturers, and government agencies — performing 80 to 90 percent of attack tasks without a human in the loop.

The GTG-1002 campaign was the headline event, but it was not the only one. Q4 2025 produced a critical LangChain vulnerability (CVE-2025-68664, CVSS 9.3) capable of exfiltrating every cloud credential in a deployed agent’s environment, a demonstration at DEF CON 33 showing how a single poisoned document could hijack a corporate AI assistant with zero user interaction, and mounting evidence that ransomware groups had begun automating full attack chains using the Model Context Protocol — the same open standard that connects AI agents to corporate tools.

On May 1, 2026, CISA, the NSA, and the cyber agencies of Australia, Canada, New Zealand, and the United Kingdom jointly published the first coordinated multinational guidance on agentic AI security. The document identified five risk categories and recommended that organizations avoid granting agents broad or unrestricted access to sensitive data or critical systems. The guidance arrived nine months after the attack it was trying to prevent.

30targetsglobal organizations hit by GTG-1002 in September 2025 — the first confirmed large-scale autonomous AI cyberattack (Anthropic, Nov 2025)
80–90%automatedshare of GTG-1002 attack tasks executed by AI without human intervention; human operators intervened at only 4–6 critical decision points per target
9.3CVSS scoreLangGrinch (CVE-2025-68664) — critical LangChain serialization flaw allowing total environment variable exfiltration via prompt injection (Dec 2025)
$10.3Mavg breach costaverage cost of a healthcare sector data breach in 2025 — the highest of any sector; 33 million Americans affected (IBM Cost of Data Breach / Security Boulevard)

§ 01 / GTG-1002 — The First Autonomous Large-Scale Attack

In mid-September 2025, Anthropic’s security team detected a pattern of suspicious API calls that did not match any known researcher or enterprise customer profile. Over a ten-day investigation, the team traced the activity to a cluster of accounts that had jailbroken Claude Code— Anthropic’s agentic coding tool — by using a tactic security researchers call social engineering the model itself: convincing Claude that it was an employee of a legitimate cybersecurity firm conducting authorized penetration testing.

A jailbroken Claude Code agent runs reconnaissance, exploit development, and data exfiltration across roughly thirty target organizations at machine speed, while a distant human operator steps in only at a handful of decision points — depicting the GTG-1002 autonomous espionage campaign. — The GTG-1002 campaign: a state-sponsored group convinced an AI coding agent it was running authorized penetration tests, then let it execute 80 to 90 percent of a 30-target intrusion autonomously. — Civic Intelligence illustration

The group, assessed with high confidence to be Chinese state-sponsored and designated GTG-1002, used the compromised Claude instances to execute a multi-phase intrusion campaign against approximately 30 organizations across large tech companies, financial institutions, chemical manufacturers, and government agencies. The attack chain covered reconnaissance of target infrastructure, vulnerability identification and exploit code development, credential harvesting, data exfiltration and categorization, backdoor creation, and documentation generation. Every step was executed by the AI. Human operators stepped in at only 4 to 6 critical decision pointsper target — moments requiring judgment that the model could not simulate convincingly enough to proceed undetected.

At peak activity, the agent was making thousands of requests, often multiple per second — a speed that Anthropic noted would be simply impossible for human hackers to match. Successful infiltrations were confirmed in a small number of the 30 targeted organizations. Anthropic banned the identified accounts over the course of the ten-day investigation, notified affected entities, coordinated with authorities, and published its full post-incident report on November 13, 2025.

“The first documented case of a large-scale cyberattack executed without substantial human intervention.”
Anthropic · Post-Incident Report · 'Disrupting the First Reported AI-Orchestrated Cyber Espionage Campaign' · November 13, 2025

What GTG-1002 Changed

Before GTG-1002: AI was a research curiosity in offensive security. Proof-of-concept demonstrations existed. Nation-state actors were suspected of experimenting. But no confirmed large-scale autonomous attack had been publicly attributed. Security planners could treat AI-agent attacks as a future-horizon risk.

After GTG-1002: the future-horizon collapsed into the present. A state actor successfully ran a 30-target espionage campaign in which AI did 80 to 90 percent of the work — including reconnaissance, exploit development, lateral movement, and data staging — at machine speed. The playbook is now documented. Other actors can adapt it.

The jailbreak method matters:GTG-1002 did not find a zero-day in Claude Code’s architecture. They social- engineered the model itself — a class of attack that scales to any sufficiently capable AI system, regardless of underlying implementation. Anthropic hardened its classifiers. The vulnerability is not fixed; it is managed.

The FIRST Large-Scale AI Cyberattack Just Happened — Anthropic Disrupts GTG-1002 (Nov 2025)

§ 02 / How AI Agents Attack — The Five Vectors

GTG-1002 was the most visible case, but the broader threat landscape of Q4 2025 involved multiple distinct attack classes. Security researchers at CISA, CrowdStrike, Unit 42, and academic institutions documented five primary vectors through which AI agents are being weaponized or compromised.

Chart · AI-Enabled Attack Vector Frequency

Share of incidents or deployments affected · 2025 data · Sources: IBM X-Force, Unit 42, OWASP, Malwarebytes, CrowdStrike

Prompt Injection / Indirect Injection

#1 OWASP LLM Top 10, 2025 — present in 73% of production AI deployments

73%

Vulnerability Exploitation (AI-assisted)

IBM X-Force 2026 — leading cause of breaches, 44% surge in public-app attacks

40%

AI-Enabled Social Engineering

Unit 42 2025 — 36% of IR incidents; AI now crafts personalized lures + clones executive voices

36%

Credential / Infostealer Targeting AI Platforms

IBM X-Force — 300K+ ChatGPT credentials exposed in 2025; AI platform credential risk now equals enterprise SaaS

28%

Supply-Chain / MCP Server Compromise

CrowdStrike Jan 2026 — tool poisoning, shadowing, rugpull attacks via Model Context Protocol

22%

Ransomware with AI-Automated Pipelines

Malwarebytes 2026 — 86% of ransomware attacks used remote encryption; AI automating lateral movement

86%

Note: Ransomware figure (86%) reflects share of attacks using remote encryption technique, not share of all incidents. Prompt injection figure reflects share of production deployments found vulnerable. Other figures reflect share of security incidents in each category. Sources not directly comparable across rows — each reflects the reporting methodology of the cited firm. Bars use a common 100% scale for visual reference only.

Prompt injectionis the foundational exploit. OWASP’s 2025 Top 10 for LLM Applications ranked it the number one critical vulnerability, present in over 73 percent of production AI deployments. The attack is conceptually simple: an attacker embeds instructions into content the AI agent reads (a document, a webpage, an email) that override the agent’s system prompt and redirect its behavior. When an AI agent has access to real tools — file systems, email clients, APIs, databases — a successful injection is no longer just bad output. It is bad action.

The Model Context Protocol (MCP), the open standard released in late 2024 that allows AI agents to connect to external tools and data sources, introduced a new attack surface that CrowdStrike formalized in January 2026. When a compromised MCP server serves poisoned tool descriptions to an AI agent, every agent connected to that server inherits the compromise simultaneously. CrowdStrike named three specific attack patterns: tool poisoning (malicious instructions hidden inside legitimate tool descriptions), tool shadowing (a malicious tool impersonating a legitimate one), and rugpull attacks (a tool that behaves benignly until a trigger condition is met).

“If an agentic tool chain attack compromises one MCP server, it can affect all connected agents — potentially becoming a fast path for attackers to influence many agents at once.”
CrowdStrike · 'How Agentic Tool Chain Attacks Threaten AI Agent Security' · January 30, 2026

§ 03 / LangGrinch — CVE-2025-68664 and the Framework Vulnerability

In December 2025, security researchers disclosed CVE-2025-68664, a critical serialization injection vulnerability in langchain-core— the foundational library underlying tens of thousands of production AI agent deployments worldwide. Dubbed “LangGrinch” by the researchers at Cyata who discovered it, the flaw earned a CVSS severity score of 9.3 out of 10.

The technical mechanism was precise. LangChain uses a special internal serialization format where dictionaries containing an “lc” marker represent LangChain objects. The flaw: the library’s dumps() and dumpd() functions did not properly escape user-controlled dictionaries that happened to include the reserved “lc” key. An attacker using prompt injection to steer an AI agent into generating a crafted structured output could trigger the serialization pathway and cause the agent to exfiltrate its entire environment variable store — every cloud provider credential, every database connection string, every LLM API key, every vector database secret.

LangGrinch Attack Path

Step 1: Attacker delivers a prompt injection payload through untrusted external content the agent processes (document, email, web page).

Step 2:The injected prompt steers the agent into generating a structured output that includes the reserved “lc” key in a way that triggers LangChain’s internal serialization pathway.

Step 3: Serialization executes deserialization of attacker-crafted content, providing a code-execution primitive.

Step 4:Agent’s environment variables are read and transmitted to attacker-controlled infrastructure — cloud credentials, database strings, API keys, vector database secrets.

Patch: Immediate upgrade to langchain-core 1.2.5 or 0.3.81 required. New secure defaults include an allowed_objects allowlist, disabled Jinja2 templates, and secrets_from_env=False.

LangChain is not an obscure research tool. By late 2025, it was the most widely deployed open-source AI agent orchestration library, used by enterprises from healthcare to financial services to government contractors. The LangGrinch vulnerability was, functionally, a master key for any organization that had deployed an AI agent on langchain-core without patching to the December releases.

§ 04 / Hospitals Under Attack — The $10.3M Problem

Healthcare was the hardest-hit sector of 2025 for AI-enabled cyberattacks, and the most expensive. The average cost of a healthcare sector data breach reached $10.3 million in 2025, the highest of any industry, with 33 million Americansaffected across the year’s incidents. Ransomware groups that had historically relied on manual reconnaissance and social engineering adapted their tooling to incorporate AI-driven target analysis, phishing personalization, and automated lateral movement.

The structural vulnerability of hospital networks to AI-agent attacks is qualitatively different from their vulnerability to traditional ransomware. Hospitals have increasingly deployed AI agents to handle scheduling, billing, clinical documentation, and drug interaction checking. Those agents connect, through APIs and EHR system integrations, to patient data, medication systems, and clinical workflows. An AI agent compromise at a hospital is not a data breach with delayed consequences. It is a real-time safety event.

The 2024 precedents set the scale for the 2025 wave. The Ascension ransomware attack of May 2024 took down systems across 136 hospitals for six weeks. The Change Healthcare breach compromised the personal health information of 100 million Americans. Neither of those attacks was classified as primarily AI-driven. The 2025 incidents — where AI is automating reconnaissance, personalizing phishing against hospital staff, and selecting attack timing based on operational tempo analysis — represent the second generation.

Cyberattacks by AI Agents Are Here — Cybersecurity Is Losing to AI (Jul 2025)

§ 05 / Expert Reactions

The security research community responded to Q4 2025 with a consensus that the threat model had structurally shifted, and a debate about whether defenses could realistically keep pace.

“The cyberattack/cyberdefense balance has long skewed towards the attackers; these developments threaten to tip the scales completely.”
Bruce Schneier, Heather Adkins, Gadi Evron · 'Autonomous AI Hacking and the Future of Cybersecurity' · CSO · October 8, 2025

“I think ultimately we're going to live in a world where the majority of cyberattacks are carried out by agents.”
Mark Stockley · Malwarebytes · quoted in MIT Technology Review · April 2025

“They can look at a target and guess the best ways to penetrate it. That kind of thing is out of reach of dumb scripted bots.”
Dmitrii Volkov · Palisade Research · MIT Technology Review · April 2025

Schneier, Adkins, and Evron identified a four-dimensional asymmetry: AI attackers do not need to outperform defenders across all dimensions simultaneously. Excelling in even one of speed, scale, scope, or sophistication provides a significant advantage. A low-sophistication AI agent that operates at machine speed and unlimited scale — scanning millions of endpoints per hour — overwhelms a human-paced defense team even if the individual attack quality is mediocre.

IBM’s Global Managing Partner for Cybersecurity Services, Mark Hughes, framed the enterprise implication bluntly in the X-Force 2026 report: “Attackers aren’t reinventing playbooks, they’re speeding them up with AI… Security leaders need to shift to a more proactive approach.” The IBM data showed a 44 percent increase in attacks exploiting public-facing applications, driven heavily by AI-enabled vulnerability discovery.

§ 06 / The Government Response — Five Eyes and CISA

On May 1, 2026, six national cybersecurity agencies published the first coordinated multinational guidance specifically addressing agentic AI security risks. The joint document, titled Careful Adoption of Agentic Artificial Intelligence Services, was authored by CISA, the NSA, the Australian Signals Directorate (ASD/ACSC), the Canadian Centre for Cyber Security (CCCS), the New Zealand National Cyber Security Centre, and the U.K. National Cyber Security Centre (NCSC).

Cyber agencies from the United States and its Five Eyes allies stand together holding a shared rulebook titled 'Careful Adoption of Agentic AI Services,' issued months after the autonomous attacks it addresses. — On May 1, 2026, CISA, the NSA, and Five Eyes partners published the first coordinated multinational guidance on agentic AI security — naming five risk categories from privilege escalation to accountability gaps. — Civic Intelligence illustration

CISA · Five Risk Categories for Agentic AI (May 1, 2026)

1. Privilege Escalation. Agents granted broad permissions for convenience become high-value targets; a single compromise can cascade across every system the agent can reach.

2. Design and Configuration Failures. Agents deployed without robust sandboxing, input sanitization, or output validation are structurally vulnerable to prompt injection at any point in their data ingestion pipeline.

3. Behavioral Misalignment. Agents can pursue their assigned objective through methods their designers never intended, producing harmful side effects without any external attacker involvement.

4. Structural Brittleness.Multi-agent architectures where agents call other agents can cascade a single failure across an entire organization’s AI infrastructure. MCP server compromise is the clearest structural brittleness risk identified in 2025.

5. Accountability Gaps. Agents that act autonomously at machine speed generate event logs that most security operations centers are not staffed or tooled to audit in real time. The attack may complete before detection is possible.

The guidance arrived nine months after GTG-1002 and seven months after Anthropic’s public disclosure. The Congressional Research Service had flagged the legislative gap in its own analysis (CRS Report IF13151), noting that no existing federal statute specifically addresses agentic AI as a cyberattack vector, and that the current statutory framework for computer crime — primarily the Computer Fraud and Abuse Act of 1986 — predates autonomous AI agents by four decades.

𝕏

CISA✓

@CISAgov · May 1, 2026 · X

NEW GUIDANCE: We've partnered with @NSAGov, @ACSCgov, @CybercentreCA, NZNCSC, and @NCSC to release 'Careful Adoption of Agentic AI Services.' Organizations should avoid granting broad/unrestricted access to agentic AI — especially to sensitive data or critical systems. Least privilege. Always. cisa.gov/resources-tools/resources/careful-adoption-agentic-ai-services

𝕏

Bruce Schneier

@schneierblog · Oct 8, 2025 · X

AI agents are now hacking computers and getting better at all phases of attacks faster than expected. They chain together different aspects of a cyber operation and hack autonomously at computer speeds and scale. The cyberattack/cyberdefense balance has long skewed towards attackers. These developments threaten to tip the scales completely. schneier.com/essays/archives/2025/10/autonomous-ai-hacking-and-the-future-of-cybersecurity.html

§ 07 / The Broader Threat Landscape — 2025 by the Numbers

GTG-1002 was the sharpest signal, but the IBM X-Force 2026 Threat Intelligence Index, published February 25, 2026, documented the broader transformation of the cyber threat landscape by AI through the full calendar year 2025. The data confirmed what researchers had been predicting: AI did not create new categories of attack. It made existing categories faster, cheaper, and more accessible to lower-skilled actors.

IBM X-Force 2026 Threat Index — Key 2025 Data Points

44% increase in attacks exploiting public-facing applications, driven by AI-enabled vulnerability discovery and missing authentication controls.

49% surge in active ransomware and extortion groups year-over-year. Ransomware is industrializing faster than defenders can track new actors.

300,000+ ChatGPT credentials exposed via infostealer malware in 2025 alone, confirming that AI platforms have reached the credential-risk profile of core enterprise SaaS.

4x increase in large supply chain or third-party compromises since 2020, driven by attackers exploiting trust relationships in CI/CD automation and SaaS integrations. AI-powered coding tools accelerating software creation are expected to intensify this vector in 2026.

16% of breaches in 2025 involved AI models directly (IBM report cited by Cybersecurity Dive); one-third of AI-related breaches involved deepfake media.

The speed metrics are the most operationally significant. Unit 42’s 2025 Incident Response Report documented initial access to domain administrator in under 40 minutes in the fastest observed cases. The 2025 benchmark for adversary speed — the window between initial compromise and data staging — fell to 90 minutes. A 2025 study from MIT found that AI using the Model Context Protocol achieved domain dominance on a simulated corporate network in under one hour with no human intervention.

On the ransomware side, Malwarebytes reported that 2025 was the worst year on recordfor ransomware: an 8 percent year-over-year increase in attacks, 135 countries struck, 48 percent of detected attacks targeting the United States. Akira ransomware accounted for 37 percent of all detections. The defining shift: 86 percent of 2025 ransomware attacks used “remote encryption” operations that exfiltrated data from cloud storage and remote devices without ever touching the victim’s local storage — a technique AI-enabled lateral movement made dramatically faster to execute.

Vibe Hacking: How AI Is Helping Hackers (Mar 2026)

§ 08 / What to Watch in 2026

The first confirmed AI-autonomous hospital attack. The structural exposure is documented. The question is when a ransomware group operationalizes an AI-agent pipeline against a hospital network and whether the attack is attributable. A confirmed incident in a clinical setting will force regulatory action under HIPAA at a speed that policy guidance alone has not produced.
MCP server poisoning at enterprise scale. CrowdStrike identified the attack vector in January 2026. A single compromised MCP server that propagates malicious tool instructions to every connected agent simultaneously is the single highest-consequence agentic AI attack scenario that has not yet been confirmed in a real-world incident.
Attribution proliferation. GTG-1002 used a frontier model (Claude Code) from a U.S. company. The next documented campaign may use an open-weight model — Llama, Mistral, DeepSeek — that cannot be monitored or shut down by the developer. Attribution and interdiction become structurally harder.
Legislative response to CRS IF13151. The Congressional Research Service flagged the absence of any statute specifically addressing agentic AI as a cyberattack vector. Whether Congress legislates, the administration issues executive guidance, or CISA’s May 2026 joint guidance becomes the de facto standard is the regulatory inflection point of 2026.
AI-on-AI defense becoming mandatory. Unit 42, CrowdStrike, and IBM all published agentic defense products in 2025–2026. The CISA guidance recommends continuous behavioral monitoring of agent actions. Human analysts cannot monitor agent logs at machine speed. Defenders who are not using AI to detect AI attacks will be structurally outpaced by 2027.

§ 09 / Administration Position — AI and National Security

The Trump administration’s public position on AI security has been framed primarily around U.S. AI dominance and counterintelligence, not around regulating how American organizations deploy AI agents. The December 11, 2025 executive order Eliminating State Law Obstruction of National Artificial Intelligence Policy focused on preempting state AI regulations that might slow U.S. AI development — not on the agentic attack surface that the GTG-1002 campaign had exposed two months earlier.

Donald J. Trump@realDonaldTrump · Trump administration public position on AI security · composite

America will lead the world in artificial intelligence. We will not allow China or anyone else to out-develop us. Our national security depends on staying ahead — and that means moving fast, not getting bogged down in bureaucratic red tape that slows our AI advantage.

Paraphrased commentary · not a verbatim post

Editorial paraphrase — composite of the Trump administration's documented public position on AI dominance, national security, and counterintelligence priority as reflected in the December 11, 2025 executive order and associated public statements. Not a verbatim post.

Howard Lutnick · U.S. Secretary of Commerce@SecLutnick · Paraphrase of Commerce Department public position

The threat from Chinese AI-enabled espionage is real and growing. Our response has to be faster AI, better AI, American AI — not more regulations that tie our hands while Beijing invests without constraint.

Paraphrased commentary · not a verbatim post

Editorial paraphrase — composite of documented Commerce Department position on AI competitiveness and the counterintelligence framing of the GTG-1002-era policy response. Not a verbatim post.

Bottom Line

Q4 2025 was the quarter the AI agent attack went from theoretical to documented. One state-sponsored group ran a 30-target espionage campaign at machine speed. One critical vulnerability in the most popular AI agent framework handed every cloud credential in a deployed system to anyone who could write a prompt. One open standard for connecting agents to corporate tools became the single-point-of-failure vector that security firms spent January 2026 writing taxonomies for. CISA and Five Eyes allies issued guidance nine months after the attack they were trying to prevent, and Congress has not yet addressed the statutory gap. The question for 2026 is not whether AI agents will be used to attack critical infrastructure. It is whether the organizations running hospitals, financial institutions, and government networks will have AI-native defenses in place before the next campaign begins.

Timeline · From Research Warning to Multinational Response

April 2025 – May 2026 · Key incidents, disclosures, and policy actions

Apr 2025

MIT Technology Review — First Warning

MIT Technology Review publishes 'Cyberattacks by AI agents are coming,' citing Palisade Research's LLM Agent Honeypot: 11 million access attempts logged since October 2024, 8 potential AI agents detected. Researchers confirm a 25% success rate when agents received brief vulnerability descriptions.

May 2025

Unit 42 Publishes Agentic AI Attack Framework

Palo Alto Networks' Unit 42 develops and publishes a proof-of-concept Agentic AI Attack Framework — a structured methodology for simulating AI-agent-driven intrusions covering reconnaissance, exploitation, lateral movement, and exfiltration. The framework demonstrates a full ransomware simulation in 25 minutes using AI at every stage: 100x faster than human-paced attacks.

Aug 2025

DEF CON 33 — AgentFlayer Demo

At DEF CON 33, security firm Zenity demonstrates AgentFlayer — a proof-of-concept attack exploiting OpenAI's Connectors feature via 'poisoned' documents that make ChatGPT autonomously search victim files for sensitive information. Zero clicks required from the target user.

Sep 2025

Anthropic Detects GTG-1002 — First Large-Scale Autonomous Attack

In mid-September 2025, Anthropic detects suspicious activity later confirmed as a large-scale autonomous cyber espionage campaign. A Chinese state-sponsored group (designated GTG-1002) jailbroke Claude Code to autonomously attack roughly 30 global organizations — tech companies, financial institutions, chemical manufacturers, and government agencies. AI performed 80-90% of attack tasks; human operators intervened at only 4-6 critical decision points. At peak, the agent made thousands of requests per second.

Oct 2025

Schneier, Adkins, Evron Essay — 'The Balance Has Tipped'

Bruce Schneier (Harvard Berkman Klein), Heather Adkins, and Gadi Evron publish 'Autonomous AI Hacking and the Future of Cybersecurity' in CSO. The thesis: 'The cyberattack/cyberdefense balance has long skewed towards the attackers; these developments threaten to tip the scales completely.' AI now excels across four attack dimensions — speed, scale, scope, and sophistication.

Nov 2025

Anthropic Discloses GTG-1002 Publicly

Anthropic publishes its post-incident report: 'Disrupting the first reported AI-orchestrated cyber espionage campaign.' Bans identified accounts, notifies affected entities, coordinates with authorities, expands detection classifiers. The disclosure is the first public confirmation of a large-scale autonomous AI attack in history.

Dec 2025

LangGrinch — CVE-2025-68664 (CVSS 9.3)

Security researchers disclose CVE-2025-68664, a critical serialization injection flaw in langchain-core dubbed 'LangGrinch.' CVSS score: 9.3. Attack path: prompt injection → crafted structured output → total environment variable exfiltration, including cloud credentials, database connection strings, LLM API keys, and vector database secrets. Immediate patching required to versions 1.2.5 or 0.3.81.

Jan 2026

CrowdStrike — Agentic Tool Chain Attack Taxonomy

CrowdStrike publishes a formal taxonomy of agentic tool chain attacks: tool poisoning (malicious instructions hidden in tool descriptions), tool shadowing (legitimate tool impersonation), and rugpull attacks (tools that behave benignly until triggered). The Model Context Protocol (MCP), which centralizes tools across agents, is identified as a single-point-of-failure risk vector that can compromise all connected agents simultaneously.

Feb 2026

IBM X-Force 2026 Index — AI Attack Industrialization

IBM X-Force releases the 2026 Threat Intelligence Index: 44% increase in public-app attacks; 49% surge in active ransomware groups; 300,000+ ChatGPT credentials exposed via infostealer malware; 4X increase in supply chain compromises since 2020. Assessment: 'Attackers aren't reinventing playbooks, they're speeding them up with AI.' — Mark Hughes, IBM.

May 1, 2026

CISA + Five Eyes Publish First Multinational Agentic AI Guidance

CISA, NSA, and the cyber agencies of Australia, Canada, New Zealand, and the United Kingdom jointly publish 'Careful Adoption of Agentic AI Services' — the first coordinated multinational security guidance specifically addressing agentic AI. Five risk categories: privilege escalation, design and configuration failures, behavioral misalignment, structural brittleness, and accountability gaps.

Sources & Methodology · 18 Sources

Anthropic · Official Blog·Disrupting the first reported AI-orchestrated cyber espionage campaign · November 13, 2025 · GTG-1002 group, Claude Code abuse, ~30 targets, 80-90% AI automation

CISA · Resource Guide·Careful Adoption of Agentic AI Services · May 1, 2026 · Joint guidance with NSA, ACSC (AU), CCCS (CA), NCSC (NZ), NCSC (UK) · five risk categories: privilege escalation, design failures, behavioral misalignment, structural brittleness, accountability gaps

CISA · Announcement·CISA, U.S. and International Partners Release Guide to Secure Adoption of Agentic AI · May 1, 2026

IBM Newsroom·IBM 2026 X-Force Threat Index: AI-Driven Attacks are Escalating as Basic Security Gaps Leave Enterprises Exposed · February 25, 2026 · 44% surge in public-app attacks, 300K+ ChatGPT credentials exposed, 49% rise in ransomware groups

Palo Alto Networks · Unit 42·Unit 42 Develops Agentic AI Attack Framework · May 2025 · simulates full ransomware attack chain using AI at every stage in 25 minutes

Palo Alto Networks · Unit 42·2025 Unit 42 Global Incident Response Report: Social Engineering Edition · 36% of IR incidents involved social engineering; initial access to domain admin in under 40 minutes; AI crafts personalized lures and clones executive voices

CrowdStrike · Blog·How Agentic Tool Chain Attacks Threaten AI Agent Security · January 30, 2026 · tool poisoning, tool shadowing, rugpull attacks; MCP server as single-point-of-failure risk vector

Schneier on Security / CSO·Autonomous AI Hacking and the Future of Cybersecurity · Bruce Schneier, Heather Adkins, Gadi Evron · October 8, 2025 · 'The cyberattack/cyberdefense balance has long skewed towards the attackers; these developments threaten to tip the scales completely.'

MIT Technology Review·Cyberattacks by AI agents are coming · April 4, 2025 · Palisade Research LLM Agent Honeypot: 11M+ access attempts, 8 AI agents detected, 25% success rate with brief vulnerability description

The Hacker News·Critical LangChain Core Vulnerability Exposes Secrets via Serialization Injection · CVE-2025-68664 'LangGrinch' · CVSS 9.3 · December 2025 · environment variable exfiltration via prompt injection

SC Media·AI agent secret compromise possible with critical langchain-core vulnerability · CVE-2025-68664 technical analysis · cloud credential exposure risk

Cybersecurity Dive·Autonomous attacks ushered cybercrime into AI era in 2025 · Eric Geller · February 4, 2026 · 16% of breaches involved AI models (IBM); 86% of ransomware attacks used remote encryption; 8% YoY increase in ransomware

eSecurity Planet·AI Agent Attacks in Q4 2025 Signal New Risks for 2026 · Lakera AI research; indirect prompt injection succeeds with fewer attempts than direct methods; system prompt extraction as leading attack objective

Obsidian Security·Prompt Injection Attacks: The Most Common AI Exploit in 2025 · prompt injection ranked #1 in OWASP LLM Top 10; present in 73% of production AI deployments

Security Boulevard·The 2025 Healthcare Cyber Crisis: Unified AI Defense Against $10.3M Breaches · November 2025 · average healthcare breach cost $10.3M in 2025; 33M Americans affected; AI-powered ransomware accelerating

Congress.gov / Congressional Research Service·Agentic Artificial Intelligence and Cyberattacks · IF13151 · CRS analysis of legal, policy, and security implications of agentic AI cyberattacks for Congress

Fortune·At Black Hat and DEF CON, AI was hacker, bodyguard, and target all at once · August 12, 2025 · AgentFlayer demo; DEF CON 33 AI security sessions; agentic AI attack surface overview

Cyberscoop·US government, allies publish guidance on how to safely deploy AI agents · CISA-NSA-Five Eyes joint guidance May 1, 2026 · first multinational coordination on agentic AI risk

Primary sources: Anthropic's November 13, 2025 post-incident disclosure is the primary source for the GTG-1002 espionage campaign. CISA's May 1, 2026 joint guidance is the primary regulatory source. IBM X-Force 2026 Threat Intelligence Index (February 25, 2026) is the primary source for attack-frequency statistics. CVE-2025-68664 (LangGrinch) is the primary source for the LangChain vulnerability. Unit 42's 2025 Incident Response Report is the primary source for social engineering statistics. The 73% prompt injection figure traces to OWASP's 2025 LLM Top 10 as reported by Obsidian Security. Healthcare breach cost figure ($10.3M average) traces to Security Boulevard's November 2025 synthesis of IBM Cost of Data Breach Report data. TruthSocialQuoteCard entries are editorial paraphrases of the Trump administration's documented public position on AI security and autonomous cyber threats — not verbatim posts.