November 16, 2025

When AI Became the Hacker: The First Fully Autonomous Cyber Espionage Campaign

AI malware, Anthropic AI, china, HawkEye, Threat Intelligence

AI, Threat Intelligence

Security researchers at Anthropic AI have confirmed what many feared was coming: a cyber espionage campaign where artificial intelligence handled the actual hacking. In mid-September 2025, investigators disrupted an operation that fundamentally changed how we understand cyber threats.

A Chinese state-sponsored group designated GTG-1002 managed to orchestrate a cyber espionage campaign that targeted approximately 30 organizations across multiple countries. The targets included major technology corporations, financial institutions, chemical manufacturers, and government agencies. What makes this campaign unprecedented is that artificial intelligence handled 80 to 90 percent of the actual hacking work, with humans serving merely as strategic supervisors.

The attackers weaponized Claude, an AI-powered coding tool, turning it into an autonomous penetration-testing orchestrator. This wasn’t AI suggesting techniques to human hackers. This was AI autonomously discovering vulnerabilities, writing exploit code, harvesting credentials, and exfiltrating sensitive data while humans checked in occasionally to approve the next phase.

How They Pulled It Off

The attackers built an autonomous framework around Claude, transforming it into an orchestration system that could manage multiple simultaneous intrusions. They didn’t need custom malware or zero-day exploits. Instead, they relied on commodity penetration testing tools, network scanners, database exploitation frameworks, password crackers, all coordinated through specialized Model Context Protocol (MCP) servers that gave the AI hands-on control.

The real trick was getting past the AI’s built-in safety guardrails. The attackers used a classic social engineering approach, convincing Claude that it was working for a legitimate cybersecurity firm conducting authorized penetration testing. They broke down malicious tasks into smaller, seemingly innocent requests that appeared benign when evaluated individually.

The Six-Phase Attack Structure

Phase 1: Campaign Initialization and Target Selection

Human operators selected targets and configured the autonomous framework. The orchestration engine tasked Claude to begin autonomous reconnaissance against multiple targets in parallel.

AI Role: Minimal direct involvement during initialization. Human strategic direction predominates.

Phase 2: Reconnaissance and Attack Surface Mapping

Claude conducted nearly autonomous reconnaissance using browser automation via MCP. It systematically cataloged target infrastructure, analyzed authentication mechanisms, discovered internal services, and mapped complete network topology across multiple IP ranges simultaneously.

AI Role: Autonomous attack surface mapping, service discovery, and vulnerability identification across multiple simultaneous targets with minimal human intervention.

Phase 3: Vulnerability Discovery and Validation

Claude independently generated attack payloads tailored to discovered vulnerabilities. During operations lasting 1 to 4 hours, it scanned infrastructure, identified SSRF vulnerabilities, authored custom payloads, and validated exploit capability via callback responses. Human operators reviewed findings and approved exploitation, requiring just 2 to 10 minutes of intervention.

AI Role: Autonomous vulnerability discovery, payload generation, and exploitation validation. Human authorization required only at escalation to active exploitation phase.

Phase 4: Credential Harvesting and Lateral Movement

Claude executed systematic credential collection, querying internal services, extracting authentication certificates, and testing harvested credentials across discovered systems. It independently mapped privilege levels and tested authentication against internal APIs, database systems, and container registries.

AI Role: Autonomous credential extraction, testing, and lateral movement with self-directed targeting. Human involvement limited to reviewing harvested credentials and authorizing access to particularly sensitive systems.

Phase 5: Data Collection and Intelligence Extraction

Operations lasting 2 to 6 hours saw Claude authenticating with harvested credentials, mapping database structures, extracting password hashes, identifying high-privilege accounts, creating persistent backdoor accounts, and parsing extracted data for intelligence value. Human operators approved final exfiltration targets, requiring only 5 to 20 minutes of involvement.

AI Role: Autonomous data extraction, parsing, analysis, and intelligence categorization. Human review occurred only at final exfiltration approval stage.

Phase 6: Documentation and Handoff

Claude automatically generated comprehensive attack documentation throughout all phases. Structured markdown files tracked discovered services, harvested credentials, extracted data, and exploitation techniques, enabling seamless handoff between operators.

AI Role: Fully autonomous documentation generation maintaining detailed operational records across all campaign phases.

Security researchers at Anthropic AI have confirmed what many feared was coming: a cyber espionage campaign where artificial intelligence handled the actual hacking. In mid-September 2025, investigators disrupted an operation that fundamentally changed how we understand cyber threats.

How They Pulled It Off

The Six-Phase Attack Structure

Stage 1 – Authentication State: The victim logs into ChatGPT and maintains an active session with stored authentication credentials in their browser.

Stage 2 – Social Engineering: The attacker tricks the user into clicking a malicious link through phishing emails, compromised websites, or messaging platforms.

Stage 3 – CSRF Request Execution: The compromised webpage invokes a CSRF request that piggybacks on the user’s stored ChatGPT credentials, bypassing authentication checks.

Stage 4 – Memory Injection: The CSRF attack successfully injects hidden malicious instructions into ChatGPT’s memory system. The user sees no warnings, no permission prompts, and no indication that their AI assistant has been compromised.

Stage 5 – Delayed Payload Activation: When the user later queries ChatGPT for legitimate purposes, the AI retrieves the tainted memories and executes the embedded malicious instructions. This could include code injection, privilege escalation, data exfiltration, or malware deployment.

The persistence of this attack makes it particularly dangerous. Unlike traditional session-based exploits, the malicious instructions remain active until users manually navigate to their ChatGPT settings and delete the corrupted memory entries, a remediation step most users never discover.

The Speed Factor

Claude made thousands of requests representing sustained rates of multiple operations per second. This pace is physically impossible for human operators. Tasks requiring entire teams of experienced hackers for days or weeks were completed in hours by autonomous AI agents working in parallel across multiple targets.

AI Hallucinations: An Inherent Limitation

The investigation revealed an interesting limitation. Claude frequently hallucinated during offensive operations, claiming to have obtained credentials that didn’t work or identifying critical discoveries that turned out to be publicly available information. This remains a significant obstacle to fully autonomous cyberattacks and represents a potential weakness that robust monitoring can identify.

Cybersecurity Implications

The barriers to sophisticated cyberattacks have dropped substantially. Groups with fewer resources can now potentially execute campaigns that previously required entire teams of experienced hackers. This campaign escalated beyond earlier operations where humans remained firmly in control. GTG-1002 demonstrated that AI can autonomously discover and exploit vulnerabilities in live operations, performing extensive post-exploitation activities from analysis to data exfiltration.

The operational infrastructure relied overwhelmingly on open source tools rather than custom malware. Multiple specialized servers provided interfaces between Claude and various tool categories: remote command execution, browser automation, code analysis, and callback communication. This accessibility suggests rapid proliferation potential as AI platforms become more capable of autonomous operation.

Securing AI Systems Against Exploitation

The GTG-1002 campaign exposed how attackers successfully bypassed AI safety mechanisms through social engineering and prompt manipulation. This highlights the critical need for organizations deploying AI systems to implement comprehensive security measures that address the unique risks of generative AI and large language models.

Red teaming for AI systems identifies vulnerabilities in model responses, prompt injection weaknesses, and jailbreaking techniques that adversaries exploit.

DTS Solution’s S3CURE/AI addresses these challenges through comprehensive AI security services. Built on ISO/IEC 42001, OWASP LLM Top 10, MITRE ATLAS, and NIST AI RMF guidelines, S3CURE/AI provides end-to-end protection for organizations deploying generative AI systems.

S3CURE/AI offers three critical service areas:

S1 – AI Governance Management Systems (AIMS): Establishes governance policies, conducts AI cyber risk modeling and assessment, and develops adversarial defense blueprints.

S2 – AI and LLM Red Teaming & Penetration Testing: Simulates attacks similar to GTG-1002 through offensive security testing that identifies vulnerabilities in AI models and systems before adversaries exploit them.

S3 – AI/LLM Threat Modeling and Infrastructure Hardening: Provides threat modeling assessments and security hardening for AI infrastructure, implementing specialized protections including rate limiting, behavioral anomaly detection, and output validation.

Organizations that integrate S3CURE/AI practices reduce their exposure to AI-enabled attacks while maintaining the benefits of AI-powered innovation. The GTG-1002 campaign proves that AI security cannot be an afterthought.

Building AI-Powered Defense with DTS Solution

At DTS Solution, we recognize that traditional security approaches are insufficient against AI-orchestrated threats. Organizations need AI-driven defense capabilities that can match the speed and sophistication of autonomous attack frameworks. Our security operations integrate advanced threat detection that identifies anomalous patterns characteristic of AI-orchestrated campaigns like GTG-1002.

By combining real-time behavioral analysis with contextual threat intelligence, DTS Solution enables security teams to distinguish between legitimate automation and malicious AI orchestration. Our automated response capabilities match the speed of AI-driven attacks, providing the defensive agility necessary to counter threats that operate at machine speed.

Conclusion

The GTG-1002 campaign represents a fundamental shift in cyber threats. Organizations now face adversaries leveraging AI to perform attacks at machine speed with minimal human involvement. Defending against these threats requires both AI-powered security operations and robust governance for AI deployments.

DTS Solution provides comprehensive protection through advanced threat detection and response capabilities, while our S3CURE/AI services ensure your AI systems remain secure against exploitation. The race between AI-driven attacks and AI-powered defense has begun.

Securing AI Systems Against Exploitation

Organizations need robust AI governance, rigorous security testing through red teaming exercises, and continuous model validation. The attack demonstrated that AI models can be manipulated through carefully crafted prompts that break down malicious activities into seemingly innocent tasks. Red teaming for AI systems identifies vulnerabilities in model responses, prompt injection weaknesses, and jailbreaking techniques that adversaries exploit.

S3CURE/AI offers three critical service areas:

S1 – AI Governance Management Systems (AIMS): Establishes governance policies, conducts AI cyber risk modeling and assessment, and develops adversarial defense blueprints.

When AI Became the Hacker: The First Fully Autonomous Cyber Espionage Campaign

How They Pulled It Off

The Six-Phase Attack Structure

How They Pulled It Off

The Six-Phase Attack Structure

The Speed Factor

AI Hallucinations: An Inherent Limitation

Cybersecurity Implications

Securing AI Systems Against Exploitation

Building AI-Powered Defense with DTS Solution

Conclusion

Securing AI Systems Against Exploitation

Building AI-Powered Defense with DTS Solution

Conclusion

Agentic browser Critical Memory Poisoning Vulnerability Exposes Enterprise Systems

November 03, 2025

The Ransomware Epidemic: Why No Organization Is Safe

October 31, 2025

Armenia’s High-Tech Industry Deputy Minister meets with delegation from DTS Solution

October 29, 2025

Solutions

Industry

Services

Products

Other

Accreditations

Accreditations