Agent Security | AI Red Team

Microsoft Security Blog March 12, 2026 guide

Detecting and analyzing prompt abuse in AI tools

Microsoft Incident Response walks through how to detect prompt abuse operationally, tying prompt injection risk back to logging, telemetry, and incident response workflows.

Prompt Injection Agent Security

Read summary Source link

OpenAI March 11, 2026 analysis

Designing AI agents to resist prompt injection

OpenAI frames prompt injection as an evolving agent-security problem that increasingly resembles social engineering rather than a simple string-matching issue.

Prompt Injection Agent Security

Read summary Source link

MITRE Center for Threat-Informed Defense February 9, 2026 framework

MITRE ATLAS OpenClaw Investigation Discovers New and Likeliest Techniques

MITRE maps incident patterns in an open-source agentic ecosystem to ATLAS techniques, showing how AI-first systems create distinct execution paths for attackers.

Agent Security AI Red Teaming

Read summary Source link

OpenAI December 22, 2025 analysis

Continuously hardening ChatGPT Atlas against prompt injection attacks

OpenAI describes using automated red teaming and reinforcement learning to discover agent prompt injection attacks before they appear in the wild.

Prompt Injection Agent Security AI Red Teaming

Read summary Source link

Google Cloud Blog December 4, 2025 guide

Building a Production-Ready AI Security Foundation

Google Cloud outlines a defense-in-depth view of AI security spanning application controls, data protections, and infrastructure isolation.

Agent Security Prompt Injection Adversarial ML

Read summary Source link

Google Cloud Blog June 12, 2025 analysis

Cloud CISO Perspectives: How Google secures AI Agents

Google’s CISO perspective on why agents need a new security paradigm and what changes when models can observe, plan, and act.

Agent Security

Read summary Source link

Google Cloud Blog March 5, 2025 news

Announcing AI Protection: Security for the AI era

Google introduced AI Protection and Model Armor to address prompt injection, jailbreaks, data loss, and multicloud AI workload security.

Prompt Injection Agent Security

Read summary Source link

OpenAI January 23, 2025 framework

Operator System Card

The Operator system card documents red teaming and mitigation choices for a computer-using agent, with prompt injections listed as a central risk area.

Agent Security Model Evaluation Prompt Injection AI Compliance

Read summary Source link

Microsoft Cloud Blog January 14, 2025 analysis

Enhancing AI safety: Insights and lessons from red teaming

Microsoft summarizes lessons from red teaming more than one hundred generative AI products, emphasizing system-level testing, human expertise, and automation.

AI Red Teaming Prompt Injection Agent Security

Read summary Source link

OWASP January 1, 2025 framework

OWASP Top 10 for Large Language Model Applications

OWASP’s GenAI security project remains a practical baseline for teams building or assessing LLM applications and agentic systems.

Prompt Injection Agent Security Adversarial ML

Read summary Source link