AgentGuard · by aiXcheck

Get your AI agents' first EU AI Act evidence package in 30 minutes.

Automated red-teaming for AI agents. 130+ attack patterns, 8 categories, behavioral baselining, and a quantifiable Security Score — all in one CLI, API, and CI-friendly platform.

The problem

Your AI agents are running. Your security testing isn't.

Web-app scanners don't speak LLM. General red-team tools don't understand tool use, MCP, or multi-agent pipelines. AgentGuard was built for this gap.

Problem | Impact
Prompt injection in every input | Data leaks, tool hijacking, brand damage
Agents with too much tool access | Excessive agency, unauthorized actions
Multi-agent pipelines pass context insecurely | Privilege escalation across sub-agents
MCP servers introduce a new attack surface | Untrusted tools running in your agent's session
No behavioral baseline → can't detect drift under attack | Incidents invisible in production
EU AI Act Art. 9 & 15 require documented evidence | Fines of up to €35M, with an Aug 2, 2026 deadline
5 pillars

How AgentGuard works

One CLI. One API. Five things it does better than anything else.

130+ attack patterns

Prompt injection, jailbreaks, tool misuse, context manipulation, system-prompt leakage, multi-agent pipeline attacks, MCP exploits, excessive agency.
8 categories

Behavioral baselining

Fingerprint your agent's normal behavior before testing. AgentGuard measures deviation under attack instead of guessing what 'bad' looks like.
Quantified drift
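
To make "deviation under attack" concrete, here is a minimal sketch of one way drift could be quantified. It is not AgentGuard's actual method: the function names, the tool names, and the choice of total variation distance over tool-call frequencies are all assumptions made for illustration.

```python
from collections import Counter


def behavior_profile(tool_calls: list[str]) -> dict[str, float]:
    """Turn observed tool-call names into a frequency distribution (hypothetical helper)."""
    counts = Counter(tool_calls)
    total = sum(counts.values())
    return {name: n / total for name, n in counts.items()} if total else {}


def drift(baseline: dict[str, float], observed: dict[str, float]) -> float:
    """Total variation distance: 0.0 for identical profiles, 1.0 for disjoint ones."""
    keys = set(baseline) | set(observed)
    return 0.5 * sum(abs(baseline.get(k, 0.0) - observed.get(k, 0.0)) for k in keys)


# Benign probes: the agent mostly searches and occasionally summarizes.
baseline = behavior_profile(["search", "search", "search", "summarize"])
# Under attack: the agent starts calling a tool it never used in the baseline.
under_attack = behavior_profile(["search", "send_email", "send_email", "summarize"])

print(drift(baseline, under_attack))  # 0.5
```

A tool that never appears in the baseline (here `send_email`) pushes the distance up without anyone having to define in advance what "bad" looks like.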

Security Score

A single 0–100 score covering coverage, severity, and exploitability — embeddable as a badge on your product page.
Score + badge

EU AI Act reports

Every run produces auditor-ready evidence packages mapped to Art. 9 (risk management) and Art. 15 (accuracy & cybersecurity).
Art. 9 · Art. 15

Black-box & gray-box modes

Test over the wire like an attacker, or with system prompt and tool schema for deeper coverage. CI/CD-friendly CLI + API.
CLI · API · CI
Security Score

One number your CISO, your auditor, and your marketing team can all quote.

AgentGuard's Security Score combines attack coverage, severity weighting, and exploit success rate into a single 0–100 figure. Embed the live badge on your product site the same way you'd embed an SSL rating — only this one actually reflects agentic risk.
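
As a rough illustration of how coverage, severity weighting, and exploit success rate could fold into one 0–100 figure, consider the sketch below. The formula, the severity weights, and the function name are assumptions made for this example; it does not reproduce AgentGuard's published methodology or the scores shown on this page.

```python
# Assumed severity weights, for illustration only.
SEVERITY_WEIGHTS = {"critical": 10.0, "high": 5.0, "medium": 2.0, "low": 1.0}


def security_score(executed: int, total_patterns: int, findings: list[str]) -> int:
    """Combine coverage and severity-weighted exploit success into a 0-100 score.

    executed:       number of attack patterns actually run
    total_patterns: number of patterns available
    findings:       severity label of each successful exploit
    """
    if executed <= 0 or total_patterns <= 0:
        raise ValueError("executed and total_patterns must be positive")
    coverage = min(executed / total_patterns, 1.0)
    # Weighted successful exploits, relative to the worst case where
    # every executed pattern succeeds at critical severity.
    weighted_hits = sum(SEVERITY_WEIGHTS[s] for s in findings)
    worst_case = executed * SEVERITY_WEIGHTS["critical"]
    resilience = max(0.0, 1.0 - weighted_hits / worst_case)
    return round(100 * coverage * resilience)


clean_full_run = security_score(130, 130, [])
clean_half_run = security_score(65, 130, [])
print(clean_full_run, clean_half_run)  # 100 50
```

Multiplying coverage by resilience means a clean run over half the patterns cannot outscore a clean run over all of them, which is one way to keep a partial scan from looking safer than it is.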

See the methodology
Example badge for your-agent.io: AgentGuard · verified · Security Score 87 / 100 · updated Apr 27, 2026
Coverage: 94% · Severity: Low · Exploit: 2/130
<img src="https://badge.agentguard.io/your-agent.svg" />
Attack coverage

8 categories. 130+ real patterns. Zero theory.

Every pattern is derived from real CVEs, published research, and in-the-wild incidents, including LangChain CVE-2025-68664 (CVSS 9.3) and CVE-2024-36480.

Prompt injection: ~25 patterns
Jailbreak: ~20 patterns
Tool misuse: ~20 patterns
Context manipulation: ~15 patterns
System prompt leakage: ~15 patterns
Multi-agent pipeline: ~18 patterns
MCP security: ~12 patterns
Excessive agency: ~10 patterns
Positioning
“A developer uses Promptfoo to evaluate an LLM response. A CISO uses AgentGuard to prove the whole agent is safe to deploy.”
Evaluation frameworks test prompts. Guardrail products filter runtime traffic. AgentGuard is the layer in between: adversarial testing of the complete agent, with evidence.
Compare vs. Promptfoo / Garak / others
Developer experience

One command. Full report.

agentguard · terminal
# Install
$ pip install agentguard

# Scan your agent
$ agentguard scan \
    --target https://api.your-agent.io/chat \
    --mode gray-box \
    --categories prompt-injection,tool-misuse,mcp \
    --output report.html --compliance eu-ai-act

[✓] Baseline captured (42 benign probes)
[✓] 130 adversarial patterns executed
[!] 3 findings: 1 critical, 1 high, 1 medium
[→] Security Score 72/100
[→] Art. 9 + Art. 15 evidence written to ./evidence/
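
For CI use, the scan output can gate a build. The sketch below parses the `Security Score N/100` line exactly as it appears in the sample output above. Whether the real CLI prints this exact line, or offers a machine-readable format or exit code instead, is not stated here, so treat the regex, the function names, and the threshold of 80 as assumptions.

```python
import re

# Matches the score line as shown in the sample terminal output (assumed format).
SCORE_LINE = re.compile(r"Security Score\s+(\d{1,3})/100")


def parse_score(scan_output: str) -> int:
    """Extract the 0-100 score from captured scan output."""
    match = SCORE_LINE.search(scan_output)
    if match is None:
        raise ValueError("no 'Security Score N/100' line found")
    return int(match.group(1))


def passes_gate(scan_output: str, minimum: int = 80) -> bool:
    """Return True when the score meets the (hypothetical) CI threshold."""
    return parse_score(scan_output) >= minimum


sample = """[✓] Baseline captured (42 benign probes)
[!] 3 findings: 1 critical, 1 high, 1 medium
[→] Security Score 72/100
"""

print(parse_score(sample), passes_gate(sample))  # 72 False
```

In a pipeline, the captured output of `agentguard scan` would take the place of `sample`, and the job would exit non-zero when `passes_gate` returns `False`.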
130+ attack patterns, growing weekly
8 attack categories covered
30 min from install to first evidence package
€35M maximum EU AI Act fine, which dwarfs €999/mo

Find out what your agent actually does under attack.

Run AgentGuard against any HTTP agent endpoint. Free tier included.