Vera benchmark finds high attack success against production agent frameworks
An arXiv paper introduces Vera, a framework for safety testing LLM agents at scale, and reports high attack success rates across several production-style systems.
Read more
An arXiv paper introduces Vera, a framework for safety testing LLM agents from risk discovery to evidence-grounded verification. The authors test agent frameworks including OpenClaw, Hermes, Codex, and Claude Code and report average attack success rates as high as 93.9% in evaluated scenarios. They also release Vera-Bench, a set of 1,600 executable safety cases for comparing agent defenses.
Key details: Vera tests LLM agents from risk discovery through verification, The paper evaluates OpenClaw, Hermes, Codex, and Claude Code, The authors report attack success rates up to 93.9% and introduce Vera-Bench.
Why it matters: Agent products are entering everyday workflows while their tool-use attack surface is still highly exploitable in benchmarked settings.